I have in a first step done a LCA where a six cluster solution revealed the best fit of data. In the next step I want to analyze the relation between these clusters and sickness absence measured in days. Sickness absence has a preponderance of zero values and the distribution of values above zero is skewed to the right. I decided to use the data twostep in MPlus in order to do a zero-inflated gamma regression but the beta coefficients for my clusters make no sense in the logistic regression part, e.g. one cluster, according to data, have a larger proportion of zero’s compared to the reference cluster but MPlus gives the opposite. I have found a reference how a zero-inflated gamma regression is carried out in SAS, syntax is written by Dale McLerran and have been referenced by others. Applying that code in SAS gives beta coefficients that fit my data better. Specifically, the difference in beta coefficients from the logistic regression is merely a shift of + or – sign, the magnitude is exactly the same, for every cluster between MPlus and SAS whereas the beta coefficients from the gamma regression, reminded of each other. Thus, the magnitude of intercept and the beta coefficients for clusters respectively in MPlus were 113%, 67%, 66%, 71%, 64%, 57% compared to the ones obtained from SAS. Why are there differences in either magnitude or sign for beta coefficients between softwares?
Note that 2-part modeling is not the same as zero-inflated modeling; the latter is a mixture model and the former is not. So the results should not be expected to be the same, only similar. This is described in chapter 7 of our new book
Note also that the binary part of 2-part modeling in Mplus describes the probability of not being at zero. Zero-inflated modeling typically describes the probability of being in the zero class. The latter is the case for example using the censored-inflated model in Mplus.
Also, we request that postings are limited to one window. Longer questions should be sent to Mplus Support.