Message/Author 


dear all, I'm trying to run a two part growth mixture model. I did a lcga for the binary part first. 1.) 3 or 4 class solutions fitted better than a one class or two class solution, but I get bad average post probabilities for the 3 or 4 classes. If I have more than two groups, at least on group has a averagepostprob of 6070 (diagonal of matrix). What is an acceptable value? I thought of 80!? Are there any chances to isolate more than two meaningful and valide groups in a lggmm of boh parts? Is it worth it to proceed? 2.) I went to joint LGMM of both parts. I used the means of the growth factors lcga in the following lgmm as starting values, but it didn't converge. After nearly 2 hrs, I get the following messages: THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY DUE TO A NONZERO DERIVATIVE OF THE OBSERVEDDATA LOGLIKELIHOOD. THE MCONVERGENCE CRITERION OF THE EM ALGORITHM IS NOT FULFILLED. CHECK YOUR STARTING VALUES OR INCREASE THE NUMBER OF MITERATIONS. ESTIMATES CANNOT BE TRUSTED. THE LOGLIKELIHOOD DERIVATIVE FOR PARAMETER 20 IS 0.55772741D01. Any suggestions with regard to model convergence, additional starting values, may be variances? I was inspired by the recent Hser et al. article (2007). I think they used one latent class for both model parts. Is it significantly more computational demanding to use 2 latent class variables for both parts separately? 


1. I would say any entropy value is acceptable in the sense that entropy does not have to do with model fit or the best choice of number of classes. But to be useful for classification, I would think you want higher numbers. 2. Just increase to say miter=1000; as suggested in the error message. Can you give the Hser ref. 


1. o.k., thanks. 2. I will try but I'm nt very optimistic that it works. I'm very interested in isolating a "zerogroup" (structural zeroes). How do I specify such a group? I've heard of threshold = 15 for the binary part and zero mean for intercept and slope of the continuous part. Do I have to add zero means of the growth parameters for the binary part or is threshold = 15 sufficient? regards, michael 


sorry, forgot the reference: Hser, Huang, Chou & Anglin (2007); Trajectories of Heroin Addiction: Growth Mixture Modeling Results based on a 33Year FollowUp Study. EVALUATION REVIEW, 31, 548. They used one latent class variable. And I'm quite interested in the steps needed to let such a complex model converge. 


To make a zero class, fix the thresholds of your outcomes to plus 15 and the growth factor means and variances to zero. If you cannot get your model to converge, send your input, data, output, and license number to support@statmodel.com. 


thank's, I will try further runs and then may be send it to support. 


o.k., I have given it up, it is far too complex and miles away from convergence. I focus on an alternative, namely to isolate groups within the entire model (both parts) on the basis of LCGA, which is often done by people who use SAS (group based approach, Nagin). I know the disadvantages (too many groups may be isolated by LCGA) and it is rough and dirty but may be sufficient for me. Entropy looks very good. 1.)Is LCGA compatible with twopart in general? I only have read an article by Bengt, using LCGA on the binary part (but not on both parts simultanously). 


Twopart modeling requires that the two parts correlate in some way. Generally this is through the growth factors. If you do an LCGA, the growth factor variances are fixed at zero and there can be no growth factor covariances between the two processes. LCGA can be done with a twopart model if the latent class variable influences both parts. The following paper on twopart factor mixture may be of interest. It can be downloaded from our website. # Kim, Y.K. & Muthén, B. (2007). Twopart factor mixture modeling: Application to an aggressive behavior measurement instrument. 


o.k., your first statement was the reason for asking this question :). "LCGA can be done with a twopart model if the latent class variable influences both parts." Sorry, this was a missunderstanding. Bengt used LCGA on the separated binary part to get a clue about the number of groups in general. This was followed by LGMM on both parts. Now I'm separating groups on the basis of both parts simultanously with one class variable in my "Variable" command. Your answer implies, that this seems o.k., albeit having no correlated parts within the classes. 


That's ok. 


is there any way to get the estimated means for my trajectories when using "imputation" for my covariates? 


With TYPE=IMPUTATION, these means are not available. 


thank you again, as always :) So I have to use the means not adjusted for covariates for the visual plot in my paper, if this is not possible with imputation. I want to calculate confidence intervals for these means. Using "cinterval" only gives me cintervals for my model but not for the estimated means. I there any way to get them out of mplus? 


Not that I know of. 


I noticed that the current version 5.2 contains improvements concerning type=imputation. Has anything changed regarding means available in this kind of analysis? This would, for instance, alleviate the exploration of LGMM conditioned on imputed covariates, since sometimes the mean trajectories of the classes look different as compared to unconditional models. So far, the only chance to have a closer look on this issue is with nonimputed conditional models. 


I'm not sure what you mean. Can you be more specific about which means you are talking about. There are no plots available with multiple imputation. 


Sorry, I was talking about 'estimated means' of LGM or LGMM growth curves. Your answer implies that these are still not available in mplus 5.2 when using multiple imputation!? 


The PLOT command is not available for TYPE=IMPUTATION. 

ywang posted on Monday, May 12, 2014  10:44 am



Dear Drs. Muthen, I am interested in twopart growth mixture modeling. I did find two papers on this topic("TwoPart Growth Mixture Modeling" and "Trajectories of heroin addiction: growth mixture modeling results based on a 33year followup study"), but could not find any input example Can you advise where I can find the input examples? Thanks! 


It would be an extension of Example 6.16 in the user's guide. I don't have any other example. 

Mirim Kim posted on Tuesday, March 21, 2017  10:20 pm



Dear Dr. Muthen, After reading the Kim and Muthen (2009), I want to ask you about the way to specify the distribution of your study. Was it the same way in Muthen and Muthen (2002); How to use a Monte Carlo study to decide on sample size and determine power? How can I set the specific parameter? For example, in the Kim and Muthen (2009), the data generation was based on lognormal distribution (M=2, sigma=1). It would be really appreciated if you could share your idea. Thank you. 


Are you doing a Monte Carlo study? 

Mirim Kim posted on Wednesday, March 22, 2017  7:33 pm



Thanks for the reply. Yes, I'm doing on that and try to make skewed data fitted to the twopart model. 

Samuli Helle posted on Tuesday, February 04, 2020  11:52 am



Hi, Is it possible to use latent variables with categorical indicators as twopart DVs? If yes, any code examples available? Thanks! 


I think you would have to use mixtures with one class for zero. See for example: Wall, M. M., Park, J. Y., & Moustaki, I. (2015). IRT modeling in the presence of zeroinflation with application to psychiatric disorder severity. Applied Psychological Measurement. DOI: 10.1177/0146621615588184 view abstract 

Samuli Helle posted on Wednesday, February 05, 2020  1:01 pm



Thanks Bengt! After reading the article, I am unsure that mixture modeling fits to my data (I might well be wrong). So for each child we have asked some questions about his/her grandparents. But since some grandparents were dead we have missing data (nonresponse, not zeros) for the questions for those dead grandparents. Thus, I am wondering whether sample selection model would be more appropriate here? 


I think sample selection models need a lot of good covariates to be successful; not sure you have them here to make it worthwhile. 


No, not many just one... Any other options in Mplus that I could try? I checked your RMA book and in chapter 10 (Missing Data) there is a NMAR selection model where missing data indicator is regressed on Y (Table 10.18). I tried to run a similar model in my case but Mplus did not even produce an output to show what went wrong. The only difference was that I had a latent variable as an outcome (code clip below), variable "miss" being 1 if data on "x1x3" was missing and else 0. MODEL: F BY X1 X2 X3; F ON mf pm pf; miss ON F; 


Shouldn't be a problem. We need to see your full output (maybe you caught a glitch)  send to Support. 

Back to top 