In a recent conversation with Linda, we were alerted to a potential problem with empirical identification (e.g., the data could not support the model and we were finding what probably were local minima). We have adjusted our model somewhat (a growth model and the reason I am writing in this section of the board) and find we still cannot obtain convergence when we use the program's default start values. After some work with adjusting start values, we are able to obtain convergence. As a check, we have been changing the start values (all except one set at the final solution and making one start value equal to the final solution plus a one-half standard deviation adjustment). Is this sufficient to check for local minima or should we do something more drastic, such as set new start values for several parameters simultaneously with the above approach?
I would suggest changing the centering to see if you also get the same chi-square. This in effect changes the starting values. Perhaps just change the centering to the second time point to not make such a big change as when you center at the average time point.
I'm new to MPlus and LGM. I've fit a simple LGM to 12 time points (n=41). This worked fine (if the fit could be better) while I didn't ask for the latent variable means [level trend], but no SE's were calculated when I did because 'the model may not be identified'. If non-identification is indeed the problem, how might the latent variable mean estimates be involved? Otherwise, things to try?
I would need to see your output to answer your question. Please send it to firstname.lastname@example.org. If you have not asked for TECH1 in the OUTPUT command, please add that and rerun it before you send it to me.
yesterday, a collaborator approached me with a problem, which I thought was quite puzzeling: He had analyzed the growth in depression with three time points. A linear growth model over three time points without covariates has nine free parameters in the unrestricted H1 model (three means and six variances-covariances). For the one class H0 growth model 8 parameters are required (two means, 2 variances, 1 covariance, 3 residual variances). This leaves one degree of freedom and the model is identified. Surprisingly enough, he was able to run models with as many as five classes, which all converged perfectly (with decreasing BIC and fantastic entropy). The question is: How is that possible? I would appreciate any comments. On a second note, what do you think of using covariates to increase the number of df to fit more complex model and therefore outweigh the restrictions due to smaller number of available time points.
The way you are thinking about degrees of freedom does not apply to mixture models where the information does not come only from the covariance matrix and mean vector but also from the raw data. See, for example, the Fisher iris data mixture example on the Mplus website.
Along this line, using covariates to increase the degrees of freedom is not necessary. Covariates in the model can help in finding classes however.
thank you for your response. I was wondering about the formula commonly used to determine the df in a growth mixture model. I am not quite sure that I understand when you say that the information not only comes from the covariance matrix and mean vector but aslo from the raw data. Could you elaborate on that? Finally, if the above is true what is the base for the advice to have at least four time points for a growth mixture model? Thank you for comments.
There is no formula for determining the degrees of freedom in a growth mixture model. With growth mixture models, the model is not fit to the mean vector and covariance matrix. These are not sufficient statistics for this type of model. There is no unrestricted model as there is in a random coefficient growth model. Therefore, no chi-square and degrees of freedom can be estimated. You can think of it like LCA and LPA. With LCA, there is an unrestricted model. It is the contingency table of the latent class indicators. Therefore, degrees of freedom and a chi-square can be estimated. In LPA, this is not the case because there is no unrestricted model.
The advice about four time points is for a growth model not a growth mixture model. Four time points are not required but are recommended to give modeling flexibility. This same recommendation applies to each class of a growth mixture model.
Growth curve models are fit to mean and covariance matrices. Growth mixture models are not. Mixture models do not work with a normality assumption for the observed variables and therefore mean and covariance matrices are not sufficient statistics for the estimation of such models. Raw data are needed. Identification in mixture models is discussed to some extent in Muthen and Shedden (1999).
Anonymous posted on Thursday, January 20, 2005 - 11:29 am
I'm runing a growth curve model with 5 data points assessed at age 5,7,10,14, and 17. One of the reviewers has suggested that we center at the last data point. I first ran the linear model (fixed the slope factor loadings at -12, -10, -7, -3 and 0) but didn't fit very well, so I tried a linear spline model (fixed the last two time points and freed the last three). The model worked fine (significantly improved) but the direction of the slope changed from positive to negative in the linear spline model. From the average mean values over time, I think the slope should be positive... Do you have any idea what's happening here? Thank you so much.
BMuthen posted on Thursday, January 20, 2005 - 8:13 pm
Try fixing the first to minus one and the last to zero and have them free in between. The slope then refers to the change from the first to the last time point.
Anonymous posted on Friday, January 21, 2005 - 3:29 am
I am new to Mplus and do not know how to modify my model/analysis in order to sort out the problem of a non postive def. matrix and achieve convergence in my growth mixture model.I tried modifying the starting values - running a 1 class model and using the values from there as starting values for my 3- classes model by didn't work.I also tried increasing the STARTS values.In the output I got a message saying "problem involving parameter 13" [ my model is: i BY s0@0s2@2; s q BY s2@2s7@7s9@9s13@13; i s q c#1 c#2 on ag;] can you tell me which parameter is parameter 13? thanks !
TECH1 will tell you which parameter is number 13. However, you syntax does not look like a growth model. See Chapter 6 of the Mplus User's Guide for growth model examples. Also see Chapter 16 which describes the growth modeling language. If you continue to have problems, send your complete output and data to email@example.com.
I'm running some hefty growth modeling using this rather amazing Mplus 3.1. Quite often the message ".Variance covariance .not trustworthy.. Condiotion number xxx. Problem involving parameter 40" shows up, even with a terminated process with convergence, fits, etc. Two questions: What is the optimal value for the Condition number? A read about it in the manual, but couldn't find it again(by the way, in the sentence above D is used for marking the exponent, later on E is used). What do you actually do with this parameter 40? I've got a third question: after a normal terminated process, a lot of warnings pops up, all of them telling you about problems with residual variances and the parameter in question. I don't quite understand the significans of this, as the process terminated normally, giving fits, entropi, mean, SE etc.
bmuthen posted on Saturday, February 26, 2005 - 5:38 pm
The condition number is computed in connection with the SEs of the model and relates to the identification of the model. The number is the ratio of the lowest to the highest eigenvalue of the information matrix. That statement may not be accessible in non-technical terms, but a value equal to or close to zero means that the model is not identified and therefore has problems. "Close to zero" is about 1 to the power of -10 in Mplus. You don't want small values that go below that. When it refers to parameter 40, Tech1 will tell you which one that is and the task is then to figure out why the model isn't identified when this parameter is included in the model. If you don't see the problem, send your output to firstname.lastname@example.org.
Warnings about residual variances have to do with negative variances or correlations that are not in the -1, 1 range. If you don't see the problem, again send to support.
I am a non-sophisticated user of MPlus, trying to look at growth mixture modeling with multiple groups. With the Users Guide I determined I should be using the KNOWNCLASS statement to identify the groups (in my case, 3 race/ethnicity groups). However, as I attempt to build my model, moving from one latent class in addition to the 3 race classes to a 2 X 3 class model, I get error messages saying model estimation did not terminate normally - change my model or starting values. I see the examples of how to specify starting values for each class in the Users Guide, but don't know how to begin to determine which starting values I would specify. Is there another source where I could read more on this topic?
I'm guessing that the message may have more to do with the model than the starting values. You may want to fit a growth model for each ethnicity group separately much like you would do a factor analysis for each group separately as a first step. You might want to read the Muthen paper in a book edited by Kaplan. This paper can be downloaded from the website.
quick question: I am using MPlus to estimate GGMM for antisocial behaviors in boys and girls. I find, for girls, the VUONG-LO-MENDELL-RUBIN LIKELIHOOD RATIO TEST FOR 2 (H0) VERSUS 3 CLASSES (p = .65) whereas the VUONG-LO-MENDELL-RUBIN LIKELIHOOD RATIO TEST FOR 3 (H0) VERSUS 4 CLASSES (p = 0.0426).
Model estimation appears to have terminated normally for each model (i.e., the loglikelihoods were replicated).
New but related question. I now have two LRTs: 1) VUONG-LO-MENDELL-RUBIN LIKELIHOOD RATIO TEST FOR 1 (H0) VERSUS 2 CLASSES (p=.08), and 2) PARAMETRIC BOOTSTRAPPED LIKELIHOOD RATIO TEST FOR 1 (H0) VERSUS 2 CLASSES (p=.0000).
Is it a concern the two LRTs do not agree?
Are there additional checks for model stability in Mplus?
TECH11 and TECH14 can disagree. In the following paper which is available on the website, TECH14 came out best:
Nylund, K.L., Asparouhov, T., & Muthen, B. (2006). Deciding on the number of classes in latent class analysis and growth mixture modeling. A Monte Carlo simulation study. Accepted for publication in Structural Equation Modeling.
See the following paper which is also available on the website for suggestions of how to assess model fit:
Muthén, B. (2004). Latent variable analysis: Growth mixture modeling and related techniques for longitudinal data. In D. Kaplan (ed.), Handbook of quantitative methodology for the social sciences (pp. 345-368). Newbury Park, CA: Sage Publications.
We give the warning whenever there are more than two classes. You have replicated your loglikelihood so this does not apply to you.
The fact that you have r-squares of zero make me think you must have fixed some parameters to make this happen. Otherwise, you would need to send the full output and your license number to email@example.com
zhenli posted on Monday, April 02, 2007 - 12:30 am
Dear Dr. Muthen when we fit a latent growth model, we usually fit two slope loadings at 0 and 1 to identify the model no matter linear, free estimate, or quadratic. I wonder can we fit these two slope loadings at other values. why we fit one of the slope loadings at 1. is that because we need to scale the latent variables (intercept and slope factors)?
It is not necessary to use zero and one as time scores. The intercept growth factor is defined at the time score of zero so it is convenient to have that as the first time score if you are interested in initial status or the last if you are interested in final status etc. Regarding the other time scores, they need only reflect the distance between measurement occasions. So it can be 0 1 2 or 0 .1 .2.
This message will be in two posts: I am new to GMM and am trying to fit a GMM for weight status (Body Mass Index) data. I first fit a simple model with the default constraints across classes and found a three class solution was the best fit. But, within this model I saw that the residual variances jumped quite high for the last time point. A colleague who has a bit more experience with these types of analyses suggested I examine the distributions within each class for normality (I did, the distributions didn't look too skewed) and try rerunning things with unconstrained variances. I did, and although the three class model fit better than the constrained version, the four class model gave me the following error message: THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY DUE TO A NON-ZERO DERIVATIVE OF THE OBSERVED-DATA LOGLIKELIHOOD.
THE MCONVERGENCE CRITERION OF THE EM ALGORITHM IS NOT FULFILLED.CHECK YOUR STARTING VALUES OR INCREASE THE NUMBER OF MITERATIONS.ESTIMATES CANNOT BE TRUSTED. THE LOGLIKELIHOOD DERIVATIVE FOR PARAMETER 10 IS 0.15289800D-04.
I also tried running the model with class-specific covariation between s and i, which gave me a slightly better fitting model for 3 classes, but more error messages for 4 classes about an ill-conditioned and non-positive definite fisher information matrix and not being able to calculate S.E.s. I am not sure where to go from here. Do these error messages with the unconstrained models mean: (1) that the constrained model fits the best? (2) that the 3 class unconstrained model fits better than the 4 class? (3) that I am doing something wrong or need to try something different in setting up the unconstrained models?
I should add that the unconstrained models also had the same pattern of increasing residual variance at the later time points, and it was more dramatic within each class when I let the variances be class-specific. Is this something I should be concerned about, or is it something that would be expected with longitudinal data?
Apologies for such a long question; thank you for any assistance you can provide.
If a post does not fit in the space provided in one window, it is too long for Mplus Discussion. Please do not double post in the future.
A residual variance can be larger at one timepoint. This does not necessarily indicate a problem.
Estimating a model with class-varying variances can be problematic. I would run the model with the variances held equal across classes and ask for plots using the PLOT command. I would look at the estimated mean and observed individual plot to see if it appears that there are different variances in the classes. If one class has a smaller or larger variance than the others, I would free the variance for that class.
None of what you are seeing says anything about model fit.
Joan W. posted on Tuesday, October 14, 2008 - 8:37 am
Dear Dr. Muthen,
I have a question with regard to your recent post on June 10, 2008 on degrees of freedom. When you say degrees of freedom are relevant when means, variances and covariances are sufficient statistics for model estimation, are you referring to both latent class models with continuous as well as categorical indicators? I asked this because I have frequently read papers on how to calculate degrees of freedoms for latent class analysis with categorical variables, and in those papers, df=R-q-1, where r is equal to the number of unique response patterns, and q is equal to the number of free parameters.
A related question to the degrees of freedom is the absolute goodness-of-fit index. I remember i read an earlier post from mplus discussion, saying that because there are no sufficient statistics and no unrestricted model in mixture models, there is no chi-square statistics. I wonder whether this is applicable to continuous indicators as well as categorical indicators too. Likewise, I've seen papers discussing chi-square statistics as a goodness-of-fit index for analyzing contingency tables by comparing the estimated frequency with the observed frequency in each cell. I'm just confused...
For latent class analysis and categorical outcomes, the H0 model has q parameters, the H1 model has R-1 parameters where R is the number of cells in the multiway frequency table so the degrees of freedom is R - q - 1. Chi-square tests are available to test the fit of the observed versus expected cells frequencies. These tests are usually not useful when there are more than 8 latent class indicators.
For continuous latent class indicators, a frequency table test is not relevant. No chi-square test is available.
microlily posted on Saturday, December 13, 2008 - 9:59 am
Dear Dr. Muthen: I am running a-57-cases quadratic growth model assessed posttreatment pain at 7, 10, 30, and 90 hours. I got "NO CONVERGENCE. NUMBER OF ITERATIONS EXCEEDED". Is it because a small sample or how should i do to make data convergence? Thank you so much.
ywang posted on Wednesday, August 05, 2009 - 12:37 pm
Dear Drs. Muthen:
I have a question about the LGM with three time points on continuous indicator variables.
Mplus kept showing information like " THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE CONDITION NUMBER IS -0.186D-16. PROBLEM INVOLVING PARAMETER 11".
but it also showed "THE MODEL ESTIMATION TERMINATED NORMALLY"
I kept fixing the problems. However, after I fixed one problem, e.g. by fixing the variance at 0, it would tell me there is another problem involving a different parameter. What can I do with this?
Dear Dr. Muthen I am new to Mplus. I am trying to fit a quadratic growth curve model. The outcomes are binary (0 1) responses. i first fitted the linear model and i did not have any problems. but when i try to fit the quadratic model i get this message:
THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL. PROBLEM INVOLVING PARAMETER 23. THE CONDITION NUMBER IS 0.430D-10.
FACTOR SCORES WILL NOT BE COMPUTED DUE TO NONCONVERGENCE OR NONIDENTIFIED MODEL.
I am working with a colleague to fit a growth model with observed affect data at 5 time points. An binary event occurred at the 3rd time point, and so we set the T3 intercept/time @ 0. We generated temporal offsets (measured in hours) for the remaining four variables (V1,V2,V4,V5), reflecting the number of hours removed those observations were from the event of interest. Offsets prior to the event (T1 & T2) are negative while offsets following the event (T4 & T5) are positive.
Our theory and findings from repeated measures ANOVA suggest that a positive quadratic is found among individuals experiencing the event. However, we have been unable to get the model to converge. We have managed convergence when we use simple time points (-2, -1, 0, 1, 2) and/or drop the quadratic term… but the quadratic is central to our hypotheses and reviewers are requiring us to parameterize time in terms of actual time before/since the event.
I'm concerned that the sample is simply too small (N = 36) to effectively fit a model of this type?
Thanks in advance for your help! Sincerely, Nicole
Dear Dr. Muthen, I'm trying to run a simple latent growth model with 3 timepoints. There is a problem in model identification because of a non-positive def. matrix error which shows up as a negative slope variance. I am not sure how to handle this. Fixing at least the residual variances of the first/second or second/third timepoint for a common estimation yields a model identification at last. Do you have any idea what the problem is? Kind regards,
A linear growth model with a continuous outcome and three time points is identified. I would have to see the problem to say what the message means. Please send the output and your license number to firstname.lastname@example.org.
I'm conducting several LGM. In some models I get the "Nonconvergence" Message from Mplus. If I fix the residual variance of some parameter X to a small positive amount (.01) the analysis works. But I cannot theoretically justify that way of handling it, because that parameter X does not seem to have a different residual variance compared to other parameters Y and even in the data there is nothing, that would point to that special parameter. My question is: Is this an arguable method or should I try to find some other solution? I hope the question is not answered somewhere else, I could not find any matching posts. Thank you in advance Best regards Till
thank you very much for your fast response. I would like to engross the question a little bit more. I'm examining the impact of pesonality on life satisfaction. For that purpose I specifiy the model described above first for three factors F1 F2 F3 with F1 by n1 n2 n3; F2 by e1 e2 e3; F3 by g1 g2 g3; and i s on F1 F2 F3; and then once for every Factor alone. I have the CONVERGENCE Problem for two of the three models with only one factor, but not for the basismodel and I have problems interpreting that. I suppose that if the problems were due to missspecification, none of the models should work. In my interpretation I assume that the problem arises in part due to the very small sample size (N=99) but still I do not understand why it works sometimes and sometimes not. Is there, in your opinion, any possible explanation? Thank you very much for your time! Best regards Till
Xiaolu Zhou posted on Monday, January 30, 2012 - 10:49 am
I am comparing two SEMs with Mplus. The one is 3 latent factors related to X latent factor. The other is the same 3 latent factors related to Y/Z latent factors. X represents the 1-factor structure of C scale. Y and Z represent the 2-factor structure of the same C scale. The SEM for the X latent factor is good. However, I can not get the model fit of the SEM for the Y/Z factors. The output showed that MAXIMUM LOG-LIKELIHOOD VALUE FOR THE UNRESTRICTED (H1) MODEL IS -1344.889. NO CONVERGENCE. NUMBER OF ITERATIONS EXCEEDED. Could you please tell me what account for this? Thanks a lot!
I have four waves of data and I am estimating a quadratic 5 class GMM. I have found that a four class model fits the data well, but a five class model seems to have even better “fit” based on the BIC, best LL value, and the LMR. However, for this 5-class model, the estimates for the means of the growth factors are completely difference than the estimates I found for a 4-class model. In fact, when I looked at the estimated means graph for the 5 class model, it looked completely different from the one for a 4-class. It is my understanding that in GMM, whenever a model is estimated with a new class, the new class would be class #1 in the new model and that the other classes should remain pretty much the same. Is this correct?
I have found that after constraining the growth factor variances (i s q) to zero, as in latent class growth analysis, the estimates for the means of the growth factors appear to be more consistent with those of the four class model (with small differences) and include the new class. The only problem is that the BIC is higher than that for a 4 class (for which I set q@0). Would it make sense to choose the 5 class model with i-q set at zero as the correct model?
No, it is not correct that the new class would be number 1.
No, it would not make sense to choose the 5-class model and fix i-q at zero.
No, it is not possible that four waves are not enough for a 5-class model. I would use the 5-class model with free growth factor variances.
Please limit your posts to one window in the future.
iolivares posted on Monday, April 23, 2012 - 5:14 pm
Thank you for your prompt response. I apologize for my multiple posts earlier. I think I am misunderstanding something, which may be basic. If growth factor estimates (means) for a 5 class model with better fit are completely inconsistent with those of a 4 class model, would it still make sense to choose the 5 class model? I am making the assumption that models with k classes have the same growth estimates as one with k-1 classes with the exception of the new class and some minor variations in estimates for the old classes which may occur as a result of extracting the new class.
This is not true. A five-class solution does not have the four-class solution with one new class. It has 5 new classes which may or may not resemble the classes from the four-class solution.
iolivares posted on Monday, April 23, 2012 - 5:29 pm
Thank you! That certainly makes sense!
lam chen posted on Tuesday, May 22, 2012 - 3:50 am
Dera Dr. Muthen,
I am modeling a GMM with four observation points.When it is two classes,convergence is gotten.However, once improving the number of class, PSI matrixs are not positive identificated. How should I slove this problem? I also found that after adding covariates, convergence was gotten.I guess the reason is that there not enough observation points to model the 3 classes and abouve GMM,is it?
I have been trying to fit a 2nd order growth curve to data with three time points, with 3 indicators per time point. Each time I receive an error message stating that the model may not be identified, and that there is an error with one of the parameters (the alpha matrix of the intercept; this has happened with two separate models).
I've tested and imposed measurment (scalar) invariance across time, and have set intercepts to zero for first order constructs.
Any ideas as to why this model might be unidentified?
I am running a piecewise growth model using a cohort sequential study with time as age. The model runs for the total sample of 662 and for males only (n=320). But I cannot get the model to converge for females (n=335)
The error I get is NO CONVERGENCE. SERIOUS PROBLEMS IN ITERATIONS. CHECK YOUR DATA, STARTING VALUES AND MODEL.
Do you have an suggestions for what might be wrong? or what I should try?
Im having some trouble with a LGCA...when I test for a one class model, I keep getting an impossible mean value for one of my variables.
I have checked my SPSS data and MPLUS data file and I cannot find any problems there....the minimum value for this variable is 0, and the maximum is 5...but the output gives me a mean of 13.7 for this variable. Is there anything I can do to find the problem?
I have a question concerning the fixing of growth parameters. When I see in my diagramm that the trend of my data points go from left above to right down then the parameters must be fixed negative like:
The time scores should be positive and reflect the distance between the measurement occasions. The sign of the trend will be seen in the mean of the slope growth factor. See the Topic 3 course video and handout for a discussion of these issues and other types of growth curves.
When I run it, "the model estimation does not terminate normally due to a change in the likelihood during the last e step". After I increase MIterations and MConvergence, I get the same error and: SLOW CONVERGENCE DUE TO PARAMETER 24, which is the PSI of im.
You should build the model up in parts to see when the problem occurs. I would fit each growth model separately to see if there are any problems there. If not put them together. Then add ON statements and then the interactions.
In order to assess program effects (N = 126 RCT: 1 experimental group, N = 70 and 1 control group, N = 56), I'm conducting a two-part latent growth model for a semicontinuous outcome measure (1 part: a binary variable and 2nd part: a continuous variable). Unfortunately, I could not get fit indices (chi-square test) for the experimental group (I conducted seperate analyses for each group). Also, after trying several options, I've got several warnings (NEGATIVE VARIANCE/RESIDUAL VARIANCE FOR A LATENT VARIABLE), based on these input commands:
Dear Prof. I have a problem with tech11 in a cubic GMM. I ran the model from 1 to 5 classes and kept the intra-class variance fixed for the Q and Cu term. When I ran the 5-class model, I got this in TECH11:
Mean ***************** Standard Deviation ***************** P-Value 0.1752 I didn't receive any warnings. Can the P-values be trusted even though the mean and SD can't be computed? To solve the problem, I've tried to use both the optseed function (i have used number of seeds from best loglikelihood found previously) and increased the number of miterations, but this gives me a new error:
THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY DUE TO A NON-ZERO DERIVATIVE OF THE OBSERVED-DATA LOGLIKELIHOOD.
THE MCONVERGENCE CRITERION OF THE EM ALGORITHM IS NOT FULFILLED. CHECK YOUR STARTING VALUES OR INCREASE THE NUMBER OF MITERATIONS. ESTIMATES CANNOT BE TRUSTED. THE LOGLIKELIHOOD DERIVATIVE FOR THE FOLLOWING PARAMETER IS -0.12168984D-04: Parameter 32, %C#5%: [ CU ].
And all fitestimates changes. I'm not sure what i should do next?