Mplus Discussion >> Two part mixture

Topics
Last Day
Last 3 Days
Last Week
Tree View

Edit Profile


Two part mixture

Mplus Discussion > Latent Variable Mixture Modeling >

Message/Author

Michael Spaeth posted on Thursday, March 27, 2008 - 11:13 am

dear all,
I'm trying to run a two part growth mixture model.
I did a lcga for the binary part first.

1.) 3 or 4 class solutions fitted better than a one class or two class solution, but I get bad average post probabilities for the 3 or 4 classes. If I have more than two groups, at least on group has a averagepost-prob of 60-70 (diagonal of matrix). What is an acceptable value?
I thought of 80!? Are there any chances to isolate more than two meaningful and valide groups in a lggmm of boh parts? Is it worth it to proceed?

2.) I went to joint LGMM of both parts. I used the means of the growth factors lcga in the following lgmm as starting values, but it didn't converge.

After nearly 2 hrs, I get the following messages:
THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY DUE TO A NON-ZERO
DERIVATIVE OF THE OBSERVED-DATA LOGLIKELIHOOD.

THE MCONVERGENCE CRITERION OF THE EM ALGORITHM IS NOT FULFILLED.
CHECK YOUR STARTING VALUES OR INCREASE THE NUMBER OF MITERATIONS.
ESTIMATES CANNOT BE TRUSTED. THE LOGLIKELIHOOD DERIVATIVE
FOR PARAMETER 20 IS -0.55772741D-01.

Any suggestions with regard to model convergence, additional starting values, may be variances?
I was inspired by the recent Hser et al. article (2007). I think they used one latent class for both model parts. Is it significantly more computational demanding to use 2 latent class variables for both parts separately?

Bengt O. Muthen posted on Friday, March 28, 2008 - 11:43 am

1. I would say any entropy value is acceptable in the sense that entropy does not have to do with model fit or the best choice of number of classes. But to be useful for classification, I would think you want higher numbers.

2. Just increase to say miter=1000; as suggested in the error message.

Can you give the Hser ref.

Michael Spaeth posted on Tuesday, April 08, 2008 - 7:33 am

1. o.k., thanks.

2. I will try but I'm nt very optimistic that it works.

I'm very interested in isolating a "zero-group" (structural zeroes). How do I specify such a group? I've heard of threshold = 15 for the binary part and zero mean for intercept and slope of the continuous part. Do I have to add zero means of the growth parameters for the binary part or is threshold = 15 sufficient?

regards, michael

Michael Spaeth posted on Tuesday, April 08, 2008 - 7:38 am

sorry, forgot the reference:
Hser, Huang, Chou & Anglin (2007); Trajectories of Heroin Addiction: Growth Mixture Modeling Results based on a 33-Year Follow-Up Study. EVALUATION REVIEW, 31, 548.

They used one latent class variable. And I'm quite interested in the steps needed to let such a complex model converge.

Linda K. Muthen posted on Tuesday, April 08, 2008 - 9:54 am

To make a zero class, fix the thresholds of your outcomes to plus 15 and the growth factor means and variances to zero.

If you cannot get your model to converge, send your input, data, output, and license number to support@statmodel.com.

Michael Spaeth posted on Wednesday, April 09, 2008 - 1:08 am

thank's, I will try further runs and then may be send it to support.

Michael Spaeth posted on Tuesday, April 15, 2008 - 10:28 am

o.k., I have given it up, it is far too complex and miles away from convergence. I focus on an alternative, namely to isolate groups within the entire model (both parts) on the basis of LCGA, which is often done by people who use SAS (group based approach, Nagin). I know the disadvantages (too many groups may be isolated by LCGA) and it is rough and dirty but may be sufficient for me. Entropy looks very good.

1.)Is LCGA compatible with two-part in general? I only have read an article by Bengt, using LCGA on the binary part (but not on both parts simultanously).

Linda K. Muthen posted on Tuesday, April 15, 2008 - 11:29 am

Two-part modeling requires that the two parts correlate in some way. Generally this is through the growth factors. If you do an LCGA, the growth factor variances are fixed at zero and there can be no growth factor covariances between the two processes. LCGA can be done with a two-part model if the latent class variable influences both parts. The following paper on two-part factor mixture may be of interest. It can be downloaded from our website.

# Kim, Y.K. & Muth�n, B. (2007). Two-part factor mixture modeling: Application to an aggressive behavior measurement instrument.

Michael Spaeth posted on Friday, April 18, 2008 - 6:46 am

o.k., your first statement was the reason for asking this question :-).

"LCGA can be done with a two-part model if the latent class variable influences both parts."

Sorry, this was a missunderstanding. Bengt used LCGA on the separated binary part to get a clue about the number of groups in general. This was followed by LGMM on both parts.
Now I'm separating groups on the basis of both parts simultanously with one class variable in my "Variable" command. Your answer implies, that this seems o.k., albeit having no correlated parts within the classes.

Bengt O. Muthen posted on Friday, April 18, 2008 - 5:24 pm

That's ok.

Michael Spaeth posted on Saturday, April 26, 2008 - 7:38 am

is there any way to get the estimated means for my trajectories when using "imputation" for my covariates?

Linda K. Muthen posted on Monday, April 28, 2008 - 11:18 am

With TYPE=IMPUTATION, these means are not available.

Michael Spaeth posted on Friday, May 02, 2008 - 6:16 am

thank you again, as always :-) So I have to use the means not adjusted for covariates for the visual plot in my paper, if this is not possible with imputation.
I want to calculate confidence intervals for these means. Using "cinterval" only gives me cintervals for my model but not for the estimated means. I there any way to get them out of mplus?

Linda K. Muthen posted on Saturday, May 03, 2008 - 9:17 am

Not that I know of.

Michael Spaeth posted on Thursday, December 18, 2008 - 6:07 am

I noticed that the current version 5.2 contains improvements concerning type=imputation. Has anything changed regarding means available in this kind of analysis? This would, for instance, alleviate the exploration of LGMM conditioned on imputed covariates, since sometimes the mean trajectories of the classes look different as compared to unconditional models. So far, the only chance to have a closer look on this issue is with non-imputed conditional models.

Linda K. Muthen posted on Thursday, December 18, 2008 - 3:17 pm

I'm not sure what you mean. Can you be more specific about which means you are talking about. There are no plots available with multiple imputation.

Michael Spaeth posted on Monday, January 05, 2009 - 3:03 am

Sorry, I was talking about 'estimated means' of LGM or LGMM growth curves. Your answer implies that these are still not available in mplus 5.2 when using multiple imputation!?

Linda K. Muthen posted on Monday, January 05, 2009 - 6:49 am

The PLOT command is not available for TYPE=IMPUTATION.

ywang posted on Monday, May 12, 2014 - 10:44 am

Dear Drs. Muthen,

I am interested in two-part growth mixture modeling. I did find two papers on this topic("Two-Part Growth Mixture Modeling" and "Trajectories of heroin addiction: growth mixture modeling results based on a 33-year follow-up study"), but could not find any input example Can you advise where I can find the input examples?

Thanks!

Linda K. Muthen posted on Tuesday, May 13, 2014 - 10:07 am

It would be an extension of Example 6.16 in the user's guide. I don't have any other example.

Mirim Kim posted on Tuesday, March 21, 2017 - 10:20 pm

Dear Dr. Muthen,

After reading the Kim and Muthen (2009), I want to ask you about the way to specify the distribution of your study.
Was it the same way in Muthen and Muthen (2002); How to use a Monte Carlo study to decide on sample size and determine power?
How can I set the specific parameter? For example, in the Kim and Muthen (2009), the data generation was based on log-normal distribution (M=-2, sigma=1).
It would be really appreciated if you could share your idea.

Thank you.

Bengt O. Muthen posted on Wednesday, March 22, 2017 - 6:58 pm

Are you doing a Monte Carlo study?

Mirim Kim posted on Wednesday, March 22, 2017 - 7:33 pm

Thanks for the reply.
Yes, I'm doing on that and try to make skewed data fitted to the two-part model.

Samuli Helle posted on Tuesday, February 04, 2020 - 11:52 am

Hi,

Is it possible to use latent variables with categorical indicators as two-part DVs? If yes, any code examples available?

Thanks!

Bengt O. Muthen posted on Tuesday, February 04, 2020 - 5:17 pm

I think you would have to use mixtures with one class for zero. See for example:

Wall, M. M., Park, J. Y., & Moustaki, I. (2015). IRT modeling in the presence of zero-inflation with application to psychiatric disorder severity. Applied Psychological Measurement. DOI: 10.1177/0146621615588184
view abstract

Samuli Helle posted on Wednesday, February 05, 2020 - 1:01 pm

Thanks Bengt! After reading the article, I am unsure that mixture modeling fits to my data (I might well be wrong). So for each child we have asked some questions about his/her grandparents. But since some grandparents were dead we have missing data (non-response, not zeros) for the questions for those dead grandparents. Thus, I am wondering whether sample selection model would be more appropriate here?

Bengt O. Muthen posted on Thursday, February 06, 2020 - 3:17 pm

I think sample selection models need a lot of good covariates to be successful; not sure you have them here to make it worthwhile.

Samuli Helle posted on Friday, February 07, 2020 - 6:32 am

No, not many just one... Any other options in Mplus that I could try? I checked your RMA book and in chapter 10 (Missing Data) there is a NMAR selection model where missing data indicator is regressed on Y (Table 10.18). I tried to run a similar model in my case but Mplus did not even produce an output to show what went wrong. The only difference was that I had a latent variable as an outcome (code clip below), variable "miss" being 1 if data on "x1-x3" was missing and else 0.

MODEL:
F BY X1 X2 X3;
F ON mf pm pf;
miss ON F;

Bengt O. Muthen posted on Friday, February 07, 2020 - 2:46 pm

Shouldn't be a problem. We need to see your full output (maybe you caught a glitch) - send to Support.