I work with a large sample of twins, and I am using the clustered data option (TYPE=complex) to adjust for nonindependence. I am wondering if I should use TYPE=TWOLEVEL instead. What would be the difference? Is one approach better than the other?
My general question is: In which cases do you use TYPE=COMPLEX and in which cases do you use TYPE=TWOLEVEL?
Thank you for your reaction. I had already read that chapter, but unfortunately I am still not able to decide when I should use one approach versus the other. What would you recommend in the case of twins and what would you recommend in the case of siblings and parents (so 3 family members). Or does the choice depend on your specific research question, interests and analyses (I perform GMM), and not so much on the design of the study? I read that the multilevel approach allows random intercepts and random slopes that vary across clusters. Does the cluster-approach not do that?
TYPE=COMPLEX computes standard errors and a chi-square test of model fit taking into account stratification, non-independence of observations due to cluster sampling, and/or unequal probability of selection. TYPE=TWOLEVEL specifies a model for each level of the multilevel data thereby modeling the non-independence of observations due to cluster sampling. So it depends on what you want. Do you want to only correct standard errors and chi-square or do you want a model for the between level. Your research questions should guide you.
Student 09 posted on Monday, March 02, 2009 - 7:34 am
A colleague of mine claims that type= complex is not appropriate if the aim is to examine the effect of a between-level variable on a dependent variable measured at the individual level.
According to his view, the type = complex framework does not take into account that in a twolevel data structure, there a always less observations on the higher- as compared to the lower-level of analysis. His example: If there are n1 = 1000 pupils nested in n2 = 50 schools and one would examine the effect of "school denomination" as level-2 variable on "math achievement" (measured on level 1), type = complex would not be aible to take into account that "school denomination" refers to 50 (level 2)cases, and not to 1000 pupils.
Is this right or wrong? Are there any references available explaining the logic of the Mplus robust s.e. in the type= complex framework?
I believe that the standard errors are correct with TYPE=COMPLEX when some variables are measured on the cluster level and some on the individual level. To be certain, however, you would need to do a Monte Carlo simulation and see if you obtain the correct standard errors.
Student 09 posted on Monday, March 02, 2009 - 11:29 am
Dear Dr. Muthen
I like the idea of a MC simulation, but maybe it would even be sufficient if the formula for the type = complex (using MLR) robust se's would be available, just to see whether the s.e.'s take into account the different N's of level-2 vs. level-1 variables ?
I am currently looking into the random-intercept cross-lagged panel model (RICLPM) (Hamaker, Kuiper, & Grasman, 2015), which allows to separate within - and between variance. As I read above, the difference between type= complex and type = twolevel, is that in the latter you can specify a between-person model.
Now my question is whether the cluster approach would also be appropriate in the context of RI-CLPM, since it allows to disentangle within and between person variance.
Sorry for the follow-up question, but to be certain, it is sufficient to cluster the data without specifying a between model to capture the essence of RI-CLPM? Especially if you're interested in the within-person level, this makes the model easier to handle.
Let me change my answer to say that you don't use type=twolevel or type=complex for the RI-CLPM. It should be done as a single-level, wide-format model as shown in the Hamaker et al Figure 1, right-hand-side. The model already has within- (time) and between (person)-level features, where the random effects kappa and omega for the two outcomes represent the between person variation.
Hi, sorry if this is a silly question, but regarding Bengt's comment on December 24, 2016 - 1:23 pm, I understand that the RI-CLPM has within (time) and between (person) features in a single-level wide-format model. But I thought that by "within" Hamaker and Martjin were referring to the random intercept feature of the RI-CLPM. That is, that random intercepts make it so that parameters are estimated based on the individual's personal "within" change and not a change in their rank order in the group from one time to the next.
Following that, and the example 9.1 of random intercepts, shouldn't RI-CLPMs be entered simply like this?
%WITHIN% x2 y2 on x1 y1; x3 y3 on x2 y2; x1-x3 pwith y1-y3;
Or am I wrong and Omega and Kappa do RI-CLPMs? Thanks.