Is type=complex or type=multilevel th... PreviousNext
Mplus Discussion > Multilevel Data/Complex Sample >
Message/Author
 Alex Zablah posted on Thursday, March 10, 2005 - 11:31 am
Hi, hope all is well. I was hoping to get your insight/input on the following:

I have survey data (6 dependent variables) for 300 customers. These 300 customers belong to one of 10 different providers. Hence, the cases are not IID.

I also have survey data from these 10 providers (1 independent variable).

I want to test the relationship between the provider-level independent variable and three of the customer-level dependent variables using structural equation modeling (in Mplus 3.10). Given that my between-level sample size is only 10, that means that multi-level modeling is out of the question. Right?

If I disaggregate the provider level data (i.e. assign the same value for the independent variable to each customer who shares the same provider) would it be appropriate to run the SEM model (n=300) using the type=complex design?

I'd appreciate any feedback/suggestions you could offer.

Best regards,

Alex
 bmuthen posted on Thursday, March 10, 2005 - 4:27 pm
Yes, I think 10 clusters makes 2-level modeling not perform well. Unfortunately, our simulations suggest that type=complex also needs more clusters than 10 - at least 20. So perhaps the only way is to view "provider" as a fixed effect instead of a random effect and use 9 provider dummy variables as covariates. Bayesian analysis using priors is theonly way I have seen that attempts to deal with such a small number of clusters.
 Herb Marsh posted on Monday, April 08, 2013 - 6:33 pm
Why is it no longer possible to use Type=complex to get correct standard errors for analyses are done at level 1 when there are three levels.

For example, results at the student level when students are nested within classes, and classes are nested within schools.
 Linda K. Muthen posted on Tuesday, April 09, 2013 - 9:43 am
COMPLEX TWOLEVEL, THREELEVEL, and COMPLEX THREELEVEL are all available. There have been no changes. I'm not sure I understand your question.
 Herb Marsh posted on Tuesday, April 09, 2013 - 9:14 pm
Linda: Here is what I did and the error message that I got. I recall that it was possible to have two cluster variables when analyses were done only at level 1, but maybe I am mistaken. In any case, why is it apparently not allowed.

cluster is ID_Schl Id_Class ;
...
ANALYSIS: type= complex; ESTIMATOR=MLR;
....


*** ERROR in VARIABLE command
Two cluster variables are allowed for TYPE=TWOLEVEL COMPLEX. Only one
cluster variable is allowed for TYPE=COMPLEX (single level). Limit on
the number of cluster variables reached.
 Linda K. Muthen posted on Wednesday, April 10, 2013 - 6:54 am
We've never allowed more than one cluster variable with TYPE=COMPLEX. You would need to use TWOLEVEL COMPLEX to handle two cluster variables.
 Melissa Kull posted on Monday, April 22, 2013 - 7:45 am
Hello,

I have a three level model with observations (level 1) nested within individuals (level 2) nested within cities (level 3). The data I am using requires sampling weights and we only
have observations at 2 timepoints. From Chapter Nine of the users guide (p. 252) I am not sure whether I should treat this model as TYPE=TWOLEVEL (and treat our two observation points as "time") or TYPE=THREELEVEL and treat our first level as cross sectional? In addition, the outcome we are using is a count variable and it is my understanding that users can't use sample weights with count variables in TYPE=THREELEVEL?

Many thanks,
Melissa
 Linda K. Muthen posted on Monday, April 22, 2013 - 12:09 pm
I would treat this a a TWOLEVEL analysis with data in the wide format. THREELEVEL is not available for count variables.
 anonymous posted on Wednesday, May 08, 2013 - 12:04 pm
Hello,
I'm aware that TYPE=COMPLEX with the cluster option adjusts for non-independence in terms of the chi-square statistic and the standard errors of the model, but not the parameter estimates (parameter estimates are adjusted for with the multi-level modeling). Is it that the TYPE=COMPLEX and cluster option only adjusts for the parameter estimates' significance, but not their magnitude? I'm wondering whether it is appropriate to estimate a model of treatment effects involving children nested within schools using the TYPE=COMPLEX and cluster option.
 Linda K. Muthen posted on Wednesday, May 08, 2013 - 12:17 pm
Parameter estimates are adjusted if the WEIGHT option is used. There is no difference between COMPLEX and TWOLEVEL in this regard.
 Elina Dale posted on Sunday, May 12, 2013 - 9:11 am
Dear Dr. Muthen,

I am wondering about the difference between TYPE=COMPLEX ad TYPE=TWOLEVEL analysis of SEM in MPlus.

In traditional regression modeling, there is a distinction between population average and subject specific models. Population average models such as GEE describe the covariance among clustered observations, whereas SS/hierarchical models explain the source of this covariance. So, the coefficients are interpreted differently: PA model estimates the difference in Y b/n group A with X and group B without X; the SS model the expected change in individual's probability of Y given change in X.

I am wondering if I use TYPE=COMPLEX in my SEM as I have clustered data, the coefficient from my structural model - effect of treatment X on a latent factor F - is it interpreted as PA or SS? In other words, with specification COMPLEX, do we have a population average model or random effects model in MPlus?

Do we need to specify TWOLEVEL to have a subject specific interpretation of coefficients? Thank you!
 Linda K. Muthen posted on Monday, May 13, 2013 - 8:46 am
Subject-specific refers to random coefficients. You would need to use TYPE=TWOLEVEL RANDOM with random coefficients. TYPE=COMPLEX adjusts the standard errors for non-independence of observations.
 Elina Dale posted on Monday, May 13, 2013 - 10:34 am
So, TYPE=COMPLEX is a marginal model?

Are the coefficients interpreted as population average as in marginal models explained in papers by Zeger et al, 1988? It would be helpful to get a bit more explanation as to how some of MPlus specifications relate to more widespread / traditional types of analyses.

Thank you!
 Bengt O. Muthen posted on Monday, May 13, 2013 - 8:46 pm
A single-level regression model (linear or logistic) is a "widespread/traditional type of analysis" - if you have a regression model and use TYPE=COMPLEX you are doing regression analysis and you get your SEs adjusted for complex survey data features. So the interpretation is the usual one for regression modeling. Same for factor analysis. If you have two-level data and don't do TYPE=TWOLEVEL but do TYPE=COMPLEX you get a so called "aggregated" model using terms in well-known complex survey data literature such as the 1989 Analysis of Complex Surveys book edited by Skinner, Holt, and Smith.

GEE is a limited-information estimator, not a full-information maximum-likelihood estimator. You can see the relationship between GEE estimation and the closely related limited-information WLSMV estimation in Mplus in the paper on factor analysis on our website:

Muthén, B., du Toit, S.H.C., & Spisic, D. (1997). Robust inference using weighted least squares and quadratic estimating equations in latent variable modeling with categorical and continuous outcomes. Unpublished technical report.
 Elina Dale posted on Tuesday, May 14, 2013 - 10:14 am
Thank you, Dr. Muthen! So does this mean that TYPE=COMPLEX specifies design-based or model-based analysis?

It is vital for me to understand what this specification implies, so please, forgive my persistence.

Skinner et al. distinguish (A) design vs. (B) model based approaches to analysis. Within model-based approach we have (a) aggregated (marginal) and (b) disaggregated (random effects) models.

"A basic distinction is between design-based and model-based inference.... Aggregated analysis may therefore alternatively be referred to as
marginal modelling and the distinction between aggregated and disaggregated analysis is analogous, to a limited extent, to the distinction between population-averaged and subject-specific analysis, widely used in biostatistics."

Thank you again!
 Bengt O. Muthen posted on Tuesday, May 14, 2013 - 6:13 pm
You may be interested in the chapter:

Muthén, B. & Satorra, A. (1995). Complex sample data in structural equation modeling. Sociological Methodology, 25, 267-316.

The labels you refer to are not always clear cut (at least not to me) so I'll describe what we do instead. With TYPE=COMPLEX we do complex survey SEs using the Hubert-White sandwich estimator. The parameters are the usual single-level parameters. The fact that we can also handle SE calculations based on replicate weights might qualify us for the design-based camp; I am not sure about these distinctions. TYPE=COMPLEX does an aggregated analysis when data are hierarchical (say twolevel) because it doesn't model parameters on both levels. In contrast, TYPE=TWOLEVEL or TYPE = COMPLEX TWOLEVEL does a disaggregated analysis. I discuss the difference in the above chapter in terms of factor analysis.

You can also read more about what we do by reading the papers under our Complex Survey Data section:

http://www.statmodel.com/resrchpap.shtml
 Bengt O. Muthen posted on Tuesday, May 14, 2013 - 6:23 pm
Another useful book is the 2003 Chambers & Skinner Wiley book.
 Elina Dale posted on Wednesday, May 15, 2013 - 5:37 am
Thank you so much, Dr. Muthen! Really appreciate your response, I think I understand now. I have also started looking at Chambers & Skinner 2003 book. Will check out your paper for which you sent me the link.
 Elina Dale posted on Monday, August 12, 2013 - 10:06 am
Dear Dr. Muthen, I have re-read your paper (B. Muthen & A.Sattora, 1995) on complex sample data in SEM and I still have clarifying questions on the procedure used by MPlus when I specify "COMPLEX" in the Analysis.
On pp. 281-288, you describe the aggregated analysis, which Chambers & Skinner (2003) say "may alternatively be referred to as marginal modeling".
I would greatly appreciate it if you could clarify:
1) whether the aggregated approach as described in Muthen & Sattora (1995) is a model or design-based approach to inference, b/c it can be used in either according to Chambers & Skinner (2003);
2) whether "COMPLEX" specification is a model-based aggregated approach.
Last question. Typically, as you say, design-based analysis uses weights in parameter estimation. I wonder if weights are required when using "COMPLEX". Thank you!
 Bengt O. Muthen posted on Tuesday, August 13, 2013 - 10:30 am
1) I see it as a model-based approach

2) I see COMPLEX as a model-based aggregated approach.

Weights are not required when using COMPLEX. For instance, there may be just clustering.
 Elina Dale posted on Wednesday, August 14, 2013 - 8:11 am
Thank you, Dr. Muthen! This is very helpful.
 Christoph Weber posted on Friday, February 14, 2014 - 7:10 am
Dear Dr. Muthen!
I am analysing threelevel data (students, classes, schools). The question is, if a school system reform has an effect on the achievement of students.

8. grade students were tested before the reform was implemented and then 8. grade students after the reform (using the same schools).

I'm using a threelevel model with "reform (0/1)" on the class level and estimate the effect on achievemnt (class level). Is this correct?
Further I wonder why I get a different estimate for "achievement ON reform", when I use type = complex (cluster = class)? The estimate for type complex is equal to the simple mean difference between reform = 0 and reform = 1 using SPSS.

Thanks
christoph weber
 Bengt O. Muthen posted on Friday, February 14, 2014 - 11:57 am
You say that this is a school system reform; isn't your "reform" variable a school-level variable and not a classroom-level variable?

The 3-level model results won't agree with Type=complex with cluster=class because the latter takes only classroom clustering into account. You would do better with Type=Complex Twolevel and define 2 cluster variables: school and classroom (see UG).
 Christoph Weber posted on Friday, February 14, 2014 - 1:55 pm
Thanks, I treat reform as a class level variable, because we have a kind of trend analysis. The same schools were tested two times (8.graders 2008 and 8.graders 2012), thus there is variation of "reform" within the school clusters.

I thought that taking the complex design into account (complex or multilevel) just affects the SE, not the estimates.
Sorry, what does UG mean?

Christoph
 Christoph Weber posted on Friday, February 14, 2014 - 2:01 pm
I get it, users guide
 Christoph Weber posted on Tuesday, February 25, 2014 - 6:10 am
One more question. I read a pdf "Mplus Short courses topic 7 multilevel modeling with ...".
There is a ranom effects ANOVA example comparing type = twolevel with type = complex. Both models yield the same mean and SE. When I compare the two types with my data I get different means. Is this because of different cluster sizes?. The Anova Example uses data with equal cluster size.

thanks
Christoph Weber
 Bengt O. Muthen posted on Tuesday, February 25, 2014 - 12:20 pm
Yes, my results were for equal cluster sizes.
 Christoph Weber posted on Tuesday, February 25, 2014 - 12:32 pm
Will twolevel and complex only yield the same results with equal cluster sizes?
 Linda K. Muthen posted on Tuesday, February 25, 2014 - 1:16 pm
Yes.
 Christoph Weber posted on Saturday, July 19, 2014 - 3:56 am
To come back to the question:
Why do type complex and twolevel yield different means with unequal cluster sizes?
Thanks
Christoph
Back to top
Add Your Message Here
Post:
Username: Posting Information:
This is a private posting area. Only registered users and moderators may post messages here.
Password:
Options: Enable HTML code in message
Automatically activate URLs in message
Action: