Dear Drs, I am running a CFA on data from 2 separate studies. In each study the data were collected across a number of teams. I feel I should account for clustering at the 'study' and 'team' levels, as not doing so may bias the estimates and standard errors (correct me if I am wrong). This is the script I wrote:
cluster = study team;
USEVARIABLES are bQPR1-bQPR22;
ANALYSIS: TYPE = TWOLEVEL COMPLEX;
estimator = mlr;
MODEL: %within%
TOT by bQPR1-bQPR22;
OUTPUT: stdyx;
I only specified the %within% part because the items in the CFA are questionnaire items answered by the individual participants, so I assumed there is no need to specify the %between% part. Is the script OK? Many thanks!


If your studies consist of independent samples of individuals, you would treat this as a multiple-group situation, not as clustering/multilevel. Even if you measure variables for individuals, these variables can have between-level variation. This means that you want to specify a between-level model such as a two-level factor model (see UG examples).
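As a rough illustration of what a two-level factor model with both parts specified can look like, here is a minimal sketch. It is not taken from the thread: the file name mydata.dat and the variable names team and y1-y6 are hypothetical placeholders, and the single-factor structure on each level is only an assumption for the example.

```
TITLE:    Sketch of a two-level CFA, individuals nested in teams
DATA:     FILE = mydata.dat;        ! hypothetical file name
VARIABLE: NAMES = team y1-y6;       ! hypothetical variable names
          USEVARIABLES = y1-y6;
          CLUSTER = team;           ! team is the clustering variable
ANALYSIS: TYPE = TWOLEVEL;
          ESTIMATOR = MLR;
MODEL:    %WITHIN%
          fw BY y1-y6;              ! factor for within-team variation
          %BETWEEN%
          fb BY y1-y6;              ! factor for between-team variation
OUTPUT:   STDYX;
```

The point of the %BETWEEN% part is that item responses from individuals can still differ systematically across teams, and that variation gets its own factor model.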

Hi Bengt, Thanks for your reply. The samples from the two studies are independent, so are you suggesting that, using a multigroup approach, I: 1. test for measurement invariance; and, if the assumption of invariance holds, 2. run the CFA on the data from the two studies? If so, I should not worry about specifying study as a cluster. Please correct me if I am wrong (which UG examples should I look at?). Thanks again!

One more question: could this not simply be a case that requires multilevel CFA?


I meant that "study" can be treated as multiple-group. "Team" would still call for multilevel, I think. A paper that deals with both issues is on our website: Muthén, B., Khoo, S.T. & Gustafsson, J.E. (1997). Multilevel latent variable modeling in multiple populations. Unpublished technical report.
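A minimal sketch of how the two ideas might be combined, with study as the grouping variable and team as the cluster variable, is given here. The variable names (study, team, y1-y6), the group labels s1 and s2, and the file name are hypothetical, and the sketch assumes that team clusters are distinct across the two studies. It relies on the usual Mplus multiple-group convention that factor loadings are held equal across groups unless freed in a group-specific MODEL command, which should be checked against the User's Guide for the two-level case.

```
TITLE:    Sketch of a two-level CFA with study as a grouping variable
DATA:     FILE = mydata.dat;                  ! hypothetical file name
VARIABLE: NAMES = study team y1-y6;           ! hypothetical variable names
          USEVARIABLES = y1-y6;
          GROUPING = study (1 = s1 2 = s2);   ! hypothetical group labels
          CLUSTER = team;
ANALYSIS: TYPE = TWOLEVEL;
          ESTIMATOR = MLR;
MODEL:    %WITHIN%
          fw BY y1-y6;
          %BETWEEN%
          fb BY y1-y6;
MODEL s2: %WITHIN%
          fw BY y2-y6;   ! frees these within loadings in study 2; y1 stays fixed at 1
```

Running the model with and without the MODEL s2 section gives a less and a more restrictive model whose fit can be compared as one way to examine invariance of the within-level loadings.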

Thanks Bengt, I have now read the paper you recommended. I noticed that in your paper the school clusters were completely independent across Catholic and public schools. In contrast, in my dataset some of the teams were recruited in both study 1 and study 2, so the clusters within the studies are not completely independent. Is this an issue?


Yes, this is an issue and may call for 3-level modeling.

Hi Bengt, Thanks for your earlier reply. Following your comment and example 9.22, I set up the Mplus commands as follows: However, it did not run so smoothly: 1. I received the following warning: *** WARNING Clusters for TEAM with the same IDs have been found in different clusters for STUDY. These clusters are assumed to be different because clusters for ... How can I specify this data complexity in Mplus? 2. More worryingly, the whole thing just crashed and I received this message: THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY. FAILURE TO GENERATE TRUNCATED NORMAL DEVIATE. THE PROBLEM OCCURRED IN CHAIN 1. THE PROBLEM OCCURRED ON THE BETWEEN LEVEL 3. THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY. THE THETA MATRIX IS NOT POSITIVE DEFINITE, AS IT SHOULD BE. PROBLEM INVOLVING VARIABLE BQPR1. THE PROBLEM OCCURRED IN CHAIN 1. I am not sure what this means. What can I do to solve this? Thanks again!


Please send those 2 outputs and data to support with your license number. 

