Message/Author 


I would like to estimate a model to predict a grouplevel outcome that is measured at three times. The main predictor is "team climate" which is measured at the individual level, but is aggregated to a group level variable. Furthermore, I have some control variables at the individual level (e.g., sex, age). The basic idea is to create a multilevel model that accounts for (1) individual variance both in the measurement of the team climate variable and in the prediction of the teamlevel outcome, and (2) the variability of the outcome across time. How can I specify such a model in Mplus? Your help is much appreciated. 


(1) Here is one way to think about it. You may compare your case with the UG ex 9.1 figure on page 239. For the Within level (individuals) it sounds like you have individuals' team climate ratings as y, and control variables as x's. For Between (group) you have the y circle as a random intercept which varies across groups. That is your aggregate team climate, expressed as a latent variable. On Between it sounds like you don't have any w or xm variables, so you can just say y; (2) Here the question is if you want to study growth or if time is just a nuisance and you simply want to take into account correlation across time. Multilevel growth models are shown in UG ex 9.12 and on. 

Murphy T. posted on Wednesday, March 02, 2011  1:50 am



Thanks for your answer. I have some follow up questions. To specify: (1) I have team performance as the dependent variable (measured at the team level only) and I want to regress it on team climate (team level) and control variables (individual level). Can I specify team performance (measured at team level) as the dependent variable on both within and between? Or do I have to specify team performance on between only and some other dependent variable on within? (2) I just want to take it into account and not study growth. How can I specify this? Thanks very much from a new Mplus user. 


(1) You say Between = teamperf; in the VARIABLE command and in the MODEL command: %Within y on x1 x2; ! x1 x2 are control variables and y refers to !team climate %Between% teamperf on y; ! y is between part of team climate (the ! random intercept) Is only the grouplevel outcome team performance measured 3 times, or are the other variables also measured 3 times? To learn quicker, you may want to consider attending our multilevel course that we give end of March at Johns Hopkins. 

Murphy T. posted on Friday, March 04, 2011  9:50 am



Thank you very much. (1) Do I understand it correctly that team climate has to be the individual level team climate variables rather then the (teamlevel) aggregated scores? (2) Only the grouplevel outcome team performance is measured at 3 times; the other variables are measured at one time. 


1. If you have individuallevel control variables x, then using the individual level team climate in the way shown seems best. 2. Then you can handle that simply by saying %Between% teamperf1teamperf3 on y; That is, you have 3 betweenlevel team performance variables as 3 columns in your data. 

Murphy T. posted on Wednesday, September 21, 2011  12:58 am



Thank you! I have now specified the model and it works (I decided to use only one measurement point for theoretical reasons, however). Now I tried to specify an interaction between two latent variables at the between level. Both are individuallevel variables that reflect teamlevel constructs. I used the XWITH command but got the error message: "The XWITH option is not available for observed variable interactions. Use the DEFINE command to create an interaction variable. Problem with: ZSOCC_CS  ZSOCCYN XWITH ZCS" My input was: CLUSTER = tid; BETWEEN = Zaewg_1; CENTERING = GRANDMEAN (ZCS ZAR ZEM ZMP Zsex Zage Zsoccyn Zaewg_1); Analysis: Type = twolevel RANDOM; ALGORITHM = INTEGRATION; MODEL: %WITHIN% ZCS ZAR ZEM ZMP Zsoccyn on Zsex Zage; %BETWEEN% Zaewg_1 on ZCS ZAR ZEM ZMP Zsoccyn; Zsocc_CS  Zsoccyn XWITH ZCS; Zaewg_1 on Zsocc_CS; Where "ZCS", "ZAR", "ZEM", "ZMP", "Zsoccyn" are the team climate variables; "Zsex" and "Zage" are individuallevel controls and "Zaewg_1" is the teamlevel outcome. It would be great if you could help me. Thank you very much. 


You can put a factor behind each of them on between, for example, f1 BY Zsoccyn; Zsoccyn@0; and use the factors in XWITH. 


I have a dataset of days clustered within people. My indirect model is all at a within level (all day level variables). I want to control for a between (level 2) variable. Since the analysis is Type=Twolevel, I have the MODEL: %Within% followed by the model relationships. How do I specify the controls? It seems that since the outcomes are at L1 and the controls are at L2, it will not allow me to regress on one the other in either a %between% or %within% statement. Thank you! 


I assume that your daylevel variables have variation across level2 units. If so, their betweenlevel parts, their random intercepts, can be related to the control variable. That's how variables can relate across levels. 


Hi Bengt/Linda, I have a dataset of individuals nested in teams. Some individuals, however, are members of several teams (e.g. 5 teams). Furthermore, my outcome variable is measured at the team level, while all predictors are measured at the individual level. How would I construct a model incorporating the fact that the outcome variable is measured on the group level and the predictors on the individual level, while also taking into account that some individuals are members of multiple teams? I've not seen an example in the literature on the combination of these two issues. Your help is greatly appreciated. 


You may want to take a look at the multiple membership literature: http://www.bristol.ac.uk/cmm/team/hg/xcmmrev2.pdf and perhaps also the crossclassified literature: Gonzalez, De Boeck, Tuerlinckx (2008) A DoubleStructure Structural Equation Model for ThreeMode Data. Psychological Methods, 337  353 


Hello, we want to analyse multileveldata (indiviudals nested in teams) with a level 2 outcome (e.g. leaders' satisfaction), and a level 2 moderator (e.g. a leaders' trait). The independent variable is on level 1. This is our syntax. We are not sure if this is correct. Any corrections or hints are welcome! Does the (crosslevel) interaction have to be defined as a between variable? usevar = Leader_A Member_A Leader_J IactA; CLUSTER IS TEAM_3; BETWEEN ARE Leader_A Leader_J; DEFINE: IactA=Leader_A*Member_A; center Leader_A (grandmean) Member_A (groupmean); ANALYSIS: TYPE IS TWOLEVEL RANDOM; MODEL: %BETWEEN% Leader_J on Leader_A Member_A IactA; 


So you intend "Member_A" to be the latent betweenlevel part of the Member_A variable. Read about that under Part 2 of the UG ex 9.1 on page 262. You should drop RANDOM in the Analysis command since you have only a random intercept/mean. 


I am interested in analyzing data consisting of repeated measures in clusters (schools) but with different individuals (students) at each time point. The objective is to analyze whether certain intervention had effect on the smoking prevalence in these schools, at two time points after the baseline. Everything is measured at the individuallevel, but I'm using some of the measures as aggregated means on schoollevel, to serve as indicators of the school tobacco control policies. For me, measuring change over time is important, so could you advice how to analyze that in Mplus with this kind of data? I would prefer using binary outcome variable (daily smoker/other). 


So are you saying that you want a binary growth model for 3 time points where the repeated outcome is an aggregate over students in the schools? Is the unit of analysis school? How many schools do you have? 


Yes, that is my basic objective and the unit of analysis is school. However, I'm also interested whether it is possible to use individual outcome here. I have altogether 339 schools with data from all three time points. There are altogether 108599 students in the data, but as I mentioned, each student has data only from one time point. The variables of interest are gender, age, parental smoking, general attitudes towards smoking (these I would like to keep on individual level), school type and four variables related to school tobacco control policies (aggregated to schoollevel). The studied intervention relates to legislation so there is no specific intervention variable in the data, the time perspective is important for that. Then is the outcome for current student smoking, which could be used on individual level or aggregated to school mean. If I wanted to study possible moderation effects (e.g. of some schoollevel policy), what would be a suitable model to test that in this setting? I very much appreciate your help! 

Back to top 