Multiple group analysis

Mplus Discussion > Structural Equation Modeling >

Anonymous posted on Tuesday, July 11, 2000 - 10:51 am

I'm doing a structural equation model with 3 latent variables, a number of exogenous x variables and 4 groups. The model converges and has an RMSEA of .044. I am now trying to test invariance across groups. I see that intercepts are constrained to be equal across groups by default, as are factor loadings. I see how to constrain residuals to be equal across groups, and I've also successfully constrained the betas, but I can't figure out how to constrain the gammas. Is it possible?

Linda K. Muthen posted on Tuesday, July 11, 2000 - 2:58 pm

If you have constrained the betas (f1 ON f2), the gammas (f1 ON x) are done in the same way. For example, in the overall model statement:

MODEL:

f1 ON x1 (1);
f1 ON x2 (2);

will constrain the two regression coefficients to be equal across groups.

Let me know if this is not what you mean.

Lisa Pellerin posted on Wednesday, July 12, 2000 - 6:42 am

(previously Anonymous) My problem was that I have several x variables in the model, and I wanted to constrain the coefficients to be equal across groups only, and not also across all x variables, which is what happens with this:
y on x1 x2 x3 (1);

I did figure out a solution to my problem. Since a regression equation can be on more than one line and the (#) has to be on the same line as the variable(s) it acts upon, I just used multiple lines for my equation:
y on x1 (1)
x2 (2)
x3 (3);
and this worked!

Linda K. Muthen posted on Wednesday, July 12, 2000 - 7:50 am

Yes, this is the case. Only one parenthesis can be on a line and it applies to all parameters on the line. The overall model statement sets equalities within and between groups. An equality statement in a group specific model statement sets equalities within a group.

Anonymous posted on Wednesday, January 31, 2001 - 5:37 pm

I want to use Mplus to construct a multigroup SEM that includes two CFAs for categorical data (two factors, 3+ dichotomous indicators each). Is it the case that Mplus will allow me to run these models without any invariance assumptions whatsoever ? I get the impression that I have to constrain at least one of the three sets of parameters either for identification or convergence: loadings, thresholds, means, scale factors. Maybe this is because when I try to relax any of the Mplus default invariance assumptions I get an error msg stating that the standard errors for the model cannot be calculated. Is the problem with my data (lack of variance ?) or with the identification of the model ?

Bengt O. Muthen posted on Thursday, February 01, 2001 - 5:30 pm

Multiple-group CFA with categorical outcomes uses the default of holding thresholds and loadings invariant across groups, fixing the factor means to zero in the first group while letting them be free in the other groups, and fixing the delta scale factors to one in the first group while letting them be free in the other groups.

If you instead want to have no invariance restrictions across groups you should repeat the thresholds and loadings in each group so that they are group-specific. Note, however, that in this case you need to fix to zero the factor means in all groups (you cannot identify both group-specific thresholds and group-specific factor means) and fix the scale factors to one in all groups (they can only be identified when thresholds and loadings are invariant). You can also accomplish no invariance by doing separate-group analyses.

Anonymous posted on Friday, February 02, 2001 - 12:31 pm

Following up on your recommendation in the 2nd paragraph above: is there a particular interpretation to setting the scale factors equal to 1 (as opposed to 2 or 3, etc.) ? Also, regarding the scale factors themselves, do they refer to the variance of the underlying (continuous) y variable, to the error in measuring that variable via the categorical measure or both ? Given this, how "strong" is the assumption of equal scale factors in the multigroup model where loadings and thresholds are allowed to vary and factor means are set to zero, etc. ?

Bengt O. Muthen posted on Friday, February 02, 2001 - 3:17 pm

The scale factors refer to the inverted standard deviations of the latent response variables y*. This means that they are functions of loadings, factor variances, and residual variances. If one or more of those three components vary, the scale factor would vary. So, equal scale factors when loadings vary does not make sense.
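The relationship described in the reply above can be sketched numerically. This is a minimal illustration, assuming an indicator that loads on a single factor, so that Var(y*) = loading^2 * factor variance + residual variance. The function name and all numbers are hypothetical and are not taken from any Mplus output.

```python
import math

def scale_factor(loading, factor_variance, residual_variance):
    """Return 1 / SD(y*), assuming a single-factor indicator where
    Var(y*) = loading^2 * factor_variance + residual_variance."""
    ystar_variance = loading ** 2 * factor_variance + residual_variance
    return 1.0 / math.sqrt(ystar_variance)

# Hypothetical values: only the loading differs between the two groups.
group1 = scale_factor(loading=0.8, factor_variance=1.0, residual_variance=0.36)
group2 = scale_factor(loading=1.2, factor_variance=1.0, residual_variance=0.36)

print(group1, group2)
```

With these made-up values the scale factor changes as soon as the loading changes, which is why holding scale factors equal while freeing the loadings is not a coherent combination.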

Holmes Finch posted on Tuesday, February 27, 2001 - 6:42 am

I am trying to compare two groups (ed and noned) on a confirmatory factor analysis solution. I have used the following command structure in MPlus, which I thought would work, but which isn't giving me the anticipated output. Again, what I want to be able to do in the end is determine whether the model is the same for the two groups. Thanks for your help.

model: intern by withd somat anx;
model: extern by del aggress;
model ed: withd somat anx (1);
model ed: del aggress (1);
model noned: intern by withd somat anx (2);
model noned: del aggress (2);

Linda K. Muthen posted on Tuesday, February 27, 2001 - 8:08 am

If you use the following syntax:

MODEL: intern BY withd somat anx;
extern BY del aggress;

the factor loadings will be held equal across groups. It is not clear what you are trying to do with the statements you have sent. If you tell me in words which parameters you are trying to hold equal and whether they are to be held equal within and/or across groups, I can then help you.

The two model ed commands that you have above will hold all residual variances equal across variables for ed, and the residual variances for del and aggress will be held equal to each other and also equal to the factor loadings for intern in the noned group.

By the way, one overall MODEL command and one group-specific model command is sufficient for any input.

Holmes Finch posted on Tuesday, February 27, 2001 - 11:15 am

Linda,

Thanks for your response. What I want to do is compare the two groups over all the parameters, and then maybe look at individual ones. The bottom line is, I want to be able to say that the same model does, or does not fit both groups. Does that make sense? Thanks.

Holmes

Linda K. Muthen posted on Tuesday, February 27, 2001 - 12:53 pm

If you send me your fax number, I will fax you several pages we use when we teach. These show setups for a variety of multiple group models that test a variety of hypotheses.

Anonymous posted on Tuesday, June 26, 2001 - 2:43 am

I am trying to do a multiple group analysis. All measurement parameters are held equal across groups by default. Is it possible to hold specific variances of latent factors equal across the groups? Which syntax do I have to use?

Linda K. Muthen posted on Tuesday, June 26, 2001 - 6:40 am

Any parameter that is not held equal by default can be held equal. Any parameter that is held equal by default can have that equality relaxed.

To hold a parameter equal, specify it in the overall MODEL command with a number in parentheses following it. One number in parentheses is allowed per record (line) of the input file. In a three factor model, the variances of the factors will be held equal across groups by adding the following to the overall MODEL command:

f1 (1);
f2 (2);
f3 (3);

Lee-Fay posted on Tuesday, July 17, 2001 - 4:34 pm

I am trying to run a twolevel analysis. But I get an error message telling me that 'the sample covariance matrix for the variables cannot be inverted'. I have checked my covariance matrices and no two variables are perfectly correlated and no variable has no variation. I have 11 homes with 349 subjects in total. The dependent variable is continuous, and I have 4 within-level predictors and 5 between-level predictors. What am I doing wrong?

Linda K. Muthen posted on Wednesday, July 18, 2001 - 7:54 am

Even though you cannot see any correlations of 1 in your sample between covariance matrix, there may be dependencies that result in singularity of the matrix. You mention that you have 10 variables and 11 homes. Having 11 homes is like having 11 observations at the between level. You would not be able to have more than 10 variables. So if there are variables you are not mentioning, this could also be the problem. We recommend at least 30-50 clusters for this type of modeling. You can try analyzing the sample between matrix to see if it can be inverted. Or you can send the input and data to support@statmodel.com and I will take a look at it.

Sandra Lyons posted on Tuesday, October 30, 2001 - 8:13 am

Can you please elaborate on the steps in multiple group analysis? I want to test group differences in two beta and two gamma coefficients. Am I correct that the model fitting steps leading up to testing the beta/gamma coefficients are to test assumptions of measurement invariance?

I understand that the first step is to fit the SEM model separately in each group. Then the next three steps are to fit the model in all groups (1) allowing all parameters to be free, (2) holding factor loadings equal, and (3) holding factor loadings and intercepts equal. Given the defaults for multiple group analysis with categorical indicators, am I correct that these three steps require that parameters that are constrained or fixed by default be relaxed? If so, could you please elaborate on which defaults to relax? I assume that if factor loadings and intercepts are invariant, then the default settings would be appropriate for testing differences in the beta/gamma coefficients.

Linda K. Muthen posted on Tuesday, October 30, 2001 - 9:57 am

The steps in looking at measurement invariance are slightly different with categorical indicators. For one thing, you are dealing with thresholds instead of intercepts. You want to compare two models rather than three to test measurement invariance.

Model 1 - This is the default model in Mplus. The thresholds are held equal across groups and the factor loadings are held equal across groups. The scale factor is fixed to one in the first group and free in the others. The factor means are zero in the first group and free in the others.

Model 2 - The thresholds and factor loadings are free across groups. Scale factors are one in all groups and factor means are zero in all groups.

Sandra Lyons posted on Tuesday, October 30, 2001 - 11:17 am

Thank you for the clarification. Could you point me to a good reference on examining measurement invariance with categorical indicators?

Linda K. Muthen posted on Tuesday, October 30, 2001 - 1:19 pm

I think you may find something relevant at www.statmodel.com under REFERENCES/CATEGORICAL/MIMIC.

Sandra Lyons posted on Friday, November 16, 2001 - 12:44 pm

I have run the group analysis below and now want to examine the model by 3 household types. Jaccard and Wan (1996) suggest examining three way interactions using multiple group analysis [in this case six groups].

Are there alternative approaches? For instance, would a two-level or MIMIC model be appropriate?

Grouping is t1totsup (0=below 1=above);
Usevariables are q21a q21b q21c q21d p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 lifevent nparpro aparpro;
Categorical are q21a q21b q21c q21d
p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13;
Define:
Cut t1totsup(36);
Model:
F1 by q21a q21b q21c q21d;
F3 by p1 p5 p6 p9;
F4 by p2 p3 p10 p11 p12 p13;
F5 by p4 p7 p8;
F6 by F3 F4 F5;
F6 on F1;
F6 on lifevent;
nparpro on F6;
aparpro on F6;

Sandra Lyons posted on Sunday, November 18, 2001 - 2:02 pm

Can you please tell me how to obtain the sample correlation matrix in order to report it with the analysis? Thanks!

Linda K. Muthen posted on Monday, November 19, 2001 - 7:58 am

You could do a multiple group analysis with six groups or a MIMIC with five dummy variables. Unless you have clustered data, TWOLEVEL would not be appropriate.

Multiple group analysis gives you the most flexibility if you have enough subjects per group. MIMIC cannot look at as many parameters but does not require as many subjects.

You can obtain a sample correlation matrix using SAVEDATA: TYPE (SAMPLE) IS CORRELATION;

Sandra Lyons posted on Wednesday, November 21, 2001 - 11:04 am

Thank you for your prompt and helpful support!

The group analysis below produced the error message that follows it. Is the solution to this problem to remove the offending indicators from the relevant groups?

Grouping is t1hhsup (1=losingle 2=hisingle 3=lopartner 4=hipartner
5=loexfam 6=hiexfam);
Usevariables are q21a q21b q21c q21d p1 p2 p3 p4 p5 p6 p7 p8
p9 p10 p11 p12 p13 lifevent nparpro aparpro;
Categorical are q21a q21b q21c q21d
p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13;
Missing = Blank;
Model:
F1 by q21a q21b q21c q21d;
F3 by p1 p5 p6 p9;
F4 by p2 p3 p10 p11 p12 p13;
F5 by p4 p7 p8;
F6 by F3 F4 F5;
F6 on F1;
F6 on lifevent;
nparpro on F6;
aparpro on F6;
Output:
Standardized;

*** ERROR
Group 2 does not contain
all values of categorical variable: P2
*** ERROR
Group 4 does not contain
all values of categorical variable: P2
*** ERROR
Group 5 does not contain
all values of categorical variable: P2
Mplus VERSION 2.02 PAGE 3
hhstructure moderation model 1 all paths free

*** ERROR
Group 6 does not contain
all values of categorical variable: P3

Linda K. Muthen posted on Wednesday, November 21, 2001 - 12:25 pm

In multiple group analysis with categorical outcomes, each variable must have the same values in each group. You would need to collapse categories of p2 and p3 to obtain this condition.

Sandra Lyons posted on Monday, November 26, 2001 - 9:31 am

Your reply of 11/19 said:

Multiple group analysis gives you the most flexibility if you have enough subjects per group. MIMIC cannot look at as many parameters but does not require as many subjects.

Jaccard and Wan (1996) recommend a minimum of 75 subjects per group (100 preferred), but this must depend on several factors such as the number of variables in the model. Can you suggest how to determine the minimum number of subjects needed for group analysis? My smallest group size is 50.

Also, I have convergence problems with a single six group model, but not when the same model is run in 3 separate analyses with two groups each. What are the implications of this?

Linda K. Muthen posted on Tuesday, November 27, 2001 - 7:33 am

As you said, sample size depends on many things. As a minimum for each group, you would want to have more observations than the number of variables. You would want to have 5 to 10 observations for each parameter. For categorical outcomes, you usually need more observations than for continuous outcomes. Sample size 50 seems small particularly for categorical outcomes. Regarding convergence, the measurement invariance restrictions that you are probably imposing may not hold across all groups.
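The rules of thumb in the reply above (more observations than variables in each group, and 5 to 10 observations per parameter) can be turned into a quick check. This is only a sketch: the function name is hypothetical, the variable count of 20 comes from the Usevariables list posted earlier, and the parameter count of 40 is an invented placeholder because the thread does not report it.

```python
def sample_size_check(n_obs, n_variables, n_parameters):
    """Apply the per-group rules of thumb quoted in the thread."""
    return {
        # Minimum condition: more observations than observed variables.
        "more_obs_than_variables": n_obs > n_variables,
        # Suggested range: 5 to 10 observations per free parameter.
        "recommended_min": 5 * n_parameters,
        "recommended_max": 10 * n_parameters,
        "meets_5_per_parameter": n_obs >= 5 * n_parameters,
    }

# Smallest group has 50 subjects; 40 parameters is a hypothetical count.
check = sample_size_check(n_obs=50, n_variables=20, n_parameters=40)
print(check)
```

The check says nothing about the extra demands of categorical outcomes, which the reply notes usually require more observations than continuous ones.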

Sandra Lyons posted on Saturday, December 01, 2001 - 12:33 pm

I've looked at the Mplus MIMIC examples and observed that none of them have independent latent variables. Hence, I'm wondering whether MIMIC is a good alternative to group analysis for the SEM I'm testing which is:

F1 by q21a q21b q21c q21d;
F3 by p1 p5 p6 p9;
F4 by p2 p3 p10 p11 p12 p13;
F5 by p4 p7 p8;
F6 by F3 F4 F5;
F6 on F1 lifevent;
nparpro aparpro on F6;

I'm primarily interested in group differences in the path coefficients.

If MIMIC is indeed appropriate for this analysis, is it analogous to OLS regression with dummy variables?

In multigroup analysis with categorical dependent variables, if measurement invariance is not of substantive interest, would it be appropriate to fix measurement parameters across groups to those obtained in the single group analysis in order to circumvent nonconvergence possibly due to measurement invariance?

bmuthen posted on Saturday, December 01, 2001 - 5:25 pm

The term MIMIC analysis is typically reserved for models with observed covariates influencing factors that have a set of indicators. But you can certainly put grouping variables as covariates into any SEM including yours above. Using grouping variables as covariates makes it possible to have different means (intercepts) of the variables that they are specified to influence (observed and latent). If you are interested in group differences in path coefficients (slopes), however, having grouping variables as covariates will not help.

I would not recommend fixing measurement parameters to single-group analysis values because you want to see that the measurement part of the model is not changed in important ways when doing the joint analysis of several groups - a convergence problem can be an indication of model misspecification.

Catherine Stanger posted on Monday, April 08, 2002 - 12:04 pm

I am trying to run a multiple group [mothers vs. fathers] two level [children within families] model:

CLUSTER IS sid;
GROUPING IS ptsex (1=fathers 2=mothers);

ANALYSIS:
TYPE = MEANSTRUCTURE TWOLEVEL;
MODEL:
%WITHIN%
dbnew on monitor (1)
agecb (2);
aggress on monitor (3)
agecb (4);
monitor on agecb (5);

%BETWEEN%
dbnew on monitor@0 agecb@0;
aggress on monitor@0 agecb@0;
monitor on agecb@0;
aggress with dbnew@0;

I'm trying to constrain the path coefficients to be the same for mothers and fathers. The code above yields different coefficients for mothers vs. fathers and the results are identical to code that omits the #'s in parentheses...what am I doing wrong??
thanks!!

Linda K. Muthen posted on Monday, April 08, 2002 - 12:37 pm

If you send your complete output to support@statmodel.com, I will look at it.

Anonymous posted on Sunday, June 23, 2002 - 3:33 pm

Sanity check needed: I'm running a multigroup SEM in Mplus with several ordered categorical variables as outcomes.

In Group 1 I specify my thresholds as follows:

[outcome$1*-2];
[outcome$2*-.8];
[outcome$3@0];
[outcome$4*1.5];

and in Group 2 I specify:

[outcome$1*-1];
[outcome$2@0];
[outcome$3@0];
[outcome$4*.5];

Is this the same as recoding my outcome variable for Group 1 but not Group 2 ? Mplus doesn't seem to allow group specific recodes using CUT on the DEFINE command, and doesn't give me an error msg when I use the above specification.

Thanks !

LMuthen posted on Monday, June 24, 2002 - 6:52 am

I don't believe that you can use thresholds to recode your data. You should be able to use DEFINE to recode data for one group, for example,

DEFINE:

if (group eq 1 and y1=2) then y1=1;

Sandra Lyons posted on Monday, July 22, 2002 - 10:28 am

Someone asked me why SEM uses multiple groups rather than products of factors to assess moderating effects. Do you have a brief explanation for this, or could you point me to the literature?

bmuthen posted on Monday, July 22, 2002 - 5:26 pm

Multiple groups can be, but don't have to be, used when a categorical variable is involved. This gives more modeling flexibility than using products since, for example, variances can be different across the groups.

Sandra Lyons posted on Monday, July 22, 2002 - 5:47 pm

Methods I have seen described for interacting latent variables with a continuous observed variable seem quite complex (Jaccard & Wan, Kenny & Judd) relative to group analysis. What method do you generally recommend? For example, I have the following model:

f2 on f1 x1;
x2 x3 on f2;

where f1 and f2 are latent variables with dichotomous indicators, and x1 - x3 are observed continuous variables

I want to test the moderating effects of a continuous variable on each path in the model. Would you recommend group analysis or product terms? If product terms, what method do you suggest?

bmuthen posted on Tuesday, July 23, 2002 - 8:30 am

Since the moderating variable is observed and not latent, the simplest approach would be to categorize the continuous moderating variable and do a multiple-group analysis. There are many methods for analysis of latent variable interactions (which includes your case), but I hesitate to recommend any. A new method for ML analysis by Andreas Klein seems superior but is not yet easily available in software form.

Anonymous posted on Tuesday, August 13, 2002 - 5:52 pm

We have been conducting a multigroup analysis with two groups and continuous indicators. We want to test whether some of the structural path coefficients are significantly different for group 1 vs. group 2.

e.g., for structural path x:
Group 1 standardized coefficient = .609
Group 2 standardized coefficient = .216

How can we determine if these coefficients for the same path but different groups are significantly different from one another?

Linda K. Muthen posted on Wednesday, August 14, 2002 - 8:27 am

To test whether some paths are different between two groups, you can run two models -- one with the paths held equal and the second with the paths not constrained to be equal. Then do a chi-square difference test. This is a test of the unstandardized coefficients, not of the standardized coefficients.
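The chi-square difference test described above can be computed by hand from the two runs. The sketch below uses only the Python standard library and assumes an estimator whose chi-square values can be differenced directly (for example, ML with continuous indicators, as in the question). The function names and the fit values are hypothetical and do not come from any output in this thread.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df >= 1."""
    if x <= 0:
        return 1.0
    half = x / 2.0
    if df % 2 == 0:
        # Even df: exp(-x/2) * sum over k < df/2 of (x/2)^k / k!
        term, total = 1.0, 1.0
        for k in range(1, df // 2):
            term *= half / k
            total += term
        return math.exp(-half) * total
    # Odd df: erfc(sqrt(x/2)) plus a finite correction series.
    term, total = 1.0, 0.0
    for k in range(1, (df - 1) // 2 + 1):
        if k > 1:
            term *= x / (2 * k - 1)
        total += term
    correction = math.sqrt(2.0 * x / math.pi) * math.exp(-half) * total
    return math.erfc(math.sqrt(half)) + correction

def chi2_difference(chisq_equal, df_equal, chisq_free, df_free):
    """Compare the run with paths held equal against the run with paths free."""
    diff = chisq_equal - chisq_free
    df_diff = df_equal - df_free
    if df_diff <= 0:
        raise ValueError("the constrained model must have more degrees of freedom")
    return diff, df_diff, chi2_sf(diff, df_diff)

# Hypothetical fit statistics: two paths held equal vs. free across groups.
diff, df_diff, p_value = chi2_difference(105.3, 50, 98.1, 48)
print(diff, df_diff, p_value)
```

A small p-value would suggest that holding the paths equal worsens fit, that is, that the unstandardized paths differ across groups.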

Anonymous posted on Friday, August 16, 2002 - 1:33 pm

I have a question about comparing multigroup SEM coefficients across groups.

Is it the case that the MG approach "controls" on differences in levels of my exogenous variables across groups ?

For example, I'm running a model on two groups, the first of which has much higher income and intelligence scores than the second group. Income and intelligence are one of about 10 different x variables used to predict an outcome variable y. Is it valid to compare differences in the direction and sizes of the effects of x1, x2, x3,...,x10 on y across groups ?

Anonymous posted on Friday, August 16, 2002 - 1:42 pm

I should have appended this second question to the one I originally submitted above:

Is there a convenient way to determine if structural coefficients are equal across groups in a MG SEM without having to resort to Chi-Square (WLS) tests ?

I ask because I have a large number of variables in my models and using individual Chi-Square tests would be tedious, and I think the significance of coefficients would be biased by the order in which I imposed the restrictions.

bmuthen posted on Saturday, August 17, 2002 - 9:39 am

Regarding your first question about controlling for differences, you confuse me by first talking about groups defined by income and intelligence and then talking about these variables as x variables. Let me answer the question as an MG situation where one x variable is used as a grouping variable, and therefore not used as one of the x variables. You should think of this as regular regression in two groups, where we know that the regression slope can be compared even if the x mean is different in the two groups.

Yes, you can print out (TECH3) the estimated covariance matrix for the parameter estimates and do a "correlated t test".

Anonymous posted on Wednesday, August 27, 2003 - 8:38 am

On August 17, Bengt recommends doing a correlated t-test to examine whether or not the coefficients for two groups in a multigroup model are different. I'm wondering if this is the appropriate test to use in all situations.

If one is working with data where individuals are not assigned to groups randomly, when the number of persons in the two groups differs considerably, and where the SEs for the coefficients of interest also vary considerably, shouldn't one use an unequal variance t-test or a t-test for independent samples ?

Also, in Bengt's original recommendation, wouldn't the df for a pooled t-test always be df=(number of groups - 2) = 0 ?

Thanks.

bmuthen posted on Wednesday, August 27, 2003 - 9:07 am

I was using "correlated t test" merely as an analogy. The TECH3-based test I have in mind is asymptotically normal, so the z test analogy is better.

Anonymous posted on Wednesday, August 27, 2003 - 9:20 am

I'm following up to your response to make sure I understand how comparing coefficients across groups in a multigroup model corresponds to common t-tests for comparing means across groups.

TECH3 would be needed to determine the covariance between a given pair of model parameters.

However if the two groups are independent (which I believe is an appropriate assumption if cases are assigned to groups based on non-random factors -- i.e., students allocated to schools, workers allocated to firms or sectors of the labor market), TECH3 wouldn't be needed and n1 and n2 would be the sizes of the two groups from which the coefficients (treated as averages) were obtained.

Thanks again.

bmuthen posted on Wednesday, August 27, 2003 - 11:48 am

Here is my understanding of this. I think this question was regarding a SEM, testing equality of structural coefficients. Even if the 2 groups correspond to independent samples, the invariance restrictions across groups typically imposed on measurement parameters could make the structural coefficients estimates from the two groups correlated - so that is where I was thinking TECH3 comes in. As far as I see it, the differences in group sample sizes are already taken into account in the 3 TECH3 components - this is unlike t tests where sample size enters because a variance for a sample mean is figured via the variance for each variable in the mean. So the resulting (approximate) z score ratio is correct.
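The approximate z test discussed above can be sketched as follows, assuming the "3 TECH3 components" are the two sampling variances of the group-specific coefficients and their covariance. The function name and all numbers are hypothetical; in practice the estimates would be the unstandardized coefficients and the variances and covariance would be read from the TECH3 output.

```python
import math

def coefficient_difference_z(b1, b2, var1, var2, cov12):
    """Approximate z test for b1 - b2 using Var(b1) + Var(b2) - 2 Cov(b1, b2)."""
    se_diff = math.sqrt(var1 + var2 - 2.0 * cov12)
    z = (b1 - b2) / se_diff
    # Two-sided p-value from the standard normal distribution.
    p_two_sided = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_two_sided

# Hypothetical unstandardized estimates and TECH3 entries for one path.
z, p = coefficient_difference_z(b1=0.60, b2=0.25,
                                var1=0.010, var2=0.015, cov12=0.002)
print(z, p)
```

Setting the covariance to zero gives the independent-samples version raised in the question. The covariance term matters when invariance restrictions on the measurement parameters make the two estimates correlated.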

Anonymous posted on Wednesday, September 24, 2003 - 9:46 am

Just to clarify on the testing equality of structural coefficients. Say, I have latent variables x1, x2 and x3 predicting latent variable y. I look at the difference in chi-squares if I fix everything to be equal between two groups and if I fix everything except the path from x1 to y --- does LM test tell me if this structural coefficient (y on x1) is significantly different between groups? Should I repeat the procedure two more times for x2 and x3?
Thank you in advance.

bmuthen posted on Wednesday, September 24, 2003 - 7:11 pm

Not quite the way you said it, I think. Instead:

To test if y on x1 is different across groups, you would run with the slope held equal across groups and then run allowing it to differ. Then do the same for y on x2, then for y on x3.

But if your hypothesis is that all 3 (y on x1, on x2, on x3) are equal across groups, then you would do one run with equal for all 3 across groups and one run letting them be different.

Daniel posted on Tuesday, March 30, 2004 - 10:32 am

In presenting the results of a multi-group LGM, is it appropriate to present standardized or raw path coefficients in a figure? I read in the Loehlin "LATENT VARIABLE MODELING" text that population differences in range on specific variables can influence comparability of standardized scores across populations? Is this a problem in multi-group analysis? Or are the standardized path coefficients based on values appropriate to the entire population?

Linda K. Muthen posted on Wednesday, March 31, 2004 - 6:44 am

I would report the raw coefficients and their standard errors in addition to the standardized coefficients. Don't forget that the significance test is for the raw coefficient. The standardizations are computed using the variances for each group. There are different opinions about this.

Daniel posted on Wednesday, March 31, 2004 - 11:32 am

Thanks very much once again for your help. One of the difficult parts of being a researcher rather than a statistician by training is that I must learn much technique on my own. So, while I have been reading a tremendous amount of text on a variety of subjects in SEM, it is sometimes difficult to see the forest for the trees! That's when the help of experts like yourself and Bengt is much appreciated.

Jen Bailey posted on Wednesday, April 28, 2004 - 5:13 pm

Is it possible to run a multigroup model in which a latent factor that exists in one group does not exist in the other?

Here's the scenario: I'm looking at within-individual continuity in latent substance use across adolescence and adulthood. Some of the members of the sample have children, and some do not. I'm interested in how parental substance use affects child problem behavior. My sample of parents is small (n = 200), and my substance use model is fairly large, since I have multiple indicators and multiple time points. Therefore, I would like to take advantage of my whole sample (n = 800) in estimating the substance use part of the model.

A colleague suggested that I do a multigroup model, leaving out the "child problem behavior" factor in the group that doesn't have children. The child problem behavior variables are, obviously, missing for all non-parents. The thought was that a multigroup model would be superior to mixing the parent and non-parent populations and using FIML because it would explicitly acknowledge that there are two populations in the sample. I've tried specifying a new latent factor in the model statement for my second group, but the program (Version 3) doesn't seem to like that.

What are your thoughts on using a multigroup approach in this case? How would I program such a model?

Thank you!

Linda K. Muthen posted on Thursday, April 29, 2004 - 8:22 am

Yes, this is possible. But you need to define the factor in the overall MODEL command not in a group-specific MODEL command. Then you need to set all of the factor loadings to zero in the group-specific MODEL command. The overall MODEL command is the model assigned to each group and then modified by the group-specific MODEL commands. Chapter 13 has a discussion of this.

Following is an example of how this can be done:

MODEL: f1 BY y1-y4;
f2 BY y5 y6 y7;
MODEL males: f2 BY y5@0 y6@0 y7@0;

Jen Bailey posted on Thursday, April 29, 2004 - 11:03 am

Hi Linda,

Thanks for your reply - I appreciate your syntax suggestion. I still have a problem, however. I wrote the syntax as you suggested, and got an error message saying that all cases in one group were missing data on some variables. This is true - in my non-parents group, there ARE no data for the indicators of child problem behavior, because there are no children.

Any suggestions for getting around the fact that the child problem behavior factor doesn't exist and its indicators are all missing data in the non-parent group?

Thanks again!

Linda K. Muthen posted on Friday, April 30, 2004 - 9:38 am

I think the only thing you can do is run the model with the factors and variables shared by all groups and test invariance of the factors over groups for those factors. Then you would have to run the group separately that has more factors and variables. Establishing measurement invariance would not be an issue for those factors.

Jen Bailey posted on Monday, May 03, 2004 - 10:27 am

Thank you for your time and advice. I very much appreciate having this discussion board as a resource.

Anonymous posted on Monday, June 14, 2004 - 6:06 am

Hello,

I am running a multi-group analysis with three racial groups - black, white, and hispanic. You will see in the input file below that I allow 2 variable (ED and CMR) paths (slopes, gammas) to be freely estimated among the three groups.

How can I allow one of the variables (MV1) to be constrained to be equal for the first two groups (black and white) and freely estimated/different for the third group (hispanic)?

VARIABLE:
GROUPING IS RAC (1=black 2=white 3=hispanic);
MISSING IS .;

MODEL:
F1 BY RE SF MH SFV SO QOL;

RE WITH SF MH SFV;
SF WITH MH SFV;
MH WITH SFV;

F1 ON MV1 (1)
AGE (2)
ED
MAR (3)
CMR
TR (4);

OUTPUT: STANDARDIZED;

Thank you in advance for your reply.

Linda K. Muthen posted on Monday, June 14, 2004 - 6:41 am

Add:

MODEL hispanic:

f1 ON mv1;

This will relax the equality constraint for the hispanic group.

Anonymous posted on Monday, August 16, 2004 - 10:42 am

Hi,

I was wondering if my code is correct for testing measurement invariance (the model has SOME categorical factor indicators and covariates). It is my understanding that I should use the theta parameterization. Is this correct? I believe I should run a model where everything is free (model 1); where factor loadings are held equal across groups (model 2); where factor loadings and variances of latent variables are held equal (model 3); where factor loadings, variances of latent variables, and covariances of latent variables are held equal (model 4); and finally where factor loadings, variances and covariances of latent variables, and regression parameters are held equal (model 5). I am not specifying thresholds. All of my categorical variables are coded 0-absent, 1-present. I read on page 67 of the User's Guide that if the thresholds are free across groups (I believe this is the default) and a factor loading for a categorical factor indicator is free across groups, the residual variance for the variable must be fixed to one in these groups for identification purposes. Do I need to fix the variance of pardep and fhdadc to one...or some other variable? I am having some identification issues. I am particularly interested in whether the regression weights are equal across groups.

Model 1: grouping is sex (0=male 1=female);
IDVARIABLE = subno;
missing=.;
categorical are fhdadc parsuic pardep late;

ANALYSIS: TYPE = mgroup;
parameterization=theta;
iterations= 50000;
MODEL:
suicide BY late@1 (1);
suicide by middle (2);
suicide by early (3);

attemp by mlife@1 (4);
attemp by lalife (5);
attemp by elife (6);

parprob by fhdadc@1 (7);
parprob by parsuic (8);
parprob by pardep (9);

extrov BY ext3@1 (10);
extrov by ext2 (11);
extrov by ext1 (12);

psychot BY psychot2@1 (13);
psychot by psychot1 (14);
psychot by psychot3 (15);

neurot BY neurot3@1 (16);
neurot by neurot2 (17);
neurot by neurot1 (18);

!parsuic@1;
!pardep@1;

attemp on suicide pareduc parprob extrov
psychot neurot careloss divorce nphycnt
nvbscnt nncnt cle31;

model female:
suicide BY late@1 (101);
suicide by middle (102);
suicide by early (103);

attemp by mlife@1 (104);
attemp by lalife (105);
attemp by elife (106);

parprob by fhdadc@1 (107);
parprob by parsuic (108);
parprob by pardep (109);

extrov BY ext3@1 (110);
extrov by ext2 (111);
extrov by ext1 (112);

psychot BY psychot2@1 (113);
psychot by psychot1 (114);
psychot by psychot3 (115);

neurot BY neurot3@1 (116);
neurot by neurot2 (117);
neurot by neurot1 (118);

MODEL 2:
missing=.;

categorical are fhdadc parsuic pardep late;

ANALYSIS: TYPE = mgroup;
parameterization=theta;
iterations= 50000;
MODEL:
suicide by late@1 middle early;
attemp by mlife@1 lalife elife;

parprob by fhdadc@1 parsuic pardep;

extrov by ext3@1 ext2 ext1;

psychot by psychot2@1 psychot1 psychot3;

neurot by neurot3@1 neurot2 neurot1;

attemp on suicide pareduc parprob extrov
psychot neurot careloss divorce nphycnt
nvbscnt nncnt cle31;

Model 3:
Add this to model 2.....
suicide (30);
parprob (31);
extrov (32);
psychot (33);
neurot (34);
attemp (35);

model female:

suicide (30);
parprob (31);
extrov (32);
psychot (33);
neurot (34);
attemp (35);

Model 4:
add this to MOdel 3....
parprob with extrov (44);
parprob with psychot (45);
parprob with neurot (46);
extrov with psychot (47);
extrov with neurot (48);
psychot with neurot (49);
model female:

parprob with extrov (44);
parprob with psychot (45);
parprob with neurot (46);
extrov with psychot (47);
extrov with neurot (48);
psychot with neurot (49);

MODEL 5:
add this to model 4....
attemp on suicide (149);
attemp on pareduc (150);
attemp on parprob (151);
attemp on extrov (152);
attemp on psychot (153);
attemp on neurot (154);
attemp on careloss (155);
attemp on divorce (156);
attemp on nphycnt (157);
attemp on nvbscnt (158);
attemp on nncnt (159);
attemp on cle31 (160);
Model female :
attemp on suicide (149);
attemp on pareduc (150);
attemp on parprob (151);
attemp on extrov (152);
attemp on psychot (153);
attemp on neurot (154);
attemp on careloss (155);
attemp on divorce (156);
attemp on nphycnt (157);
attemp on nvbscnt (158);
attemp on nncnt (159);
attemp on cle31 (160);

Are these the models you suggest? Is my syntax correct? Do I need to set the residual variance to one for parsuic and pardep (or other variables)? Thank you so much in advance.

Linda K. Muthen posted on Monday, August 16, 2004 - 6:14 pm

You can use either the delta or theta parameterization to test measurement invariance. Many of the equalities that you want to test are not measurement invariance in my opinion. Differences between factor means, variances, and covariances and regression coefficients describe population heterogeneity rather than measurement invariance. Factor loadings and thresholds are related to measurement invariance. Some see residual variances of factor indicators as measurement parameters. I would not require them to be equal for measurement invariance to hold.

Example 5.16 in the Mplus User's Guide shows a multiple group CFA with categorical factor indicators. To test measurement invariance, you would first run the default overall model where factor loadings and thresholds are held equal as the default. The second model is one where factor loadings and thresholds are unequal across groups. How to relax the default equality is shown in Example 5.16. With the THETA parameterization, residual variances instead of scale factors are fixed to one.

Anonymous posted on Tuesday, September 21, 2004 - 5:59 pm

Does Mplus 3 generate modification indices that rank the equality constraints in terms of their effects on overall model chi-square? If not, what is your recommended strategy for localizing areas of relatively worse "misfit" in complex multigroup SEMs? Thanks!

Linda K. Muthen posted on Wednesday, September 29, 2004 - 3:45 pm

No. No general strategy comes to mind. Just look for the largest ones and also see what difference it makes for parameter estimates when they are relaxed.

Anonymous posted on Friday, October 08, 2004 - 1:30 pm

I am testing measurement invariance of factor loadings where indicators are categorical. I consistently get an error message that the standard errors cannot be estimated because my model may not be identified. Hoping to fix this problem, I would like to constrain my factor means to zero. Someone else had this same problem...and the posted response was:

"If you instead want to have no invariance restrictions across groups you should repeat the thresholds and loadings in each group so that they are group-specific. Note, however, that in this case you need to fix to zero the factor means in all groups (you cannot identify both group-specific thresholds and group-specific factor means) and fix the scale factors to one in all groups (they can only be identified when thresholds and loadings are invariant)."

How do I fix to zero the factor means in all groups? What does the code look like?

Thank you!

Linda K. Muthen posted on Tuesday, October 12, 2004 - 4:49 pm

To fix a factor mean to zero, use the square bracket option in the overall MODEL command:

MODEL:

[f@0];

Madeline posted on Thursday, October 28, 2004 - 4:20 pm

Hi - I am testing measurement invariance of factor loadings across gender. My less restrictive model is giving me the following message: "THE MODEL ESTIMATION TERMINATED NORMALLY THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL."

Here is my code. Can you tell me what I am doing wrong?

Thanks!!!

INPUT INSTRUCTIONS

!Measurement Invariance of Factor Loadings across sex
TITLE: Invariance: Male vs Female
DATA: FILE IS Y:\Madeline\name1.dat;

VARIABLE: NAMES ARE
id
caring
friendly
join
betrfren
holiday
silly
partyfun
betrmood
drive
homework
lvcut
lvweapon
lvpunish
lvdamage
lvbeaten
lvthrt
lvver
lvstolen
everalc
binge30
lvdaybin
lvbinint
alc30
daysalc
drinkday
grade
a5
a6;

USEVARIABLES ARE everalc lvcut
lvweapon
lvpunish
lvdamage
lvbeaten
lvthrt
lvver
lvstolen
caring
friendly
join
betrfren
holiday
silly
partyfun
betrmood
drive
homework;

grouping is a5 (1=male 2=female);

MISSING = . ;

IDVARIABLE = id;

categorical are
everalc
caring
friendly
join
betrfren
holiday
silly
partyfun
betrmood
drive
homework
lvcut
lvweapon
lvpunish
lvdamage
lvbeaten
lvthrt
lvver
lvstolen;

ANALYSIS: TYPE = missing h1;
parameterization=theta;
iterations= 50000;

MODEL:

delinq by lvdamage@1;
delinq by lvcut (2);
delinq by lvweapon (3);
delinq by lvpunish (4);
delinq by lvbeaten (5);
delinq by lvthrt (6);
delinq by lvver (7);
delinq by lvstolen (8);

expec by partyfun@1;
expec by friendly (10);
expec by join (11);
expec by betrfren (12);
expec by holiday (13);
expec by silly (14);
expec by caring (15);
expec by betrmood (16);
expec by drive (17);
expec by homework (18);

everalc on delinq expec;
expec on delinq;

model female:
delinq by lvdamage@1;
delinq by lvcut (102);
delinq by lvweapon (103);
delinq by lvpunish (104);
delinq by lvbeaten (105);
delinq by lvthrt (106);
delinq by lvver (107);
delinq by lvstolen (108);

expec by partyfun@1;
expec by friendly (110);
expec by join (111);
expec by betrfren (112);
expec by holiday (113);
expec by silly (114);
expec by caring (115);
expec by betrmood (116);
expec by drive (117);
expec by homework (118);

OUTPUT: tech1 tech2 tech4 STANDARDIZED ;
SAVEDATA: DIFFTEST IS sexload.dat;

*** WARNING
Data set contains unknown or missing values for GROUPING,
PATTERN, COHORT and/or CLUSTER variables.
Number of cases with unknown or missing values: 454
1 WARNING(S) FOUND IN THE INPUT INSTRUCTIONS

Linda K. Muthen posted on Thursday, October 28, 2004 - 5:26 pm

With categorical outcomes, you must have thresholds and factor loadings both held equal or both free. You can't relax the constraint on a factor loading without relaxing the constraint on the threshold for the same item. I don't see that you have thresholds free in your MODEL command.

Examples 5.16 and 5.17 in the Mplus User's Guide show a multiple group CFA with categorical factor indicators. To test measurement invariance, you would first run the default overall model where factor loadings and thresholds are held equal as the default. The second model is one where factor loadings and thresholds are unequal across groups. In this model, with the Delta parameterization, scale factors must be fixed to one in all groups and factor means fixed to zero in all groups. With the Theta parameterization, residual variances must be fixed to one in all groups and factor means fixed to zero in all groups. How to relax the default equality is shown in Example 5.16.
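
A minimal sketch of the less restrictive model under the Theta parameterization, following the rules in this reply. The variable names u1-u4, the factor name f, the grouping variable sex, and the group labels are hypothetical, and the DATA command and NAMES list are omitted, so this is a fragment rather than a complete input:

```mplus
! Hypothetical binary indicators u1-u4; groups male and female.
VARIABLE:  CATEGORICAL ARE u1-u4;
           GROUPING IS sex (1 = male 2 = female);
ANALYSIS:  PARAMETERIZATION = THETA;
MODEL:     f BY u1@1 u2-u4;   ! first loading fixed at one
           [f@0];             ! factor mean zero in all groups
           u1-u4@1;           ! residual variances one in all groups
MODEL female:
           f BY u2-u4;        ! loadings freed for this group
           [u1$1 u2$1 u3$1 u4$1];   ! thresholds freed as well
```

Mentioning a loading or threshold in the group-specific MODEL command is what releases the default equality for that group, so each item that has its loading freed should also have its threshold listed there.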

Sarah Meadows posted on Monday, November 08, 2004 - 6:15 pm

Hello,

I am running a series of multigroup (male and female) CFAs with continuous factors in an attempt to test measurement invariance (a la Bollen 1989). Moving to increasingly more restrictive constraints (factor loadings, intercepts, means, and variance-covariances), I am now ready to constrain error variance-covariances. However, I am unclear on 1) what the default treatment of error variances is in Mplus and 2) how to constrain them to be equal between groups. Can you tell me what syntax I need to constrain error variances?

An example program is as follows:

GROUPING is female (0=male 1=female);

USEVARIABLES ARE da1 da2 da3 da4 da5 da6 da7 da9 da10 da11 da12 da13 da14 da15 da16 da17 da18 da19;

missing = .;

ANALYSIS: type=meanstructure;

MODEL:
depress by da1-da5* da6@1 da7* da9-da19*;
da14 with da17;
da15 with da11;
da9 with da19;
da7 with da18;
da4 with da11;
da4 with da15;
da18 with da5;
da5 with da7;

!Variances;
da1 (1)
da2 (2)
da3 (3)
da4 (4)
da5 (5)
da6 (6)
da7 (7)
da9 (8)
da10 (9)
da11 (10)
da12 (11)
da13 (12)
da14 (13)
da15 (14)
da16 (15)
da17 (16)
da18 (17)
da19 (18);
depress (19);

MODEL female:
[depress@0];

Thanks very much for your help!

bmuthen posted on Sunday, November 14, 2004 - 11:24 am

The default is that the error (co-)variances are allowed to differ across groups. Your input specifies that the error variances are the same across groups since you have in the overall part of your model the statements

da1 (1);
etc

Larry Cashion posted on Monday, November 15, 2004 - 6:30 pm

This is probably too basic a question, but when asked by my PhD supervisor I was unable to answer. He has no experience with Mplus, and we are both on a steep learning curve. I am running SEM with three groups of about 70 participants, aged 6, 8, and 10 years. If I use a multiple group format for the SEM, what exactly am I doing? Am I correcting for or accounting for group differences? Similarly, when would I use CLASS and when would I use CLUSTER?

Thank you.

Mary posted on Tuesday, November 16, 2004 - 6:15 am

Dear Mr and Mrs Muthén,

I have a very simple question regarding the grouping option. Besides the constraint that forces the loadings to be equal across the groups, are there any other differences between running a regression with the grouping option and running each group as a separate regression?

Thank you very much!

Linda K. Muthen posted on Tuesday, November 16, 2004 - 8:19 am

Re: Larry Cashion. Multiple group analysis is used to study parameter estimates across groups of different observations. In your case, you would be studying differences in parameter estimates across age. The CLASSES option is used to define categorical latent variables in mixture models. The CLUSTER option is used to name the cluster variable in an analysis of complex survey data, that is, data that are not collected as a simple random sample.

Linda K. Muthen posted on Tuesday, November 16, 2004 - 9:29 am

I assume that you are asking whether a CFA with covariates will give different parameter estimates when all parameters are free in a multiple group analysis than when you run the analysis on each group separately. If all parameters are free across groups, the results should be the same.

Anonymous posted on Tuesday, November 30, 2004 - 12:51 pm

I have a question about reporting factor means. I conducted multiple, multi-group analyses, and I tested invariance across gender, age, and race.
Now, for the manuscript, I would like to report factor means. However, the factor means for one group in each of the multigroup analyses are set to zero. Is my only option to report:

Mean Conduct Problems
Men 0
Women -1.2
Caucasian 0
African American 2.12
Etc....

Thank you

bmuthen posted on Tuesday, November 30, 2004 - 5:43 pm

Yes, factor means need to be fixed to zero in one group for identification purposes. You should view this group as the reference group to which the factor means of the other groups are compared. So that's how you want to portray it in your reporting. Another way of saying this is that it is really only the factor mean difference between the groups that is identifiable.

Laura Stapleton posted on Wednesday, December 08, 2004 - 4:22 pm

I have what I think is a simple question. I have two covariance matrices for which I would like to run a multigroup analysis. All Mplus examples I have seen on the website and in the manual assume that one has raw data with a grouping variable present on the dataset.
1) Can one model with two (or more) covariance matrices instead?
2) If so, could you provide some example syntax?

Thank you muchly!

Linda K. Muthen posted on Wednesday, December 08, 2004 - 6:14 pm

See the discussion of multiple group analysis in Chapter 13 of the Mplus User's Guide. The only difference is how you refer to the groups. Note that some estimators require raw data.
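
A rough sketch of what a two-group input with summary data might look like. The file name, sample sizes, variable names, and model are hypothetical; it assumes the two covariance matrices are stacked in one file, first group first:

```mplus
! Hypothetical file name, sample sizes, and variable names.
DATA:      FILE IS twogroups.dat;  ! group 1 matrix, then group 2
           TYPE IS COVARIANCE;
           NGROUPS = 2;
           NOBSERVATIONS = 250 300;
VARIABLE:  NAMES ARE y1-y6;
MODEL:     f1 BY y1-y3;
           f2 BY y4-y6;
MODEL g2:  f1 BY y3;               ! loading of y3 freed in group 2
```

With summary data there is no GROUPING option to supply labels, so the groups are referred to as g1, g2, and so on in the group-specific MODEL commands.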

Anonymous posted on Tuesday, December 14, 2004 - 2:46 pm

Hi -
I am trying to use the Difftest option to test measurement invariance of factor loadings and thresholds across sex. The first model allows factor loadings and thresholds to vary across groups. The second model constrains factor loadings and thresholds to be equal across groups. I keep getting an error message saying my models are not nested. Could you help me determine why they are not nested?

The first model -
MODEL:
delinq by lvdamage@1 lvcut lvweapon lvpunish lvbeaten
lvthrt lvver lvstolen;

lvdamage@1;
lvcut@1;
lvweapon@1;
lvpunish@1;
lvbeaten@1;
lvthrt@1;
lvver@1;
lvstolen@1;

expec by partyfun betrmood caring friendly join
betrfren holiday silly homework drive;

partyfun@1;
betrmood@1;
caring@1;
friendly@1;
join@1;
betrfren@1;
holiday@1;
silly@1;
homework@1;
drive@1;

[expec@0];
[delinq@0];

daysalc on delinq expec;
expec on delinq;

model indirect:
daysalc IND expec delinq;

model female:

delinq by lvdamage; [lvdamage$1];
delinq by lvcut; [lvcut$1];
delinq by lvweapon; [lvweapon$1];
delinq by lvpunish; [lvpunish$1];
delinq by lvthrt; [lvthrt$1];
delinq by lvver; [lvver$1];
delinq by lvstolen; [lvstolen$1];

expec by betrmood; [betrmood$1];
expec by caring; [caring$1];
expec by friendly; [friendly$1];
expec by join; [join$1];
expec by betrfren; [betrfren$1];
expec by holiday; [holiday$1];
expec by silly; [silly$1];
expec by homework; [homework$1];
expec by drive; [drive$1];

SAVEDATA: Difftest = h1.dat;
OUTPUT: sampstat STANDARDIZED;

The second Model:
MODEL:

delinq by lvdamage@1
lvcut
lvweapon
lvpunish
lvbeaten
lvthrt
lvver
lvstolen;

expec by partyfun@1
friendly
join
betrfren
holiday
silly
caring
betrmood
drive
homework;

daysalc on delinq;
expec on delinq;
daysalc on expec;

model indirect:
daysalc IND expec delinq;

Thank you

Linda K. Muthen posted on Wednesday, December 15, 2004 - 9:20 am

Can you send both outputs and your data to support@statmodel.com.

Linda K. Muthen posted on Thursday, December 16, 2004 - 1:40 pm

I just did this nested model testing for examples in the user's guide using both the Delta and Theta parameterization and it worked fine. I would be happy to send you the setups if you give me your email address.

Anonymous posted on Friday, December 31, 2004 - 9:22 am

Could you direct me to the documentation for the new difftest that's available in MPlus when using the WLSMV estimator?

Linda K. Muthen posted on Friday, December 31, 2004 - 10:20 am

There is no paper that explicitly describes this. DIFFTEST is based on principles described in:

Muthén, B., du Toit, S.H.C. & Spisic, D. (1997). Robust inference using weighted least squares and quadratic estimating equations in latent variable modeling with categorical and continuous outcomes.

which can be requested from bmuthen@ucla.edu.

Holmes Finch posted on Tuesday, January 11, 2005 - 5:25 am

Hi,

I would like to use WLSMV for testing invariance for a CFA model with dichotomous data. I see that Mplus has the DIFFTEST command, and I can use it to save the derivatives, but I'm unsure what to do with them after that. Could you help me understand how to use this command? Thanks. Below is an example of the code I want to use.

MODEL: F1 BY Y1@1 Y2-Y3*;
F2 BY Y4@1 Y5-Y6*;
F1-F2*;
F1 WITH F2*;
[Y1$1*-1];
[Y2$1*-.5];
[Y3$1*-.25];
[Y4$1*0];
[Y5$1*.25];
[Y6$1*.5];

MODEL G2: F1 BY Y2*;
[Y2$1*-.5];
{Y2@1};

MODEL: F1 BY Y1@1 Y2-Y3*;
F2 BY Y4@1 Y5-Y6*;
F1-F2*;
F1 WITH F2*;
[Y1$1*-1];
[Y2$1*-.5];
[Y3$1*-.25];
[Y4$1*0];
[Y5$1*.25];
[Y6$1*.5];
SAVEDATA: DIFFTEST IS OUT.DAT;
RESULTS ARE CATDIFFRESULTS.DAT;

Linda K. Muthen posted on Wednesday, January 12, 2005 - 2:01 pm

See Chapter 12 of the Version 3 Mplus User's Guide. There is an example of how to use the DIFFTEST option.
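
As a hedged outline of the two-run procedure, assuming the WLSMV estimator: the less restrictive (H1) model is estimated first and its derivatives are saved, then the more restrictive (H0) model is estimated in a separate input that reads that file. The file name deriv.dat is hypothetical, and the MODEL commands are left as comments:

```mplus
! Run 1 (separate input file): less restrictive H1 model.
ANALYSIS:  ESTIMATOR = WLSMV;
! MODEL command for the H1 model goes here.
SAVEDATA:  DIFFTEST IS deriv.dat;  ! save derivatives

! Run 2 (separate input file): more restrictive H0 model.
ANALYSIS:  ESTIMATOR = WLSMV;
           DIFFTEST IS deriv.dat;  ! read derivatives from run 1
! MODEL command for the H0 model goes here.
```

The chi-square difference test is then reported in the output of the second run, and the H0 model must be nested in the H1 model for the test to be computed.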

Holmes Finch posted on Thursday, January 13, 2005 - 9:52 am

Thanks very much.

Holmes Finch posted on Wednesday, January 19, 2005 - 5:22 am

I appreciate your directing me to the discussion in the manual regarding using DIFFTEST for WLSMV. I'm using this command in a simulation study, and was wondering if it's possible to save the results of the chi-square difference test for WLSMV that one gets using DIFFTEST. I couldn't find it in the file produced by the RESULTS command, and have looked through the manual, but haven't found anything. Thanks in advance.

Linda K. Muthen posted on Thursday, January 20, 2005 - 7:48 pm

I don't think this is possible but send the question to support@statmodel.com. Thuy would know for sure.

Anonymous posted on Monday, February 07, 2005 - 9:28 am

I am doing a multiple group analysis with 4 latent variables and one categorical outcome variable. I am using:

analysis:
type=mgroup MISSING h1;
iterations=100;
PARAMETERIZATION=THETA; estimator= WLSMV;

the output is giving me a message :
_________________________________________________

SERIOUS COMPUTATIONAL PROBLEMS OCCURRED IN THE BIVARIATE ESTIMATION OF THE CORRELATION FOR VARIABLES PERSISTE AND IIE72. CHECK YOUR DATA.
IF THE PROGRAM RECOVERS FOR THIS PAIR OF VARIABLES (SEE TECHNICAL 6
OUTPUT), THE ESTIMATES ARE VALID. THE PROBLEM OCCURRED FOR THE FOLLOWING OBSERVATION(S):
OBSERVATION 3
OBSERVATION 3
COMPUTATIONAL PROBLEMS ESTIMATING THE CORRELATION FOR PERSISTE AND IIE72
______________________________
I have checked my data and I don't find a problem with it. The TECH6 report is not provided with the type of analysis I am doing. What are my alternatives to fix this problem?

shawna anderson posted on Friday, February 11, 2005 - 9:48 pm

I am doing a multiple group analysis with 3 groups. I am not getting any fit statistics with the model results, except AIC and BIC. Can you tell me how to get chi-square, TLI, IFI, and RMSEA? Thanks!

Linda K. Muthen posted on Saturday, February 12, 2005 - 8:12 am

If you are not getting any fit statistics, it is most likely the case that they are not available for the model you are estimating. If you send your full output to support@statmodel.com, I can determine the reason.

Anonymous posted on Friday, February 18, 2005 - 2:08 pm

I just ran a multi-group analysis to test differences in mediation across race. I can test whether the paths of the mediation model are significantly different across groups. Is there a way to test whether the mediated effect (or proportion mediated) is statistically different across groups?

bmuthen posted on Friday, February 18, 2005 - 5:16 pm

If you have the estimate of the mediated effect and its SE for each of the 2 groups, you can simply use those numbers to create the approximately normal test variable:

(e1 - e2)/(se(e1-e2)),

where the denominator is sqrt(var(e1-e2)), where var(e1-e2) is var(e1) + var (e2), where var(e) is the square of the SE(e).
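
As a worked illustration of this test, with made-up numbers (mediated effects of 0.30 and 0.12 with standard errors 0.08 and 0.06) and assuming the two groups are independent samples:

```latex
z = \frac{e_1 - e_2}{\sqrt{SE(e_1)^2 + SE(e_2)^2}}
  = \frac{0.30 - 0.12}{\sqrt{0.08^2 + 0.06^2}}
  = \frac{0.18}{\sqrt{0.0064 + 0.0036}}
  = \frac{0.18}{0.10} = 1.8
```

Since 1.8 is below 1.96, these hypothetical mediated effects would not differ significantly at the two-sided .05 level.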

Anonymous posted on Wednesday, March 23, 2005 - 7:47 am

I did a multigroup analysis and a DIFFTEST. The DIFFTEST yielded a chi-square difference value of 13.237 with 1 degree of freedom (the difference between the more restrictive H0 model and the H1 model is only one parameter), which is statistically significant at the .05 probability level. Does this mean that the less restrictive model H1 (in which the parameter was allowed to be estimated freely) fits better than the more restrictive H0 model and therefore should be used in my further analysis? I ask this question because I tested the same model with AMOS (only difference: the 6-scale indicators of the latent variable were treated as continuous variables) and I got nearly identical results (Estimator: ML), except for the mentioned parameter. Setting this parameter equal across both groups results in AMOS in a significantly better cmin/df.

bmuthen posted on Wednesday, March 23, 2005 - 7:49 am

The answer is yes.

Anonymous posted on Friday, April 22, 2005 - 11:49 am

When is multigroup analysis more appropriate than running a regression with interactions? The variances of my variables are quite different across groups - and I am wondering if this is why multigroup analyses is telling me the groups are different but regression with interaction analyses are telling me the groups are the same. I was thinking this disparity was because the multigroup takes variances by group, where regression with interactions takes pooled variances.

bmuthen posted on Friday, April 22, 2005 - 3:15 pm

It sounds like you are correct.

Anonymous posted on Monday, April 25, 2005 - 10:19 am

Dr. Muthen
I am trying to set up the baseline model, that is, the model without any constraints, for a multiple group analysis (three groups).

Can I use the sum of the df as a check to see whether it ran without any constraints? I ran an individual model where I had an estimated df = 19, and then I ran a multiple group model where I had an estimated df = 63.

If not, how can I check if my syntax would be the correct model without any constraints?

I used
analysis: type = gen missing h1; estimator=mlr;

Thanks,

Linda K. Muthen posted on Monday, April 25, 2005 - 11:38 am

You can look at TECH1 of the OUTPUT command to see if you have the model that you want.
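
For the df check itself, a short sketch of the arithmetic, assuming the same model has 19 df in each of the three groups when run separately and that the estimator and mean structure are the same in both kinds of run:

```latex
df_{\text{unconstrained}} = \sum_{g=1}^{3} df_g = 3 \times 19 = 57,
\qquad 63 - 57 = 6
```

Under those assumptions, a multiple group model with 63 df carries 6 more restrictions than a fully unconstrained one. In TECH1, parameters that are held equal across groups appear with the same parameter number in each group, which is one way to locate them.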

Anonymous posted on Tuesday, April 26, 2005 - 2:15 am

Multigroup comparison & Sample size_lisrel

1. I wonder, when testing the measurement model for
invariance across groups, what should the PSIs and the BETAs be (IN or PS)?

2. When sample size is >3000, is it then appropriate to use MIs > 5 (as in Byrne, Shavelson and Muthen, 1989) as a criterion for releasing parameters?

Linda K. Muthen posted on Tuesday, April 26, 2005 - 11:03 am

I do not understand what IN and PS are, but structural parameters such as factor means, variances, covariances, and regression coefficients do not need to be held equal for measurement invariance. I would, however, have the same structural parameters in the groups while testing measurement invariance.

I think the rule of thumb of 5 probably has little meaning at this time.

Anonymous posted on Monday, May 09, 2005 - 2:20 am

I am doing a multigroup analysis using the theta parameterization with a dichotomous outcome. If I understood it correctly, the factor loadings are held equal across the groups, as are the means and intercepts. If I want to free the factor loadings and the thresholds, I have to do it simultaneously, and I HAVE to fix the residual variances in all groups to one and the factor means in all groups to zero. Is that correct? I ask this because if I do a chi-square diff test between a model with factor means fixed to zero in the first group and free in the other group and a model with factor means fixed to zero in both groups, the result speaks clearly against the second model.

Linda K. Muthen posted on Monday, May 09, 2005 - 5:58 am

It is the factor loadings and thresholds of the factor indicators that are held equal as the default. In the default model, factor means are fixed to zero in the first group and are free to be estimated in the other groups. With the theta parameterization, residual variances of the factor indicators are fixed to one in the first group and are free to be estimated in the other groups. You are correct that when you free factor loadings and thresholds, all factor means should be fixed to zero and all residual variances should be fixed to one.

Anonymous posted on Thursday, June 23, 2005 - 8:34 am

After running a separate analysis for males (M) and females (F), I ran a multiple group with no constraints. However, my chi-square and df values for M and F do not add up to the chi square and df for the multiple group no constraints model. I have provided my syntax for the M model (the F model is the same - I do get the same number of df for the M and F when I run them separately). I have also included my syntax for the multiple group (MG) no constraints model. Each separate model has 154 df and the MG model has 320 df.

MG model syntax:

VARIABLE: ... MISSING = BLANK ;

GROUPING IS gender (0=female 1=male) ;

ANALYSIS: TYPE = MISSING H1;

MODEL: extprob BY T1delinq T1agg ;
risk BY MomBSI Finstrai Neighpro ;
intprob BY T1somati T1Withdr T1anxiou ;
pospar BY Monitor MCTrust SchInvol ;
devpeer BY SchFr NeighFr PeerDelq ;
extprob2 BY T2delinq T2agg ;
intprob2 BY T2somati T2withdr T2anxiou ;
pospar ON risk ;
devpeer ON pospar T1parstr;
T1parstr ON risk ;
extprob2 ON devpeer extprob;
intprob2 ON devpeer intprob;
T2delinq WITH T1delinq ;
T2agg WITH T1agg ;
T2somati WITH T1somati ;
T2withdr WITH T1withdr ;
T2anxiou WITH T1anxiou ;

MODEL male:
extprob BY T1agg ;
risk BY Finstrai Neighpro ;
intprob BY T1Withdr T1anxiou ;
pospar BY MCTrust SchInvol ;
devpeer BY NeighFr PeerDelq ;
extprob2 BY T2agg ;
intprob2 BY T2withdr T2anxiou ;

OUTPUT: STANDARDIZED MODINDICES(3.84) SAMPSTAT TECH1 ;

Separate model:

VARIABLE: ... MISSING=BLANK ;

ANALYSIS: TYPE = MISSING H1;

MODEL: extprob BY T1delinq T1agg ;
risk BY MomBSI Finstrai Neighpro ;
intprob BY T1somati T1Withdr T1anxiou ;
pospar BY Monitor MCTrust SchInvol ;
devpeer BY SchFr NeighFr PeerDelq ;
extprob2 BY T2delinq T2agg ;
intprob2 BY T2somati T2withdr T2anxiou ;
pospar ON risk ;
devpeer ON pospar T1parstr;
T1parstr ON risk ;
extprob2 ON devpeer extprob;
intprob2 ON devpeer intprob;
T2delinq WITH T1delinq ;
T2agg WITH T1agg ;
T2somati WITH T1somati ;
T2withdr WITH T1withdr ;
T2anxiou WITH T1anxiou ;

OUTPUT: STANDARDIZED MODINDICES(3.84) SAMPSTAT ;

Your help is greatly appreciated.

BMuthen posted on Friday, June 24, 2005 - 1:51 am

This is a support question. Please send your outputs, data, and license number to support@statmodel.com.

Anonymous posted on Wednesday, July 27, 2005 - 4:05 pm

If you find a model is different across two or more groups, is it best to test them simultaneously and get one set of model statistics? Or is it better to split the sample and test the models for each sample separately and get separate sets of model statistics?

bmuthen posted on Wednesday, July 27, 2005 - 6:39 pm

If all parameters are different across groups, it is simpler to work with each group separately. But as long as some parameters are equal across groups you benefit from a simultaneous analysis.

Eva Van de gaer posted on Friday, August 26, 2005 - 7:33 am

Hello,

I am testing measurement invariance for a single construct that was measured at different time points. I use multiple CFA in Mplus where the different groups represent the different measurement occasions. I would like to model covariances between the like items' error variances across occasions. I do not know how to model this in a multiple CFA framework in Mplus. Any suggestions will be highly appreciated.

Thank you

Linda K. Muthen posted on Friday, August 26, 2005 - 7:53 am

You should not use different groups to represent different measurement occasions because in multiple group analysis each group should contain independent observations. Following is the input for a multiple indicator factor model with four measurement occasions:

MODEL:
f1 BY y11
y21 (1);
f2 BY y12
y22 (1);
f3 BY y13
y23 (1);
f4 BY y14
y24 (1);
[y11 y12 y13 y14] (2);
[y21 y22 y23 y24] (3);
[f1@0 f2 f3 f4];

If you want a residual covariance, you would state, for example:

y13 WITH y14;

Gerald Lackey posted on Friday, October 21, 2005 - 3:00 pm

I have a SEM model with two latent endogenous variables that I am treating as continuous and using an MLR estimator. I am testing invariance of the model using the grouping option in Mplus and I have been able to do most of what I want. I am confused, however, about how to constrain the means of my latent factors to be equal across my groups. Can this be done for latent endogenous variables? Related to this, above Dr. Muthen notes that factor means must be set to zero in one group to identify the model, but then why is my tech4 output giving me an estimated mean for my latent variables in both groups? I do see that the intercept for my latent is set to zero in the first group, but I'm somehow missing the connection here. Thanks for any help you can give me.

Linda K. Muthen posted on Saturday, October 22, 2005 - 6:50 am

In a model where intercepts are estimated for the latent variables, there is not a straightforward test of whether means are equal. In a model where you are estimating means not intercepts, you can test that means are equal by fixing the means to zero in all groups.

The model estimated means in TECH4 are based on the model. When a latent variable is endogenous, its mean is equal to the intercept plus the regression coefficients times the means of the exogenous variables it is regressed on.
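
Written out as a formula, the statement above reads roughly as follows. The symbols are notation assumed here for illustration, not Mplus output labels: for group g, alpha is the factor intercept, gamma_k the regression coefficient on covariate x_k, and E(x_k) that covariate's mean.

```latex
E(\eta_g) = \alpha_g + \sum_k \gamma_{kg}\, E(x_{kg})
```

So even in the group where the intercept is fixed at zero, the TECH4 mean is the sum of the coefficient-times-mean terms, which is generally not zero unless the covariate means are zero.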

Pancho Aguirre posted on Monday, November 14, 2005 - 6:31 pm

Hello Linda and Bengt,

I am wondering if Mplus allows me to answer an empirical question. I have employees' data from 31 organizations. My model includes three latent variables at the individual level: job satisfaction, job performance, and worker's belief. My DV is job performance, my IV is job satisfaction. I conducted a multi-sample analysis and I found that the relationship between my DV and IV varies across organizations (i.e. is moderated by organization).
Now, I want to test if this moderating effect of organization on the relationship between my DV and IV is partially mediated by worker's belief. Is this even possible in Mplus? If it is, could you please refer me to some material that deals with this type of problem?

Thanks in advance for your help,

Pancho

Boliang Guo posted on Tuesday, November 15, 2005 - 1:49 am

In your case there are 31 organizations, so I think you could consider modeling a two-level path analysis, which examines the mediating effect after partialling out the level-2 effects. If you do not have level-2 variables in your model, just leave the intercept and slope random in the model.
31 level-2 units is better suited to multilevel analysis; anyway, try checking the level-2 variance of the intercept and slope first.

Pancho Aguirre posted on Wednesday, November 16, 2005 - 7:05 pm

Hello Linda and Bengt,

I'm wondering if I can conduct the following analysis in Mplus. I modified Examples 9.9 and 9.10 from the Mplus Version 3 User's Guide, pages 205-207. I have 31 clusters; would that be a large enough number of clusters?

TITLE: this is an example of two-level CFA with continuous factor indicators, covariates,and random slopes
DATA: FILE IS ex9.9.dat;
VARIABLE:NAMES ARE y1-y4 x1-x4 w clus;
CLUSTER = clus;
BETWEEN = w;
ANALYSIS:TYPE = TWOLEVEL RANDOM;
ALGORITHM = INTEGRATION;
INTEGRATION = 10;
MODEL:
%WITHIN%
fw1 BY y1-y4;
fw2 BY x1-x4;
s | fw1 ON fw2;
%BETWEEN%
fb BY y1-y4;
y1-y4@0;
fb s ON w;

Thanks a lot,

Pancho

bmuthen posted on Thursday, November 17, 2005 - 5:14 am

Yes, this model can be estimated in Mplus. It may however require a long computing time. 31 clusters is on the border of being too low. Note that 31 is the sample size for between parameters. You have only 7 between parameters so you are probably ok.

Scott R. Colwell posted on Wednesday, January 18, 2006 - 9:00 am

Is it possible to do a multi-group analysis using a covariance matrix as the input if the group variable was included in the matrix? Or if it is not in the covariance matrix, but you know how many groups and the number of respondents by group...but the covariance matrix is not separated out by group?

Thanks,

Linda K. Muthen posted on Wednesday, January 18, 2006 - 9:26 am

It is possible to do a multiple group analysis using covariance matrices for some estimators. How to do this is described in Chapter 13 under Multiple Group Analysis, Data In Multiple Group Analysis, Summary Data One Dataset. The grouping variable is not part of the matrices.

Carol posted on Friday, February 10, 2006 - 9:09 am

Hello Dr. Muthen,

I am running a twin model in MPlus using Carol Prescott's examples as a template. In my latest model I ran into the following error message:

WARNING: THE RESIDUAL COVARIANCE MATRIX (PSI) IN GROUP MZ18 IS NOT POSITIVE DEFINITE. PROBLEM INVOLVING VARIABLE A2.

Why might this happen and what are the implications in terms of parameter estimates and fit statistics?

Thank you,
Carol

bmuthen posted on Friday, February 10, 2006 - 7:40 pm

This message is ok for twin modeling where the A factors are fixed to correlate 1.0 for MZs. The warning message is good in general where you don't want factor correlations of 1.0. In your case, you can ignore it. If you are doing twin modeling, you will enjoy new features in Mplus Version 4 which will be out in a few weeks.

Aboagyewa Boohene posted on Tuesday, February 21, 2006 - 11:10 pm

Hello
I am using PLS to verify gender differences in the factors that influence small firms' performance. I ran the full model and then one separately for males and females. I am wondering how I could do the multigroup analysis and what to compare. Is it the path coefficients, the t statistics, or the means? I used PLS Graph 3.0. Or is there a way to run the whole model using multigroup analysis?

Linda K. Muthen posted on Wednesday, February 22, 2006 - 6:13 am

There will be a description of testing for measurement invariance in the Version 4 Mplus User's Guide which will be available online next week. I don't know anything about PLS.

Scott R. Colwell posted on Sunday, March 19, 2006 - 9:55 am

If I have 3 groups 1 = low 2 = medium and 3 = high and I want to test the invariance of a structural path between the low and high group only (so that my degrees of freedom difference is 1), would I use:

MODEL:
F1 on F2 (1);

MODEL Medium:
F1 on F2;

So that only group 1 and 3 are held equal...does this sound reasonable?

Linda K. Muthen posted on Sunday, March 19, 2006 - 10:26 am

It sounds reasonable.

Heejung Chun posted on Thursday, May 11, 2006 - 7:21 pm

Hello,

I try to conduct a multiple group analysis by testing the invariance of first-order factor loadings on second-order factors.

When I ran a fully constrained model, the result indicated that the factor loadings of the first-order factors on the second-order were not equivalent.
It showed that factor loadings, intercepts and thresholds of observed variables were constrained.

How can I constrain the factor loadings of the first-order factors on the second-order factor?

Thank you.

Linda K. Muthen posted on Friday, May 12, 2006 - 10:43 am

I am not clear what you mean. See Example 5.6 in the Mplus User's Guide. This is a second-order factor analysis model. Tell me which paths in that model you want to constrain to be equal.

Zhongmiao Wang posted on Saturday, May 13, 2006 - 4:35 pm

Hello, Dr. Muthen:
The questionnaire has 33 items, each on a 5-point Likert scale. Using CFA, a measurement model with 5 factors was constructed. Then I tested measurement invariance across two groups.
I first allowed the factor loadings and the item thresholds to be freely estimated, but held the scale factors of the items at 1 and the factor means at 0 in both groups. Doing this, I got a chi-square value of 1583.775. Then I constrained the factor loadings and item thresholds to be equal across groups. The chi-square value for the more restrictive model was 921.745*. However, the chi-square difference is a positive 26.589. I used DIFFTEST to do the chi-square difference test because I used the WLSMV estimator. Is it possible for the chi-square of the more restrictive model to be smaller than the chi-square of the more flexible model? Am I doing this right?

Thank you so much!

Best Regards
Zhongmiao Wang

Linda K. Muthen posted on Sunday, May 14, 2006 - 9:35 am

With WLSMV, it is only the p-values of the chi-square that you should be interpreting for each analysis. This is why we have the DIFFTEST option for comparing two models.
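
For readers following along, a minimal sketch of the two-run DIFFTEST setup is given below. Only the relevant commands are shown, the rest of each input is omitted, and the file name deriv.dat is an arbitrary choice made for this illustration.

```mplus
! Run 1: the less restrictive model; save the derivatives
ANALYSIS:  ESTIMATOR = WLSMV;
SAVEDATA:  DIFFTEST IS deriv.dat;

! Run 2 (a separate input file): the more restrictive, nested model
ANALYSIS:  ESTIMATOR = WLSMV;
           DIFFTEST IS deriv.dat;
```

The second run reports the chi-square difference test, and that is the value to interpret rather than the difference between the two printed chi-square values.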

David Barker posted on Tuesday, June 06, 2006 - 10:01 am

I would like to use the factor scores from a multiple group analysis with continuous variables to graph the relationship between two latent variables. However, the factor scores from the multiple group analysis do not seem accurate; that is, some of the children with high scores on the observed variables have very low factor scores (e.g., -3.8), while others with near identical scores on the observed variables have high factor scores (e.g., 2.0). When I examine the factor scores computed from the two single group analyses, they appear as expected, with high scores on the observed variables translating into high factor scores. Why are the factor scores from the multiple group analysis markedly different from those from the single group analyses? Why do they not reflect the trends seen on the observed variables?

Thank you for your time,

Dave Barker

Linda K. Muthen posted on Tuesday, June 06, 2006 - 10:21 am

This is a question that would require you to send your input, data, outputs, and license number to support@statmodel.com. If you are not using the most recent version of Mplus, I would suggest that as a first step.

HW posted on Friday, June 16, 2006 - 12:21 pm

I am working with 5 groups, and would like to test for structural invariance doing pairwise comparisons. I know this code:

model:
x on y1 (1)
y2 (2)
y3 (3);

will result in a test of equivalence for y1 (and y2, y3) across all groups. How can I code it so that only group 2 and group 3 (for example) are being compared? I am evaluating the significance of between-group differences using the chi-square difference test, incorporating the scaling correction factor (I am using WLSM estimation).

Thanks

Linda K. Muthen posted on Friday, June 16, 2006 - 5:01 pm

You need to use group-specific MODEL commands to achieve this.

MODEL:
x on y1
y2
y3;
MODEL g2:
x on y1 (1)
y2 (2)
y3 (3);
MODEL g3:
x on y1 (1)
y2 (2)
y3 (3);

Ronald Cox posted on Friday, June 16, 2006 - 5:59 pm

Hi
I am testing to see if measurement invariance in a CFA model holds for a repeated measures study. I am fitting the same model simultaneously in both samples (time 1 and time 2), without any parameter constraints, in order to create a baseline model. However, I am getting an error message of "insufficient data". I am using the demo version. Do you have any suggestions about what I might be doing wrong? My input file follows.
Thanks,

INPUT INSTRUCTIONS

TITLE: Baseline model 10th and 11th graders
STEP 3)
DATA: FILE = assig6data3.1.INP;
TYPE = COVARIANCE;
NGROUPS= 2;
NOBS = 220 220;
VARI: NAMES = CA11 CA12 CA13 CA21 CA22 CA23;
MODEL: CASPIRE1 BY CA11 CA12 CA13;
CASPIRE2 BY CA21 CA22 CA23;
MODEL G2: CASPIRE1 BY CA11 CA12 CA13;
CASPIRE2 BY CA21 CA22 CA23;

*** ERROR
Insufficient data in "assig6data3.1.INP"

Linda K. Muthen posted on Friday, June 16, 2006 - 9:11 pm

This means that Mplus is not finding enough information in the data file. You need to place the covariance matrix for group 1 first followed by the covariance matrix for group 2. See Chapter 13 where this is described. If you can't solve this, you need to send your input, data, and output to support@statmodel.com.
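
As a schematic of the layout described above, assuming free-format lower-triangular covariance input and, for brevity, only three variables: the s entries stand for group 1 and the t entries for group 2. These symbols are placeholders to be replaced by numbers, not actual data.

```text
s11
s21 s22
s31 s32 s33
t11
t21 t22
t31 t32 t33
```

With the six variables in the input above, each group's block would have six rows (21 entries), so a file holding only one such block would not contain enough values for NGROUPS = 2.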

HW posted on Friday, July 07, 2006 - 7:41 am

A few questions regarding multiple group comparisons:

1) I have read that Kenny recommends testing for structural invariance before testing for invariance of error covariance - what would be the harm in testing for invariance of error covariance prior to testing for structural invariance?

2) If forcing two structural parameters to be equal results in a non-positive definite latent variable covariance matrix OR model non-convergence, what should be done about this? What would be the next step?

3) I have read previously on the MPlus discussion board that if a scaled chi-square difference test doesn't run due to negative chi-square difference values, that this is a function of the method and it is not possible to conclude whether the parameters are equal or not in each group. Can you provide a reference for this?

Thanks.

Linda K. Muthen posted on Wednesday, July 12, 2006 - 1:53 am

1. There is no harm but invariance of error covariances is less likely than structural invariance.
2. This may indicate that the structural parameters should not be held equal.
3. There is a Satorra and Bentler article about this from a few years ago. I don't know the exact reference.

HW posted on Monday, July 17, 2006 - 11:14 am

When testing between-group differences, should it always be a change of one degree of freedom between models? If I hold a parameter equal across all groups, I get a change of four degrees of freedom. Should I be putting equality constraints between two groups at a time?

Linda K. Muthen posted on Monday, July 17, 2006 - 12:25 pm

This would depend on your hypothesis.

Daniel Rodriguez posted on Wednesday, August 02, 2006 - 7:58 am

Hi, I'm working on a two-group model with uneven group sizes. The grouping variable is high school team sport participation among females. The first group has 128 participants. The second (no teams) group has only 43 individuals. I ran the two-group model and all the fit indices are good, including a non-significant chi-square and an RMSEA = 0 (0, .03). My question is whether this analysis is troublesome because of the vast difference in group sizes?

Linda K. Muthen posted on Wednesday, August 02, 2006 - 9:39 am

I don't think there should be a problem due to different group sizes other than the larger one has more power than the smaller one.

Daniel Rodriguez posted on Wednesday, August 02, 2006 - 10:02 am

Thanks.

monica oxford posted on Wednesday, August 09, 2006 - 1:52 pm

Is it possible to do a multiple group path analysis? If so, what syntax would I use to constrain the paths (and test for significant differences)? I have three groups.
Thanks in advance!

MODEL:
y1 on x1 - x3;

Linda K. Muthen posted on Wednesday, August 09, 2006 - 2:03 pm

Yes. The following would hold the regression coefficients equal across groups:

MODEL:
y1 on x1 (1)
x2 (2)
x3 (3);

Nina Zuna posted on Wednesday, August 23, 2006 - 1:56 pm

Dear Drs. Muthén,

I am still in Mplus learning mode and came across something I don't understand.
I ran 2 single CFAs for my 2 grps and then ran my initial Multiple grp (MG) CFA (configural invariance). Each used MLR estimator. My single group Chi sqs. using MLR do not add up to multiple grp total chi sq using MLR. If this same procedure is done using ML they add up.
Que 1. What is diff about MLR that makes the two separate chi sqs not add up to MG chi sq?
Second puzzling occurrence, regardless of estimator used: I had always assumed in MG invariance testing that the group with the lower contribution to chi sq had better fit (ideally you want these #'s roughly equal in MG), but when I did my single CFAs as described above I found that the opposite occurred.
The group with the larger chi sq in MG, when run in a single CFA, had better fit than the grp with the lower chi sq run in a single CFA. The grp with better fit statistics in single CFA (higher chi sq) appears to be driving the fit statistics in MG invariance tests. This group also has the larger n, so perhaps this power differential is the cause. However, I thought that since the CFI and TLI are comparative fit indices, they wouldn't be as influenced by sample size?? I am quite confused by this.
Que 2. Any thoughts would be very much appreciated.

Thank you kindly :-)

Bengt O. Muthen posted on Wednesday, August 23, 2006 - 7:15 pm

Q1. This is the same issue as MLR chi-square differences between nested models not being chi-square distributed. This topic is discussed on our web site - see left margin How-To "chi-square difference test".

Q2. Groups with larger n influence the parameter estimates more. And parameter estimates in turn influence CFI/TLI. You say "better fit statistics in single CFA (higher chi sq)" - that must be a typo since high chi-square is a worse fit statistic than a low chi-square.
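
For reference, a sketch of the scaled difference computation that Q1 points to, assuming the usual Satorra-Bentler form. The notation is chosen here for illustration: T0, c0, d0 are the reported chi-square, scaling correction factor, and degrees of freedom of the nested (more restrictive) model, and T1, c1, d1 are those of the comparison model.

```latex
c_d = \frac{d_0 c_0 - d_1 c_1}{d_0 - d_1}, \qquad
T_d = \frac{T_0 c_0 - T_1 c_1}{c_d}
```

Because each run has its own scaling correction factor, the reported MLR chi-square values cannot simply be added or subtracted across runs the way ML values can.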

Nina Zuna posted on Wednesday, August 23, 2006 - 8:58 pm

Thanks for your response to que 1, which makes sense. As for the 2nd que, I am still stumped b/c it is not a typo. Below is my output from the 2 single CFAs and MG.

Chi-Sq Test of Model Fit-Disability group (n=112)
Value 351.127*
df 183
P-Value 0.0000 CFI .810 TLI 0.782

RMSEA 0.091 As you will note all 3 fit statistics are worse in this model with lower Chi Sq value.

Chi-Sq Test of Model Fit (Non Disability group n=566)
Value 514.760*
df 183 PValue 0.000
CFI 0.906 TLI 0.892

RMSEA 0.057 Fit statistics are better in this model with higher Chi Sq.

Chi-Sq Test of Model Fit (Multiple group- Disability and Non-Disability)

Value 882.407*
df 366
P Value 0.0000

CFI 0.889 TLI 0.872
RMSEA 0.065
In the multiple group, the fit statistics are in between the other two models, with the Non Disab. grp with higher Chi sq. seeming to dominate.

Still perplexed....any thoughts?

Bengt O. Muthen posted on Thursday, August 24, 2006 - 5:48 pm

Convention suggests that CFI should be above 0.95 for a well-fitting model. I don't think one should compare fit indices when they all point to this degree of misfit. Degrees of poor fit can't really be judged well, I think. In any case, fit indices quite often disagree with each other - this is why it is useful to work with many - at least it is helpful in cases where they are all good.

Nina Zuna posted on Thursday, August 24, 2006 - 6:39 pm

Thank you, Bengt; your continued follow-up is very much appreciated. Indeed, I agree the fit is bad for both groups. I continue to grapple with the fact that the higher chi-square had the better model fit. So based on your response, am I correct to assume that when model fit is this poor, one might see such anomalies as better model fit occurring with a higher chi-square than in a model with a lower chi-square?
I don't think I have seen this before. Everything I have read indicates that Chi square value is a measure of badness of fit.
Is there any explanation I could offer to my committee members on this discrepancy?

With much gratitude (and final posting!),

nz

Bengt O. Muthen posted on Friday, August 25, 2006 - 7:53 am

Your Non-Disability group has worse chi-square but better CFI than your disability group - this may be due to the Non-Disability group having a much larger sample size where sample size probably affects chi-square more than CFI.

Nina Zuna posted on Friday, August 25, 2006 - 9:00 am

Again, thank you so much for your time. Your website and discussion board are such wonderful resources.
I look forward to meeting you and learning from you in MD.

nz

Madeline Hogan posted on Friday, October 13, 2006 - 5:20 pm

In testing measurement invariance (categorical indicators: loadings and thresholds), does anyone have an idea about how many invariant loadings/thresholds are needed to meet criteria for partial measurement invariance?

Linda K. Muthen posted on Sunday, October 15, 2006 - 11:37 am

I don't think there are any definite guidelines for this. A few might be okay statistically, but the more important issue is if the construct can be argued to be the same across time or groups.

mehdi rezaei posted on Tuesday, November 21, 2006 - 5:25 am

Would you please help me with a description of multigroup analysis in LISREL?

Linda K. Muthen posted on Tuesday, November 21, 2006 - 6:19 am

I don't know about LISREL. This forum is for Mplus. You would need to contact LISREL support.

TAO, Sha posted on Thursday, March 01, 2007 - 12:58 pm

I am trying to do a 3-group SEM with summary data (correlation matrices and STDs). There are two predictors (one measured by 3 indicators, the other by 2 indicators) and one outcome measured by 3 indicators. This analysis is to examine the equality of the path parameters from the two predictors to the outcome. The script is specified as follows:
TITLE:
Grade 1-3 SEM:
only paths from Independent LVs to the DV are constrained to equal ;
DATA: FILE IS "D:\GRADE1-3.txt" ;
TYPE = CORR MEANS STD ;
NOBSERVATIONS = 100 100 100 ;
NGROUPS = 3 ;
VARIABLE: NAMES ARE OV1 OV2
OV3 OV4 OV5
OV6 OV7 OV8 ;
ANALYSIS: TYPE = General ;
ESTIMATOR is ML ;
MODELS:
LV1 by OV1@1 OV2 OV3 ;
LV2 by OV4* OV5@1 ;
LV3 by OV6@1 OV7 OV8 ;

LV3 on LV1 LV2;
LV1 with LV2 ;

LV1 LV2 LV3;

OV1 OV2 OV3 OV4 OV5
OV6 OV7 OV8 ;

TAO, Sha posted on Thursday, March 01, 2007 - 12:59 pm

MODEL g2: LV1 by OV1@1 OV2 OV3 ;
LV2 by OV4* OV5@1 ;
LV3 by OV6@1 OV7 OV8 ;

LV1 LV2 LV3;

OV1 OV2 OV3 OV4 OV5
OV6 OV7 OV8 ;

[LV1 LV2 LV3] ;

[OV1 OV2 OV3 OV4 OV5
OV6 OV7 OV8 ] ;

MODEL g3: LV1 by OV1@1 OV2 OV3 ;
LV2 by OV4* OV5@1 ;
LV3 by OV6@1 OV7 OV8 ;

LV1 LV2 LV3;

OV1 OV2 OV3 OV4 OV5
OV6 OV7 OV8 ;

[LV1 LV2 LV3] ;

[OV1 OV2 OV3 OV4 OV5
OV6 OV7 OV8 ] ;

OUTPUT: STANDARDIZED SAMP ;

When I run the analysis, MPLUS stopped with an error message:
*** ERROR
Insufficient data in "D:\GRADE1-3.txt"

So I checked the summary data, and did not find anything wrong with the three matrices and STDs. Would you pls let me know what caused this error and how to fix it? Thanks a lot.

Linda K. Muthen posted on Thursday, March 01, 2007 - 2:03 pm

Please do not continue your post into more than one window. Posts that cannot be fit into one window are not appropriate for Mplus Discussion. This is a support question. Please send your input, data, output, and license number to support@statmodel.com.

Sarah Strand posted on Tuesday, April 17, 2007 - 10:19 pm

I am trying to use multiple group analysis for a SEM model with two continuous latent independent variables and a variety of observed independent variables regressed on a count dependent variable.

When I try to include the GROUPING option, I get the following error:
ALGORITHM = INTEGRATION is not available for multiple group analysis.
Try using the KNOWNCLASS option for TYPE = MIXTURE.

However, I have not specified "ALGORITHM=INTEGRATION" and MIXTURE does not make sense for my model (I am using GENERAL). I tried using KNOWNCLASS to see if it would work, and it says: KNOWNCLASS option is only available with TYPE=MIXTURE.

Any idea what the problem might be? Thanks so much in advance.

Linda K. Muthen posted on Wednesday, April 18, 2007 - 7:08 am

Please send your input, data, output, and license number to support@statmodel.com.

Linda posted on Thursday, April 26, 2007 - 8:50 am

I have experimental data with multiple groups (3 intervention groups and 1 control group). I was told that I could create contrasts between the groups and use them as exogenous variables, or use multiple group analysis. What is the advantage of doing one vs. the other? Also, if I use the multiple group analysis, would I be including the control group as well? If this question is too basic, could you refer me to an article? I have done a search before and can't find an article that addresses my question.

Thanks in advance.

Linda K. Muthen posted on Saturday, April 28, 2007 - 8:43 am

In multiple group analysis, more parameters can vary than in a model where the grouping variable is a covariate where only intercepts and means can vary. I would include the control group. You may find the following paper of interest:

Muthén, B. & Curran, P. (1997). General longitudinal modeling of individual differences in experimental designs: A latent variable framework for analysis and power estimation. Psychological Methods, 2, 371-402.

Michael Peterman posted on Tuesday, May 01, 2007 - 11:48 am

Using WLSMV, we're estimating a multiple group path analysis with an ordered categorical outcome, which we'll refer to as z. Additionally, we have several exogenous predictors, call them x1, x2, and x3, and a mediator y. Initially, we obtained an excellent fitting model by allowing y to partially mediate the influence of each exogenous predictor (x1, x2, and x3) on z. Now, across several ethnic groups, we're attempting to impose an equality constraint on the path coefficient from y to z. The model statement reads as follows:

MODEL:
y ON x1 x2 x3;

z ON y (1);

z ON x1;

z ON x2;

z ON x3;

Given the use of WLSMV, we've employed the DIFFTEST option to obtain the chi-square difference test. Appropriately, we've tested the less restrictive model first, saved the results using SAVEDATA, and then estimated the constrained model. Surprisingly, however, the output does not include the difference test, instead reporting that the constrained model is not nested within the original. As far as we can tell, this is inaccurate. Also, as specified in the input, Mplus correctly imposes the equality constraint across ethnic groups for the relationship of y to z, with all remaining effects estimated as requested.

Are we incorrect in assuming the restricted model is nested within the original model?

Linda K. Muthen posted on Tuesday, May 01, 2007 - 2:11 pm

You need to send the two complete outputs and your license number to support@statmodel.com.

Katherine A. Johnson posted on Monday, June 25, 2007 - 8:30 am

I am running a path model with multiple groups (with both dichotomous and continuous endogenous variables using WLSMV). I am interested in testing for gender differences in individual regression coefficients. I know that I can constrain all regression paths to be equal between groups and then compare this model to the model without these constraints. This will tell me if there is a significant difference in the fit of the path model by gender. In order to test for structural invariance of individual paths however, do I have to run separate models for each? I would be running 23 models and doing difference testing for each.

Thank you!

Linda K. Muthen posted on Monday, June 25, 2007 - 8:42 am

You could do that or you could use MODEL TEST. See the user's guide for more information about MODEL TEST.

Katherine A. Johnson posted on Monday, June 25, 2007 - 8:57 am

I'll look into that. Thank you.

Bram Foubert posted on Wednesday, July 18, 2007 - 6:42 am

I'm running a multigroup analysis with covariates. Apparently, Mplus returns an error (and does not estimate the model) whenever the variance of one of the covariates in one of the groups is zero. However, this zero-variance is not necessarily a problem as long as I pool the coefficient of that covariate across groups. So, is there a way to "force" Mplus to estimate the model. Thanks in advance.

Linda K. Muthen posted on Wednesday, July 18, 2007 - 11:03 am

You can use the VARIANCES=NOCHECK; option of the DATA command to avoid stopping for zero variance.

Bram Foubert posted on Wednesday, July 18, 2007 - 11:52 am

Thanks a lot for your prompt reply!!

Bram Foubert posted on Thursday, July 19, 2007 - 6:30 am

Thanks again for your answer: by including the VARIANCES=NOCHECK; option, the model starts running. However, the procedure apparently still encounters singularities because of group-specific operations (again, in one of the groups one of my covariates has zero variance). Can I somehow exclude the covariate from the group where it has zero variance and still keep the covariates' coefficients equal across groups? I think that would solve the problem.

Linda K. Muthen posted on Thursday, July 19, 2007 - 6:51 am

You can try fixing the coefficient to zero in the groups where it has no variance, for example,

y ON x@0;

If this does not work, please send your input, data, output, and license number to support@statmodel.com.
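
Put together with the VARIANCES option mentioned earlier, the suggestion might look like the sketch below. The file name, variable names, and group labels are hypothetical, and group g3 is assumed to be the one in which x2 has zero variance.

```mplus
DATA:
  FILE = mydata.dat;            ! hypothetical file name
  VARIANCES = NOCHECK;          ! do not stop for a zero-variance covariate
VARIABLE:
  NAMES = y x1 x2 g;            ! hypothetical variable names
  GROUPING = g (1 = g1 2 = g2 3 = g3);
MODEL:
  y ON x1 (1)                   ! x1 coefficient equal across groups
       x2 (2);                  ! x2 coefficient equal across groups
MODEL g3:
  y ON x2@0;                    ! overrides the equality in g3 only
```

The group-specific statement replaces the equality for x2 in g3, so the x2 coefficient stays pooled across the remaining groups.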

Linda posted on Thursday, October 11, 2007 - 9:02 am

I had a question about interpreting findings from multiple sample SEM investigating structural paths. I am running multiple sample SEM using intervention types as groups (Control, TPC, TMI, and TPC+TMI). And as an obvious approach, I am using the control group as my reference group when building the multiple sample SEM. Here is my question. When a structural path shows that it's not different across groups, is that in reference to the control group only? Is multiple sample SEM allowing me to see the differences in the paths between TPC vs. TMI, TPC vs. TPC+TMI, and TMI vs. TPC+TMI? If so, how does that work given that I am specifying a reference group?

Thanks in advance!

Linda

Linda K. Muthen posted on Thursday, October 11, 2007 - 10:12 am

I'm not sure what you mean by using the control group as a reference group. You may want to use MODEL TEST. See the user's guide.

Linda posted on Thursday, October 11, 2007 - 10:48 am

Yes, you are right. I got confused after reading an article. It's all clear now. I also have another question. To test for group invariance, how would I code my groups? Does this make sense control=0, TPC=1 TMI=2 and TPC+TMI=3? Also, doesn't the numeric coding imply a linearity of the groups?

Linda K. Muthen posted on Thursday, October 11, 2007 - 2:42 pm

I'm not sure what you mean by code your groups. Are you referring to the values in the GROUPING option?

Linda posted on Thursday, October 11, 2007 - 5:09 pm

Yes. The values in the grouping option.

Linda K. Muthen posted on Thursday, October 11, 2007 - 5:29 pm

These are the values of your grouping variable. For example for the variable gender, if 0 represents males and 1 represents female, you would give the label males to 0 and females to 1.

Linda posted on Friday, October 12, 2007 - 1:28 am

Yes...the grouping option. By assigning numbers to the groups, it gives me the impression that the values imply linearity.

Linda posted on Friday, October 12, 2007 - 1:36 am

Please ignore the posting above. For some reason, I couldn't read your response until I posted the same question again.

So when I have four groups and I am assigning values for each one of those groups, the numbers 0, 1, 2, 3, do not mean that the value 3 is 3x as big as 1, but that 3 is group 3...is this correct?

Thanks in advance!

Linda K. Muthen posted on Friday, October 12, 2007 - 6:29 am

The numbers tell the program how to divide the data into groups.
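
A minimal sketch of what this looks like in the input, with hypothetical variable and label names:

```mplus
VARIABLE:
  NAMES = y x cond;             ! hypothetical variable names
  GROUPING = cond (0 = control 1 = tpc 2 = tmi 3 = tpctmi);
```

The values 0-3 only identify which observations belong to which group. The labels after the equal signs are then used to refer to a group, for example in a group-specific MODEL tpc: command.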

Linda posted on Friday, October 12, 2007 - 8:02 am

Great! It's all clear now. Thank you very much!

Linda posted on Wednesday, November 28, 2007 - 8:53 am

I am conducting Multi-sample SEM on 4 groups. I am investigating structural paths between 5 variables (1 LV and 4 OV). I would like to build my model constraining first the LV before I constrain the structural paths.

Here is my question. Do I need to constrain the loadings, residual variances, and means? or could i just constrain the loadings?

Linda K. Muthen posted on Wednesday, November 28, 2007 - 2:36 pm

The first step is to establish measurement invariance of the latent variable. How to do this is described in Chapter 13 of the user's guide. Only if the latent variable is the same construct in all groups does it make sense to make comparisons of the structural parameters.

Linda posted on Thursday, November 29, 2007 - 8:25 am

Thank you for your reply.

I did establish measurement invariance first. And I wanted to take a step further to establish the structural parameters. To do that, do I keep the groups constrained on the loadings only or residual variances and means as well?

Linda posted on Thursday, November 29, 2007 - 9:22 am

I am running the model below where x1, x2, and x3 are exogenous predictor variables, m1, m2, m3 and f1 are mediators, and y is the outcome variable.

MODEL:
f1 by y1 y2 y3;
m1 on x1 x2 x3;
f1 on m1;
m2 on f1;
m3 on m2;
y on m3;

I get the following error message. how could I fix this? Thanks in advance.

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE
TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE CONDITION NUMBER IS 0.338D-10. PROBLEM INVOLVING PARAMETER 15.

Linda K. Muthen posted on Friday, November 30, 2007 - 8:08 am

Once you have established measurement invariance, you should leave the equalities in place. We don't use equalities of residual variances.

Regarding the error message, please send your input, data, output, and license number to support@statmodel.com.

Jungeun Lee posted on Friday, November 30, 2007 - 4:49 pm

I am working on a multiple group (males and females) SEM. I'd like to test whether or not each individual coefficient in the structural part differs by males and females. I used MODEL TEST to test this. Here is my mplus input.

MODEL:
depr by dep1 dep2 dep3 dep4;
hope by pos1 pos2 ;
anxiety by anx1 anx2;
hope on anxiety (p1);
depr on hope (p2);

MODEL female: depr by dep2 dep3 dep4;
hope by pos2 ;
anxiety by anx2;
hope on anxiety(p3);
depr on hope (p4);

MODEL TEST:
0=p1-p3;
0=p2-p4;

The program gave me one wald test result (value=.343 pvalue=.8425).

Q1. What does this mean? Does it mean that p1=p3 & p2=p4?

Q2. I expected more than 1 test results from the above analysis-- like the first test result corresponds to p1=p3 and the second test corresponds to p2=p4... Is there a way to do this in mplus? Or, do I have to run separate models for each?

Linda K. Muthen posted on Friday, November 30, 2007 - 5:55 pm

You need to do a separate run for each test. A large p-value means you cannot reject equality of the parameters.
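As a rough check on the Wald result quoted above, the sketch below recomputes the p-value in plain Python. It assumes the test has 2 degrees of freedom (one per MODEL TEST restriction; the df is not shown in the post), and the helper name `chi2_sf_even_df` is made up for this illustration. It uses the closed-form upper tail of the chi-square distribution, which is valid only for even df.

```python
import math

def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability P(X > x) for a chi-square variate with even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form is only valid for positive even df")
    half = x / 2.0
    term = 1.0   # (x/2)^0 / 0!
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i   # builds (x/2)^i / i! incrementally
        total += term
    return math.exp(-half) * total

wald_value = 0.343   # value reported in the post
p_joint = chi2_sf_even_df(wald_value, df=2)   # assumed df = 2 restrictions
print(round(p_joint, 4))
```

The result agrees with the reported p-value of .8425 up to rounding of the Wald value, which is consistent with a single joint test of both restrictions rather than two separate tests.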

Jörg von Irmer posted on Thursday, December 20, 2007 - 6:44 am

I have a question about how to report a multigroup SEM in a paper:

Some of the effects in my model are set equal for both men and women; some are not. When a model is presented in a paper, standardized estimates are generally reported. But for the effects constrained to be equal, the standardized estimates differ across groups, while the unstandardized ones do not.

If I want to report the standardized estimates, which estimate do I choose: that of the male group or that of the female group?

Linda K. Muthen posted on Thursday, December 20, 2007 - 9:47 am

I would report both raw and standardized coefficients and standardized for both males and females.

Erika Wolf posted on Monday, January 28, 2008 - 10:57 am

I'm running a series of CFAs to examine measurement invariance across 2 groups. I'm using categorical indicators and using the WLSMV estimator and the DIFFTEST function to test the nested models.

I'm first testing for equal form across the groups, allowing the factor loadings and thresholds to be freely estimated in both groups and setting the scale factor to 1 in the 2nd group.

In the second model, I'm testing for equal factor loadings, so I've left in all the Mplus defaults and have not specified anything for the 2nd group.

I'm confused, though, because my equal factor loadings model has fewer DF than my equal form model, when I would expect this to be the other way around. Is this simply a function of the DF being estimated with the WLSMV estimator? Or are there additional defaults that I should override in my equal factor loadings model?

Thanks,
Erika

Linda K. Muthen posted on Monday, January 28, 2008 - 11:55 am

With WLSMV, the only value that you should look at is the p-value. If you want to look at degrees of freedom in the traditional way, use WLS or WLSM to see if they are behaving as you would expect. See also the section in Chapter 13 where the set of models to test measurement invariance for categorical outcomes is described.

Dale Glaser posted on Tuesday, February 05, 2008 - 12:35 pm

Hi Linda and Bengt, I have a problem that seems easy enough to rectify but is proving to be intractable! I am testing a multigroup (g = 2) CFA with three constructs and three items per construct. When I test the model for the full sample, I get an unimpressive fit (CFI = .82, RMSEA = .118, etc.); however, when I run the multigroup model, whether I constrain the loadings to be equal or not, I get an error message that "the standard errors of the model parameter estimates could not be computed.....". When I check the offending parameter, it is a parameter in the PSI matrix and has a negative SE. After checking for collinearity, multivariate normality, etc., there didn't seem to be any major problems. Interestingly, when I run an EFA for each group there is a very clean factorial solution for each group (though I am well aware of EFA vs. CFA differences in results). After trying various fixes (e.g., constraining the elements in the PSI matrix to 0) I was only able to attain convergence when I used the parameter estimates from the full sample as fixed estimates, and as expected fit was horrible (CFI = .65, RMSEA = .122). Unfortunately, due to privacy issues I can't share the data as Linda generously offers. So, before abandoning this model, any recommendations for a negative SE in the PSI matrix even though the usual culprits (e.g., singularity) are not an apparent issue?
Thank you....Dale

Linda K. Muthen posted on Tuesday, February 05, 2008 - 1:56 pm

Have you run the CFA model for the two groups separately?

Dale Glaser posted on Tuesday, February 05, 2008 - 3:25 pm

yes I did Linda, and I was able to obtain convergence for one group but fit was abysmal (CFI approx .8, RMSEA approx, .12, etc.)......and I believe that for the other group I had to fix the PSI estimate to obtain convergence (and again fit was problematic).......what I find intriguing is the factorial solution for EFA (whether orthogonal or oblique rotation) was very unambiguous (i.e., as postulated) for both groups.......

Linda K. Muthen posted on Tuesday, February 05, 2008 - 4:24 pm

I know you can't send your data but I would like to see the EFA outputs for each group and the CFA outputs for each group. If you had clear EFA results, the CFA's should not fit so poorly. It does not sound like the CFA fits in either group.

Craig Nathanson posted on Tuesday, February 05, 2008 - 7:57 pm

Hello all,

I recently conducted several analyses where I compared the pattern of results across correlation matrices of mostly personality data. Specifically, I was interested in whether the pattern of results in group A (e.g., men) was similar to that seen in group B (e.g., women). The procedure yielded the following fit indices:

-Chi-square
-Standardized RMR
-RMSEA
-Population Gamma
-Adjusted Population Gamma
-McDonald Noncentrality index
-Population noncentrality index

For all but the first two I have 90% confidence intervals as well.

My sample sizes are, by SEM standards, small (<=250). I have read the Hu and Bentler paper but am still a little unclear as to what the "appropriate" cutoffs are for assessing model fit.
Any suggestions? How might the confidence intervals sort out this issue?

Any and all suggestions would be greatly appreciated -- thanks in advance!

Linda K. Muthen posted on Wednesday, February 06, 2008 - 10:31 am

From the indices you give, it looks like you are not using Mplus. These indices are tests of overall fit of the model not the comparison of groups. Chi-square difference testing can be used to test across group differences.

Raffaele Guetto posted on Saturday, February 09, 2008 - 3:22 am

Hello all,

I have a problem with a multiplegroup SEM (two groups). I have a set of mixed observed variables (continuous and categorical), so my input is a matrix of Polychoric/polyserial/pearson correlations. However I can't use WLS estimator (and calculate Asymptotic Covariances Matrix), maybe because of little N (500) in one group.
My questions are:

a) my model converges with a ML estimation, with quite good fit; anyway, is that correct?
b) If I want to compare two structural parameters (or test factorial invariance), what can I do if I used a correlation matrix?

Thanks!

Linda K. Muthen posted on Saturday, February 09, 2008 - 10:12 am

There are a couple of issues here. If you are doing a multiple group analysis using a correlation matrix as input, then you must be telling Mplus it is a covariance matrix or this would not be allowed. If you have a combination of continuous and categorical dependent variables, you need to use raw data in Mplus.

Susan Seibold-Simpson posted on Sunday, March 16, 2008 - 9:32 am

Hi Linda and Bengt-I am working on measurement invariance for my model and have a question about constraining means to 0. According to UG:
MODELS FOR CONTINUOUS OUTCOMES...
1. Intercepts, factor loadings, and residual variances free across groups; factor means fixed at zero in all groups
2. Factor loadings constrained to be equal across groups; intercepts and residual variances free; factor means fixed at zero in all groups...

This would be the means for the latents only and be indicated by [var@0] for the second group? Should I also set means for observed exogenous variables to 0 as well? Thanks, Sue

Linda K. Muthen posted on Sunday, March 16, 2008 - 10:01 am

It is only the latent variable, factor, means that are fixed to zero.

Susan Seibold-Simpson posted on Sunday, March 16, 2008 - 10:49 am

Thank you Linda. Sue

Susan Seibold-Simpson posted on Sunday, March 16, 2008 - 1:07 pm

Hi Linda and Bengt:
Still working on measurement invariance.

MODELS FOR CONTINUOUS OUTCOMES

2. Factor loadings constrained to be equal across groups; intercepts and residual variances free; factor means fixed at zero in all groups

I originally ran this model with the factor means default and it ran fine and I was able to achieve partial metric invariance. However, when I run the model with the factor means at zero for all groups, I get the following error message:
THE MODEL ESTIMATION TERMINATED NORMALLY

THE CHI-SQUARE DIFFERENCE TEST COULD NOT BE COMPUTED BECAUSE THE H0 MODEL MAY NOT BE NESTED IN THE H1 MODEL. DECREASING THE CONVERGENCE OPTION MAY
RESOLVE THIS PROBLEM.

My analysis syntax was:
ANALYSIS:
DIFFTEST = 'C:\derivh1_white16_1.dat';
TYPE= MEANSTRUCTURE COMPLEX MISSING H1;
CONVERGENCE = .0001;
ITERATIONS = 20000;

Can you tell me what I am doing wrong? Thanks, Sue

Linda K. Muthen posted on Sunday, March 16, 2008 - 1:12 pm

Please send the necessary files and your license number to support@statmodel.com.

Susan Seibold-Simpson posted on Sunday, March 16, 2008 - 3:09 pm

Dear Linda-
I'm delighted to say that I figured out my syntax error on my own. Thanks for your quick response. I appreciate your availability. Sue

Veronique Van ACker posted on Saturday, May 10, 2008 - 3:02 pm

Hello all,

I'm running a two-group path analysis, but I've a major problem... I don't obtain an output !

As mentioned in the user's guide, I first specified the H1-model (unconstrained model) with DIFFTEST-option in the SAVEDATA-command. That turned out well (output file was OK).

Second, I specified the H0-model (fully constrained) in which all regression coefficients are defined as equal between both groups. This is done by specifying the DIFFTEST-option in the ANALYSIS-command. The syntax for this H0-model is provided below. The output file only mentions that reading the input terminated normally. However, no other information is provided in the output (no model results, no Chi² difference test, ...). What can cause this problem?

DATA: FILE IS C:\AAG\data AAG test.dat;

VARIABLE:
NAMES ARE ...
USEV ARE ...
MISSING ARE ...
CATEGORICAL = cu;
GROUPING IS tour (0=worktour 1=complextour);

ANALYSIS: PARAMETERIZATION = THETA;
DIFFTEST = C:\AAG\deriv test.dat;

MODEL: co2 ON sx dl i1 i2 i3 i4 hbi;

cu ON dl d2 d3 d4 d5 co2;

co2 on sx (1);
co2 on dl (2);
co2 on i1 (3);
co2 on i2 (4);
... and so on ...

Linda K. Muthen posted on Saturday, May 10, 2008 - 3:47 pm

Please send your input, data, output, and license number to support@statmodel.com so we can see what the problem is.

Veronique Van ACker posted on Tuesday, May 13, 2008 - 1:42 pm

Dear Linda,

Thanks for your help, but I've managed to solve the problem. The models ran successfully and I obtained the outputs.

anja schüle posted on Thursday, June 05, 2008 - 6:08 am

I confirmed a big SEM model with continuous variables.
Now, I am trying to do a two-group SEM analysis within this model in addition:
Hypothesizing that 7 of the 14 betas vary across the two groups
(but the other 7 do not, and the gamma doesn't either),
is it right to test my seven hypotheses by first running the model for both groups with the "grouping" command, specifying only the general model after "Model:" without any restrictions, and then, in a second run, doing the same again but constraining one beta to be equal across groups by writing:
f4 on f1 (1)
(So I would have to estimate this second model 7 times, each time constraining only 1 beta,
and then compare the X² of this constrained run with the X² of the unconstrained run to see if the difference is significant?)

Or is it better to constrain all betas and gammas across groups in the first run, and then, in the second run, to set only one beta free in each run? (And compare the X² of the completely constrained model with the X² of the model where only one beta is set free?)

Thanks a lot in advance!

Linda K. Muthen posted on Thursday, June 05, 2008 - 7:53 am

I don't think it matters.

Michael S. Businelle, Ph.D. posted on Thursday, June 26, 2008 - 12:04 pm

Hello.

I have tested a complex model (6 latent variables; 22 observed variables) and obtained acceptable model fit for the data. I ran subsequent multiple groups analyses for each of the 3 race/ethnicities within the dataset (n = 140 for each race/ethnicity). The model fit was excellent for two of the groups, but unacceptable for the third group. How do you suggest I proceed? Should I scrap the omnibus model and develop individual models for each race/ethnicity? Should I accept the omnibus model for the two races that have good fit and develop a different model for the third race?

I have been unable to find guidance on this issue? Any help (advice, references) you could provide would be very appreciated.

Linda K. Muthen posted on Friday, June 27, 2008 - 9:36 am

It does not make sense to put groups together if the same model does not fit the data well for each group. Determining this is the first step in testing for measurement invariance. Once this has been established, then measurement invariance across the groups can be tested. Only then does it make sense to combine the groups. You can search the literature for measurement invariance for more information and also see the following papers:

Muthén, B. (1989). Factor structure in groups selected on observed scores. British Journal of Mathematical and Statistical Psychology, 42, 81-90.

Muthén, B. (1989). Multiple-group structural modeling with non-normal continuous variables. British Journal of Mathematical and Statistical Psychology, 42, 55-62.

Angela Bryan posted on Wednesday, July 02, 2008 - 12:10 pm

I have a question about power analysis for a multiple group SEM where we plan to evaluate the mediated effects of an intervention on drinking outcomes in Caucasian versus Hispanic adolescents. I have conducted the power analysis in Mplus on the overall model and have found that with 200 subjects I have power .75 and with 250 subjects I have power .84. Given this information, how do I determine what sample size is needed for each group? Do I simply double the sample size (and thus, I would need n=250 Caucasians and n=250 Hispanics)? I've read a few papers including Muthen & Muthen 2002 from the website and can't seem to find the answer to this question. The most I can find about power in multiple group SEM is that power is higher if group sizes are equal. Any help would be much appreciated!

Linda K. Muthen posted on Thursday, July 03, 2008 - 10:30 am

I assume that you are going to compare the two groups because you think they are different. I would do a separate power study for each group.

Angela Bryan posted on Thursday, July 03, 2008 - 10:33 am

Thanks so much for your response. Yes, we hypothesize structural paths that will be different in the two groups. So I should do two different power analyses? If the answer is that I were to need 100 in one group and 200 in another, doesn't this compromise power for the 1 df chi-square difference tests for whether a particular path is, indeed, different between the groups?

Linda K. Muthen posted on Thursday, July 03, 2008 - 4:14 pm

I don't think this would compromise the multiple group power test. However, this is a test of the equality of two parameters so the last column is not power because the parameter is not being compared to zero. You would need to use MODEL CONSTRAINT to create a new parameter that is the difference between the two parameters and see the last column for that.

Andrea Lawson posted on Wednesday, October 01, 2008 - 12:40 pm

Hi there - I have a SEM model with one categorical predictor (two levels). This predictor represents 2 different experimental conditions that my participants were in (between subjects design). My question is, am I able to simply dummy code this categorical variable and run the usual SEM, or do I need to run a different kind of SEM in order to analyze this?
Thanks very much for your help, Andrea

Linda K. Muthen posted on Wednesday, October 01, 2008 - 2:13 pm

A dummy covariate can be included in an SEM model. See Example 5.8 in the Mplus User's Guide.

Angela D'Angelo posted on Sunday, October 05, 2008 - 11:10 am

Hello,

I am estimating a structural model to look at whether father involvement mediates the relationship between being an immigrant child and that child's cognitive outcomes.

The mediator of father involvement is a latent variable. I am first running a CFA before proceeding to the path analysis portion. I have determined that I do not have measurement invariance between resident and non-resident fathers on the latent variable of Father Involvement, although the CFA model shows an acceptable fit for each group. Theoretically, this makes a lot of sense, since what fathers do when they live with or away from their children should vary, that is, I expected to find noninvariance.

I am struggling because now that I have established noninvariance, is it ok to go ahead and estimate the larger path models separately by group? Or should I estimate one large model across both groups and allow all the parameters of the latent variable Father Involvement to vary across groups? Is this even possible?

Thanks.

Andrea Lawson posted on Sunday, October 05, 2008 - 11:33 am

Thanks so much for getting back to me Dr. Muthen. I think I need a bit more clarification though (forgive me - I am new at this whole SEM thing). I was wondering if a dummy coded, 2-level categorical predictor (not a covariate) can be included in a regular SEM analysis. And I checked example 5.8 and it doesn't seem to refer to a categorical predictor...perhaps I am misinterpreting it? Thanks so much, sorry for the repeat postings, Andrea

Linda K. Muthen posted on Sunday, October 05, 2008 - 12:00 pm

Andrea: The predictor can be continuous or categorical as in regular regression.

Andrea Lawson posted on Sunday, October 05, 2008 - 6:20 pm

Wonderful! Thanks so much for your speedy reply, Andrea

Angela D'Angelo posted on Monday, October 06, 2008 - 2:13 pm

Linda K. Muthen posted on Monday, October 06, 2008 - 4:09 pm

If you do not have invariance of the factor across groups, then you should look at the two groups separately. You cannot compare the factor parameters across the two groups in a meaningful way without measurement invariance.

Heejung Chun posted on Thursday, November 27, 2008 - 8:09 pm

Hello,

I am conducting a multiple group analysis (MGA) with a second-order confirmatory factor model. The MGA is established in five steps. The five steps are the following:

1. Configural invariance (released the intercepts of the indicators along with releasing all other parameters)
2. Factor Loading invariance of Indicators
3. Factor Loading invariance of First-order factors
4. Intercept invariance of Indicators
5. Intercept invariance of First-order Factors

In my understanding the CFIs should decrease as I constrain factor loadings and/or intercepts between groups. However, my results showed greater CFIs as I constrained some parameters between groups. Is this right?

I would appreciate your answer.

Thank you.

dena posted on Friday, November 28, 2008 - 8:27 am

Hi,

I would like to run autoregressive models separately for boys and girls because the correlation matrix clearly suggests that our variables of interest are significantly correlated among girls but not among boys. My question is: can I (and if yes, how) justify my decision to run models separately for boys and girls? I read somewhere that we can test whether the variance-covariance matrix is the same for boys and girls and, if not, this could justify the split of the analyses. I'm not sure if this is right or how to do that. I constrained all the correlations to be equal for boys and girls. The chi-square with 28 df = 53.17. Can I compare it to the base model (chi-square = 0) and say that the constrained model is "significantly worse" (critical chi-square for 28 df = 41.34)?

I also did multi-group analyses. Even though some coefficients are significant and quite different for girls and boys, the difference in the chi-square when I look at the constrained vs. unconstrained models is not significant. Is it normal?

Thank you very much for your time.

Bengt O. Muthen posted on Friday, November 28, 2008 - 4:42 pm

The most pointed analysis would be the multi-group analysis where the auto-regressive model is used for both genders and runs with full equality and full inequality are used to form the chi-square difference test.
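For the figures quoted in the question (chi-square = 53.17 on 28 df against a critical value of 41.34), the tail probabilities can be checked with a short Python sketch. It assumes the statistic is referred to an ordinary chi-square distribution; the helper `chi2_sf_even_df` is a hypothetical name, and its closed form applies only because 28 is even.

```python
import math

def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability P(X > x) for a chi-square variate with even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form is only valid for positive even df")
    half = x / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i   # (x/2)^i / i!
        total += term
    return math.exp(-half) * total

# Figures quoted in the post above
p_value = chi2_sf_even_df(53.17, 28)        # observed statistic
p_at_critical = chi2_sf_even_df(41.34, 28)  # quoted .05 critical value
print(f"p at 53.17: {p_value:.4f}")
print(f"p at 41.34: {p_at_critical:.4f}")
```

The second figure comes out at about .05, confirming the quoted critical value, and the first is well below it, so equality of the correlations would be rejected at the .05 level under these assumptions.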

Bengt O. Muthen posted on Friday, November 28, 2008 - 4:57 pm

Regarding the Chun post, the loglikelihood should get lower with more restrictive models (chi-square higher), but I don't think CFI needs to follow this pattern.
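A small numeric illustration of this point, using the commonly cited CFI definition and entirely made-up chi-square values (none of these numbers come from the thread): when the added constraints cost less chi-square than the degrees of freedom they gain, CFI rises even though chi-square rises.

```python
def cfi(chi2_model: float, df_model: int, chi2_base: float, df_base: int) -> float:
    """Comparative fit index from model and baseline chi-square values."""
    d_model = max(chi2_model - df_model, 0.0)          # model noncentrality
    d_base = max(chi2_base - df_base, d_model, 0.0)    # baseline noncentrality
    if d_base == 0.0:
        return 1.0
    return 1.0 - d_model / d_base

# Hypothetical values: same baseline model for both fits
chi2_base, df_base = 1000.0, 90
cfi_free = cfi(150.0, 100, chi2_base, df_base)          # less restrictive model
cfi_constrained = cfi(158.0, 112, chi2_base, df_base)   # 12 extra constraints
print(round(cfi_free, 4), round(cfi_constrained, 4))
```

Here the constrained model has the higher chi-square (158 vs. 150) but also the higher CFI, because its chi-square increased by only 8 for 12 added degrees of freedom.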

Bander Ahmed posted on Wednesday, December 03, 2008 - 10:48 am

Hi, I have two groups: one has 232 respondents (response rate 50%) and the other has 386 respondents (response rate 80%). Does this affect the comparison results?

Linda K. Muthen posted on Wednesday, December 03, 2008 - 11:38 am

I would be concerned about the 50% response rate. This is very low. It could affect the comparison.

Bander Ahmed posted on Thursday, December 04, 2008 - 6:17 am

Would you kindly tell me what effect it might have, and what the possible solutions are? Many thanks

Linda K. Muthen posted on Thursday, December 04, 2008 - 3:52 pm

If the missingness is due to different reasons in the two groups, any group comparisons would be biased. You should investigate why the missingness occurs. The only solution would be to include variables in the model that relate to missingness. But with so much missing data in the one group, some may not find your results meaningful.

David Zimmer posted on Friday, December 05, 2008 - 2:22 am

Hello, I've got the following question...

I have two groups and I want to fix all factor loadings and path coefficients across all groups. Therefore, I have the following input:

MODEL:

CP by cp1 cp3 (1);
AC by ac3 ac4 ac5 ac6 (2);
NC by nc4 nc5 nc7 (3);

CP on AC (4);
CP on NC (5);

But, when I look into the output, loadings and path coefficients are only equal in the model result section. If I compare the values in the stdyx standardization section, the factor loadings and path coefficients differ. I don't understand this, as they should be also equal - I mean that is what I wanted to fix in my input commands. When I run the same stuff in LISREL, the standardized values are equal...

What is wrong?

Thanks!!!!

Linda K. Muthen posted on Friday, December 05, 2008 - 6:18 am

The standardized estimates are different because they are standardized using each group's standard deviations, not the overall standard deviations.
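To make this concrete, here is a minimal Python sketch with hypothetical numbers. It assumes the usual fully standardized slope, b * SD(x) / SD(y), and invented group standard deviations; nothing here is taken from the poster's output.

```python
def standardized_slope(b: float, sd_x: float, sd_y: float) -> float:
    """Fully standardized regression slope: b * SD(x) / SD(y)."""
    return b * sd_x / sd_y

b_common = 0.5   # unstandardized slope held equal across groups (hypothetical)

# Hypothetical group-specific standard deviations of predictor and outcome
std_group1 = standardized_slope(b_common, sd_x=1.0, sd_y=2.0)
std_group2 = standardized_slope(b_common, sd_x=1.5, sd_y=2.0)
print(std_group1, std_group2)
```

The same unstandardized coefficient yields different standardized values as soon as the groups differ in their variances, which is why an equality constraint shows up only in the unstandardized results.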

nina chien posted on Tuesday, December 09, 2008 - 10:42 am

I am doing multiple-group analysis with chi-square difference tests, and my estimator is MLR. I referred to your page:

http://www.statmodel.com/chidiff.shtml

I am a little confused about the following:

1. Estimate the nested and comparison models using MLR. The printout gives loglikelihood values L0 and L1 for the H0 and H1 models, respectively, as well as scaling correction factors c0 and c1 for the H0 and H1 models, respectively. For example,

L0 = -2,606, c0 = 1.450 with 39 parameters (p0 = 39)
L1 = -2,583, c1 = 1.546 with 47 parameters (p1 = 47)

****************************************
Is the L0 in the instructions the H0 Value of the *nested* model? And the c0 is the scaling correction factor for the nested model?

And is the L1 in the instructions the H0 Value of the *comparison* model? And the c1 is the scaling correction factor for the comparison model?

Thank you for your help.

Linda K. Muthen posted on Wednesday, December 10, 2008 - 10:28 am

Is the L0 in the instructions the H0 Value of the *nested* model? And the c0 is the scaling correction factor for the nested model?

Yes.

And is the L1 in the instructions the H0 Value of the *comparison* model? And the c1 is the scaling correction factor for the comparison model?

Yes.
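The example numbers quoted in the question can be worked through with the formulas given on the chidiff page: cd = (p0*c0 - p1*c1)/(p0 - p1) and TRd = -2*(L0 - L1)/cd. The Python sketch below is only an illustration; the variable names are mine, and the p-value helper uses a closed form that works here because the difference in parameters (8) is even.

```python
import math

def chi2_sf_even_df(x: float, df: int) -> float:
    """Upper-tail probability P(X > x) for a chi-square variate with even df."""
    if df <= 0 or df % 2 != 0:
        raise ValueError("closed form is only valid for positive even df")
    half = x / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total

# Nested (H0) model: loglikelihood, scaling correction factor, parameters
L0, c0, p0 = -2606.0, 1.450, 39
# Comparison (H1) model
L1, c1, p1 = -2583.0, 1.546, 47

cd = (p0 * c0 - p1 * c1) / (p0 - p1)   # difference test scaling correction
trd = -2.0 * (L0 - L1) / cd            # scaled chi-square difference
df_diff = p1 - p0
p_value = chi2_sf_even_df(trd, df_diff)
print(f"cd = {cd:.3f}, TRd = {trd:.3f}, df = {df_diff}, p = {p_value:.4f}")
```

With these inputs cd is 2.014 and TRd is about 22.84 on 8 degrees of freedom.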

dena posted on Friday, January 23, 2009 - 7:36 am

Hi,

I'm doing multi-group analyses on autoregressive cross-lagged paths with two variables and four time-points.

I first did my global model (both genders), in which I had to add two correlations between residuals to improve model fit.

Then, I looked whether the fit was good among girls and boys. Fits are ok, but I noticed that one of the correlated residuals is not significant among girls, whereas the other is not significant among boys (i.e., one is significant in each group, but it's not the same).

I also noticed that some paths are significant among girls but none are significant among boys.

My questions are:

- Is it necessary to go further in the analyses since the coefficients are only significant among girls and not among boys?
- If yes, if I constrain only the significant paths to be equal among boys and girls, should the difference in the chi-square detect these differences?
- If not, what could explain it?

Thank you very much for your time.

Bengt O. Muthen posted on Friday, January 23, 2009 - 5:49 pm

You can have coefficients significant in one group and not the other and still not be able to reject equality across groups. For instance, point estimates of say

0.2 for girls
0.15 for boys

may be significant for girls and insignificant for boys, while the two are not significantly different.

I hope that's what you were asking.

dena posted on Monday, January 26, 2009 - 8:54 am

From prior dena comment:

Yes, it is. Then, what can I conclude from these findings?

Can I still report the coefficients for boys and girls and mention that even though the coefficients were significant for girls but not for boys, we could not detect a significant differences between the two?

What does it mean?

Thank you again for your precious help.

Bengt O. Muthen posted on Monday, January 26, 2009 - 4:58 pm

Answer to your second question - yes.

Answer to your third (last) question - if you cannot reject equality of the coefficients across gender you would want to consider if this common coefficient is significant; perhaps it is. That would make perfect sense I think.

dena posted on Sunday, February 01, 2009 - 1:26 pm

The common coefficients (I assume these are the coefficients for the total sample) are not always significant...

What conclusions can I then draw?

Example:

Coefficient for total = .09, z = 1.88
Coefficient for girls = .13, z = 2.01
Coefficient for boys = -.02, z = -0.281

If I constrain only this coefficient to be equal among boys and girls, the delta chi-square is not significant.

Thanks again!

Bengt O. Muthen posted on Monday, February 02, 2009 - 7:55 am

I would report what you see - the coefficient for girls is significantly different from zero. The coeff for boys is not. The two coefficients are not significantly different from each other. This is not contradictory - the last statement might be due to the coefficient for boys having a large standard error; the SE for boys plays into the gender difference testing. Perhaps there is too little power to reject gender differences.
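A back-of-the-envelope version of this argument can be run on the figures from the question. The sketch below assumes the two groups are independent and recovers each standard error as estimate / z. It is a Wald-type approximation, not the chi-square difference test from the multi-group run, so it should be read as an illustration only.

```python
import math

def two_sided_p(z: float) -> float:
    """Two-sided p-value for a standard normal test statistic."""
    return math.erfc(abs(z) / math.sqrt(2.0))

# Estimates and z-values as reported in the post
b_girls, z_girls = 0.13, 2.01
b_boys, z_boys = -0.02, -0.281

se_girls = b_girls / z_girls   # standard error implied by estimate / z
se_boys = b_boys / z_boys

# Difference between independent estimates and its standard error
diff = b_girls - b_boys
se_diff = math.sqrt(se_girls**2 + se_boys**2)
z_diff = diff / se_diff
p_diff = two_sided_p(z_diff)
print(f"z for difference = {z_diff:.2f}, p = {p_diff:.3f}")
```

The z for the difference comes out at roughly 1.56, short of the 1.96 needed at the .05 level, so the girls' coefficient can differ significantly from zero while the two coefficients do not differ significantly from each other.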

Kihan Kim posted on Tuesday, February 17, 2009 - 4:00 pm

I am trying to test a multi-group SEM with no constraints on the measurement and structural parts (I do not want any parameter to be constrained).

I have five factors (F1-F5), and the following is the MODEL command. I keep receiving the following error message, and I am not sure what is wrong with the model identification. Could you please advise me?

Error Message:

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL. PROBLEM INVOLVING PARAMETER 73.

Model: F1 by Y1 Y2 Y3;
F2 by Y4 Y5 Y6;
F3 by Y7 Y8;
F4 by Y9 Y10;
F5 by Y11 Y12 Y13 Y14;

F5 on F1 F2 F3 F4;

Model G2:

F1 by Y1 Y2 Y3;
F2 by Y4 Y5 Y6;
F3 by Y7 Y8;
F4 by Y9 Y10;
F5 by Y11 Y12 Y13 Y14;

Bengt O. Muthen posted on Wednesday, February 18, 2009 - 6:12 am

In Model G2 you should not free the first factor indicator loading since that sets the metric of the factor. See the Mplus UG multi-group examples.

Kihan Kim posted on Wednesday, February 18, 2009 - 12:25 pm

I have four exogenous factors predicting one endogenous factor such as:

F1 by Y1 Y2 Y3;
F2 by Y4 Y5 Y6;
F3 by Y7 Y8 Y9;
F4 by Y10 Y11 Y12;
F5 by Y13 Y14 Y15;

F5 on F1-F4;

It appeared that F4 had a negative impact on F5. But the factor indicators for F4 and F5 are all positively correlated.

Is it possible to find a negative path coefficient when the correlations among indicators are all positive?

Thank you.

Linda K. Muthen posted on Wednesday, February 18, 2009 - 5:13 pm

The fact that the factor loadings for both factors are positive does not preclude the factors from having a negative relationship with each other.

Kihan Kim posted on Wednesday, February 18, 2009 - 5:34 pm

Sorry... I think I confused you in my previous question. It was not the factor loadings for both factors that were positive, but the correlations among the indicators were positive.

So, for the following model, the correlations among Y10-Y15 were all positive, and I found a negative impact of F4 on F5. Is this possible? Thank you again..

F1 by Y1 Y2 Y3;
F2 by Y4 Y5 Y6;
F3 by Y7 Y8 Y9;
F4 by Y10 Y11 Y12;
F5 by Y13 Y14 Y15;

F5 on F1-F4;

Linda K. Muthen posted on Wednesday, February 18, 2009 - 5:47 pm

Please send the full output and your license number to support@statmodel.com so I can see exactly what you are doing. Please include SAMPSTAT in the OUTPUT command.

Ina Schoellgen posted on Friday, February 20, 2009 - 5:53 am

Hi,

I was wondering whether it is possible to test for an interaction between an observed and a latent variable (both continuous) within a multiple group analysis, i.e. to test whether the interaction (between two variables) differs between groups (third variable).

Thanks for your help,
Ina

Linda K. Muthen posted on Friday, February 20, 2009 - 6:01 am

Yes, you can use the XWITH option to do this.

Ina Schoellgen posted on Friday, February 20, 2009 - 7:11 am

I just read that interactions with continuous variables require numerical integration. But if I add "ALGORITHM = INTEGRATION", MPLus tells me that "ALGORITHM = INTEGRATION is not available for multiple group analysis".

Linda K. Muthen posted on Friday, February 20, 2009 - 7:21 am

You need to use the KNOWNCLASS option and TYPE=MIXTURE instead of the GROUPING option when numerical integration is involved. If you have further questions on this topic, send them along with your license number to support@statmodel.com.

Ela Polek posted on Friday, February 27, 2009 - 7:30 am

I run Multiple Group SEM in 6 groups with Mplus. I have to compare some specific coefficients across groups, (to test if the influence of some variables on the outcome variable differs across groups). I have already used model-test command in Mplus, which gives Wald Test, but this does not test invariance of specific coefficients across groups.
I know that when comparing coefficients in 2 groups t-test can be used. What test should be used when comparing 6 groups? I will be more than thankful for any advice.

Thank you in advance,
Ela

Linda K. Muthen posted on Friday, February 27, 2009 - 10:21 am

You can test specific coefficients using MODEL TEST, for example,

MODEL:
y ON x1 (p1)
x2 (p2);
MODEL TEST:
0 = p1-p2;

or you can use difference testing of nested models using either chi-square or the loglikelihood.

Dorothee Durpoix posted on Wednesday, April 15, 2009 - 4:18 pm

Hello,

I suspect I have two groups but I'd like to prove it.
I have run ESEM analysis separately on each of the two groups and they do not respond in the same way to my construct: some variables do not load on the same latent factors between the two groups.
But I'd like to go back a step and show on my whole sample (n1 + n2) that the grouping variable has a significant effect on the construct. Could you please tell me how I can assess in Mplus whether the grouping variable has an additive effect on the outcomes' loadings and may also lead to some variables loading on some latents in one group but not in the other group? Would the following syntax take into account the different possible effects of the grouping variable on the construct that I just mentioned?

MODEL:
F1 BY u1-u7(*1);
F2 BY u1-u7(*1);
F1 with F2;
u1-u7 ON Gp;

With F1 and F2, the two common continuous latent factors; u1 to u7, the ordinal outcomes variables; Gp, the binary grouping variable (coded in 0 and 1).

Thank you very much in advance for any input.

Linda K. Muthen posted on Friday, April 17, 2009 - 10:39 am

The model you have specified can determine differences in the intercepts only, not in the factor loadings. If you do a multiple group analysis where the factor loadings are held equal across groups, which is the default, you can look at modification indices to assess measurement invariance.

Dorothee Durpoix posted on Sunday, April 19, 2009 - 7:59 pm

Hi Linda,

I've got 2 questions:
1. Would this ESEM model assess the effect the grouping variable (x1) can have on the loading structure as well as the strength of the loadings?

USEVARIABLES ARE
x1 u1 u2 u3 u4 u5 u6 u7
u1x1 u2x1 u3x1 u4x1 u5x1 u6x1 u7x1;

CATEGORICAL ARE
u1 u2 u3 u4 u5 u6 u7;

DEFINE:
u1x1 = u1*x1;
u2x1 = u2*x1;
u3x1 = u3*x1;
u4x1 = u4*x1;
u5x1 = u5*x1;
u6x1 = u6*x1;
u7x1 = u7*x1;

MODEL:
F1 BY u1-u7x1(*1);
F2 BY u1-u7x1(*1);
F1-F2 ON x1;
F1 with F2;
u4 with u5;

2. The original model has 2 latent factors on which different outcomes (e.g. u1) load according to the group (when I run the analysis separately on the 2 groups). To test the effect of the interaction variables (e.g. u1x1) on the model structure on the whole sample, should I still specify 2 latent factors or more/less? The results show the original ordinal outcomes significantly loading on 1 factor, and the interaction variables on the other. I'm not sure what this is showing.

Linda K. Muthen posted on Monday, April 20, 2009 - 10:04 am

The interaction between the covariate and the items does not get at factor loading invariance. The interaction between the covariate and the factor would do that. The best way to look at factor loading invariance is multiple group analysis. See Example 4 in the Version 5.1 Examples Addendum on the website with the user's guide. See also the Topic 1 course handout and video. Topics 1 and 2 will be taught at Johns Hopkins University in August.

Emma Sterrett posted on Friday, April 24, 2009 - 8:12 am

Hi,

I am comparing a CFA across two groups and keep getting the same fit index statistics for increasingly constrained models. Can you tell me what I'm doing wrong? Here is my code:

(1)
Model:
DOMCON BY AIDEDUC@1 HEALTHCA PROGVIOL SOCSEC;
INTCON BY DEFENSE @1 GATHINTE HOMESEC ECONOTHR MILOTHRN;
DOMCON with INTCON (1);

(2)
Model:
DOMCON BY AIDEDUC@1 HEALTHCA PROGVIOL SOCSEC;
INTCON BY DEFENSE @1 GATHINTE HOMESEC ECONOTHR MILOTHRN;
DOMCON with INTCON(1);
AIDEDUC (2);
HEALTHCA (3);
PROGVIOL (4);
SOCSEC (5);
DEFENSE (6);
GATHINTE (7);
HOMESEC (8);
ECONOTHR (9);
MILOTHRN (10);

(3)
Model:
DOMCON BY AIDEDUC@1 HEALTHCA PROGVIOL SOCSEC;
INTCON BY DEFENSE @1 GATHINTE HOMESEC ECONOTHR MILOTHRN;
DOMCON with INTCON(1);
AIDEDUC (2);
HEALTHCA (3);
PROGVIOL (4);
SOCSEC (5);
DEFENSE (6);
GATHINTE (7);
HOMESEC (8);
ECONOTHR (9);
MILOTHRN (10);
[DOMCON] (11);
[INTCON] (12);

Linda K. Muthen posted on Friday, April 24, 2009 - 9:31 am

Please send the three full outputs and your license number to support@statmodel.com.

Andrea Hildebrandt posted on Wednesday, May 06, 2009 - 1:23 pm

I investigate mean level differences between three age groups in a measurement model with three correlated factors. In the next step I would like to test the between group ability differences in the same factors after controlling for general cognition for example. To do this I modelled a structural model in which those factors are regressed onto general cognition:

f1 on GenCOG;
f2 on GenCOG;
f3 on GenCOG;

I need to compare the means of the residuals of f1, f2, f3 between the groups if I want to test performance differences on those factors after controlling for GenCOG, right? Where in the Mplus output do I find those values? In the residual output I find only parameters for the indicators. TECH4 shows means of the latent variables (also of the endogenous variables), but when I look at those values, I don't think they are the means of the residuals of the endogenous factors. Exactly the same latent means are displayed that I found in the measurement model. But the exogenous variable explains about half of the variance, so the residual means of f1, f2 and f3 should change compared to the measurement model, I think. Could you please advise me?

Thank you for your help!

Linda K. Muthen posted on Thursday, May 07, 2009 - 10:21 am

The parameters you want are the intercepts of f1, f2, and f3 which you will find in the Results section of the output.

michael mccarthy posted on Tuesday, May 12, 2009 - 5:22 pm

How do you tell if the difference between groups is significant when you run a multi-group model?

Linda K. Muthen posted on Wednesday, May 13, 2009 - 9:54 am

You can do this using chi-square or loglikelihood difference testing of two nested models where one model has the parameter of interest free across groups and the other has the parameter constrained to be equal across groups. You can also use MODEL TEST.

Elizabeth Oliva posted on Friday, July 31, 2009 - 11:00 am

I am trying to see if the path analysis model below differs between males and females and am not sure what syntax to use to constrain the paths. I have been looking at syntax that people use but am not sure how to apply it to my model and/or whether I need to prepare my data differently to test for measurement invariance.

Thanks!

VARIABLE: NAMES ARE ID IDYRFAM sex SES ZSES alc2 cn0 gp1 bp1 cn2 gp2 bp2;
USEVARIABLES ARE alc2 cn0 bp1 cn2 bp2;
CLUSTER = IDYRFAM;
ANALYSIS: TYPE = COMPLEX;
MODEL: bp1 bp2 cn2 alc2 ON cn0;
bp2 cn2 alc2 ON bp1;
alc2 ON bp2;
alc2 ON cn2;
bp2 WITH cn2;
OUTPUT: SAMPSTAT STANDARDIZED;
standardized mod(3.84);

Linda K. Muthen posted on Friday, July 31, 2009 - 4:23 pm

Chapter 13 has a section on Equalities in Multiple Group Analysis that should help you. There is a full discussion of the Mplus language for multiple group analysis in that chapter.
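
As a starting point, here is a minimal sketch of the fully constrained model. It assumes sex is coded 1/2, which the post does not state, and the group names are made up. Each path gets its own number so that it is held equal across the two groups but not equal to the other paths, which is why the ON statements are split over several lines:

```
VARIABLE: NAMES ARE ID IDYRFAM sex SES ZSES alc2 cn0 gp1 bp1 cn2 gp2 bp2;
  USEVARIABLES ARE alc2 cn0 bp1 cn2 bp2;
  GROUPING IS sex (1 = male 2 = female);   ! coding assumed
  CLUSTER = IDYRFAM;
ANALYSIS: TYPE = COMPLEX;
MODEL:
  bp1  ON cn0 (1);
  bp2  ON cn0 (2)
          bp1 (3);
  cn2  ON cn0 (4)
          bp1 (5);
  alc2 ON cn0 (6)
          bp1 (7)
          bp2 (8)
          cn2 (9);
  bp2 WITH cn2;
```

Running the same input without the numbers in parentheses leaves the paths free across groups, and the two runs can then be compared. With TYPE = COMPLEX the default estimator is MLR, so the chi-square difference would need the scaling correction factors rather than a simple subtraction.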

Linda posted on Monday, August 24, 2009 - 11:16 am

How does multi-sample SEM account for multiple comparisons when comparing models across multiple groups?

naT posted on Wednesday, August 26, 2009 - 1:41 pm

Hello,

I am modelling path analysis model with all observed variables. However, my TECH1 tells me that there are no parameter specified in NU nor THETA matrices, but instead all are specified in ALPHA and PSI matrices. I am wondering whether I misspecified the model, or is this the default of the mplus? If I have misspecified the model, how can I fix this?

Thank you very much for your help!

Linda K. Muthen posted on Wednesday, August 26, 2009 - 2:40 pm

This is correct. There is no matrix in Mplus for observed regressed on observed so the observed variables are turned into latent variables that are identical to the observed variables. This does not in any way affect the results. It simply moves the parameters from one matrix to another.

naT posted on Wednesday, August 26, 2009 - 4:01 pm

Thank you very much for clarifying!
Best

Jayanthi Rajamani posted on Wednesday, September 02, 2009 - 1:49 pm

I am using syntax similar to that in the Mplus manual to estimate a SEM with constrained factor loadings across multiple groups, but separate model ON statements. But I get a message that the model didn't converge and factor scores were not computed. When I look at the parameter estimates, they don't look too huge, and there are no negative residual variances. I have 2 groups, and 3 continuous latent variables, and 15 factors. Can you send some sample syntax that would work?

Linda K. Muthen posted on Wednesday, September 02, 2009 - 2:12 pm

Please send your full output and license number to support@statmodel.com so I can see exactly what you are doing.

Sara Anderson posted on Sunday, September 27, 2009 - 1:21 pm

I'm running a model and want to compare effects across developmental periods. I've run two models, one unconstrained, and one where I've constrained the paths of interest to be equivalent across developmental periods. Is there also a way to test if the factor loadings in the unconstrained model are significantly different across periods instead of running a separate model where the paths are constrained to be equal?

Linda K. Muthen posted on Monday, September 28, 2009 - 8:40 am

You can use MODEL TEST. See the user's guide.

Aniruddha Das. posted on Saturday, October 03, 2009 - 1:56 pm

Hi,
I fully understand why using nominal variables as mediators in a path/SEM is unacceptable. But suppose one were to include it as a set of dummies. *And* if separate analysis, using seemingly unrelated estimation, showed that the effects of "upstream" exogenous variables on these dummies, was statistically no different from the effects of these same exogenous variables on the corresponding nominal variable categories -- would the strategy then become defensible?
Thanks.
- Bobby

Aniruddha Das. posted on Saturday, October 03, 2009 - 2:05 pm

Elaboration on last post: my point is that if IIA holds, then why not replace the multinomial part of the model (exogenous > nominal) with corresponding logits (exogenous > dummies)? Thanks,
Bobby

Linda K. Muthen posted on Monday, October 05, 2009 - 11:30 am

Following is an answer that was given to a similar question last week. It was found by searching on nominal.

"I don't think mediation via a nominal mediator m has been studied methodologically - but correct me if I am wrong. One possible direction to go would be to create a latent class variable c where the nominal categories of c are the same as the observed nominal variable categories of m (this is done via logit thresholds). c on x is then a multinomial logistic regression and the influence of c on y is captured by the means of y changing over the c categories (you don't say "y on c", but it has the same effect). This avoids the y on m regression which would treat m as continuous which would not make sense when m is nominal.

One can then explore if there is a need for direct effects y on x. But there isn't any guidance for how one should/could simply quantify how much of the x influence goes via m versus directly. Perhaps that isn't needed. This topic is a method research paper in itself - anyone?"

Aniruddha Das. posted on Monday, October 05, 2009 - 1:49 pm

Hi Linda,
Thanks, this helps. I apologize in advance if the following is a completely brain-dead question:

From a previous discussion:
"Making the observed nominal u variable the same as the categorical latent c variable is done by saying

%c#1%
[u$1@15];
%c#2%
[u$1@-15];"

I'm guessing this is for 2 categories.
I.e. for fixing thresholds to logit values of 15 & -15, corresponding to 0/1 probabilities. Right? So how to fix thresholds for 3(+) categories?

Once more, I suspect I'm being really dumb here... But clarification would be appreciated. Thanks,
Bobby

Bengt O. Muthen posted on Monday, October 05, 2009 - 2:36 pm

For 3 groups (categories) you use:

%c#1% ! g=0 group
!(note: high threshold, low prob.)
[g$1@15 g$2@16 g$3@17];

%c#2% ! g=1 group
[g$1@-15 g$2@16 g$3@17];

%c#3% ! g=2 group
[g$1@-16 g$2@-15 g$3@17];

where g is declared categorical.

Aniruddha Das. posted on Thursday, October 08, 2009 - 9:24 am

Thanks!

One last question: I'm assuming this latent variable couldn't be dumped into a more elaborate SEM, where "downstream" dependent variables influence each other, and all are influenced by exogenous x's.

I don't particularly need to know how much of the influence of x goes via the nominal variable versus directly.

Just wanted to check. Thanks again,
Bobby

Bengt O. Muthen posted on Thursday, October 08, 2009 - 10:00 am

This approach can be combined with a full SEM.

Sally Olderbak posted on Sunday, November 01, 2009 - 5:24 pm

I am interested in running a multiple group analysis and constraining paths across three samples. However, one sample is missing two variables so, I would like to constrain all of the paths across all of the samples, except for paths related to those two variables for that sample, but for the two samples that have those two variables, I would like the paths constrained.

Please let me know if this is possible in MPlus and if so, how I can do it.

Thanks!

Linda K. Muthen posted on Monday, November 02, 2009 - 9:37 am

You need the same set of observed variables in each group.

Bjarte Furnes posted on Wednesday, November 04, 2009 - 12:54 am

Hi,

I'm doing a multigroup comparison including children learning to read across two different orthographies.

I'm a new user of Mplus, but so far I understand that the procedure is to go step by step. First comparing (across groups) factor loadings, then intercepts, then factor variance/covariance etc.

I've also learned that if some of the steps show a significant difference across groups, then further comparison is meaningless.

My question concerns partial measurement invariance. In my study I have five latent variables made up of ten indicators (two indicators for each latent). A chi-square difference test tells me that my factor loadings differ significantly across groups. But does that mean that all loadings are significantly different? Or is there a place in the output showing which loadings differ?

Is it possible to continue comparing invariance across groups allowing some parameters to be free?

Thanks

Linda K. Muthen posted on Wednesday, November 04, 2009 - 5:57 am

You can have partial measurement invariance if you model the invariance by allowing the parameters to differ across groups. How much invariance you can have is debatable. You can see where the large differences are by looking at modification indices. See the Topic 1 video and course handout for a full description of measurement invariance.
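
A small sketch of how a partially invariant model might be written, using invented names (grouping variable g, groups g1 and g2, one factor f1 with indicators y1-y3). Mentioning a measurement parameter in a group-specific MODEL command releases the default equality for that parameter only:

```
VARIABLE:
  GROUPING IS g (1 = g1 2 = g2);   ! hypothetical coding and labels
MODEL:
  f1 BY y1 y2 y3;     ! loadings and intercepts held equal across groups by default
MODEL g2:
  f1 BY y3;           ! frees the y3 loading in the second group
  [y3];               ! frees the y3 intercept in the second group
OUTPUT:
  MODINDICES (3.84);  ! flags equalities that fit poorly
```

The other loadings and intercepts stay equal across groups, so later steps can still be compared on the invariant part of the measurement model.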

QianLi Xue posted on Sunday, November 29, 2009 - 4:07 pm

Is it true that theoretically, multiple group CFA with categorical factor indicators can have same loadings but different thresholds across groups? The model will be identified as long as the scale factors are set to 1 across all groups.

Bengt O. Muthen posted on Sunday, November 29, 2009 - 5:05 pm

Yes, I think that's true. Note that the scale factors depend on 3 things: loadings, factor variance, and item residual variance. If factor variances are different across groups, fixing scale factors at 1 in all groups would be inconsistent with that.

Bjarte Furnes posted on Monday, December 14, 2009 - 5:10 am

Hi,

I'm doing a hierarchical regression analysis in two groups using Cholesky decomposition (because of indications of collinearity among the independent variables). I have established measurement as well as factor variance/covariance across groups. How do I compare regression coefficients across groups? I think this procedure is quite straightforward doing ordinary SEM, but I'm getting confused by the decomposition framework.

Thanks.

Sally Czaja posted on Monday, December 14, 2009 - 11:10 am

I am predicting a person level latent variable outcome (achievement) using a cluster level factor (neighborhood poverty) by group (grp). I'm running the following analysis and am getting an error message (ERROR in MODEL command Parameters involving between-level variables are not allowed to vary across classes. Parameter: FB ON NEIGHPOV). Is there another way to estimate a model which allows between level variables to vary across classes? What would you suggest? Thank you.
Classes= c(2);
KNOWNCLASS = c(grp=0 grp=1);
WITHIN = female raceWb ageint1 poverty;
CLUSTER = census;
BETWEEN = neighpov;
ANALYSIS: TYPE= TWOLEVEL mixture;
MODEL:
%WITHIN%
%OVERALL%
fw BY ZgrdyrSp5 Zwratscr Zqutest;
fw ON female raceWb ageint1 poverty;
%c#1%
fw BY ZgrdyrSp5 Zwratscr Zqutest;
fw ON female raceWb ageint1 poverty;
%c#2%
fw BY ZgrdyrSp5 Zwratscr Zqutest;
fw ON female raceWb ageint1 poverty;
%BETWEEN%
%Overall%
fb by ZgrdyrSp5 Zwratscr Zqutest;
fb on neighpov;
ZgrdyrSp5 Zwratscr Zqutest @0;
%c#1%
fb by ZgrdyrSp5 Zwratscr Zqutest;
fb on neighpov;
ZgrdyrSp5 Zwratscr Zqutest @0;
%c#2%
fb by ZgrdyrSp5 Zwratscr Zqutest;
fb on neighpov;
ZgrdyrSp5 Zwratscr Zqutest @0;

Bengt O. Muthen posted on Tuesday, December 15, 2009 - 11:31 am

Bjarte - is that a Cholesky decomposition of the independent factors? So that one factor is residualized given the other?

Bjarte Furnes posted on Tuesday, December 15, 2009 - 12:11 pm

Yes.

Bjarte Furnes posted on Tuesday, December 15, 2009 - 12:16 pm

I see that my first explanation was somewhat unclear. I meant "I have established measurement invariance as well as factor variance/covariance invariance across groups". And yes, it is a Cholesky decomposition of the independent factors.

Bengt O. Muthen posted on Wednesday, December 16, 2009 - 11:31 am

It sounds like for your group comparisons of slopes on the factors you don't want to use the decomposed factors you got by Cholesky but the original ones. If so, you back-translate the slopes to the original factors using Model Constraint and do tests of invariance using Model Test.
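
To make the back-translation concrete, here is a two-factor, single-group sketch with invented names (original factors f1 and f2, Cholesky factors c1 and c2, outcome z). With f1 = l11*c1 and f2 = l21*c1 + l22*c2, the slopes of z on the original factors follow from substituting c1 and c2 back out:

```
MODEL:
  f1 BY y1 y2;              ! original factors (hypothetical indicators)
  f2 BY y3 y4;
  c1 BY f1* (l11)
        f2  (l21);
  c2 BY f2* (l22);
  f1@0; f2@0;               ! no residual variance left in f1 and f2
  c1@1; c2@1;
  c1 WITH c2@0;
  z ON c1 (b1)
       c2 (b2);
MODEL CONSTRAINT:
  NEW(s1 s2);               ! slopes of z on f1 and f2
  s2 = b2/l22;
  s1 = (b1 - l21*b2/l22)/l11;
```

In a multiple group run the labels would have to differ by group, given in the group-specific MODEL commands, so that each group gets its own s1 and s2 for Model Test to compare.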

Linda K. Muthen posted on Wednesday, December 16, 2009 - 12:01 pm

Sally: Please send your input, data, output, and license number to support@statmodel.com.

Bjarte Furnes posted on Thursday, December 17, 2009 - 11:22 am

Hi, thank you so far! I think I should include some more information.

Below is the input for my separate hierarchical regressions (Cholesky decomposition). As I said in an earlier post I understand (hopefully) the procedure in how to compare SEM models when the predictor variables are included simultaneously. However, I'm not sure what to do when comparing hierarchical models. What should I include in the second model (Model Scan) so that I can set the baseline model and then proceed with the comparison from factor loadings to structural paths?

Thank you

See next post for input.

Bjarte Furnes posted on Thursday, December 17, 2009 - 11:28 am

USEVARIABLES ARE v1-v10;
USEOBSERVATIONS = V1 EQ 2;
!GROUPING IS V1 (1=Eng 2=Scan);

WR by v1 v2;
VOC by v3 v4;
RAN by v5 v6;
PA by v7 v8;

WR1 by v9 v10;

PH1 by WR* VOC RAN PA;
PH2 BY VOC* RAN PA ;
PH3 BY RAN* PA;
PH4 by PA*;

WR@0;
VOC@0;
PA@0;
RAN@0;

PH1@1;
PH2@1;
PH3@1;
PH4@1;

PH1 WITH PH2@0 PH3@0 PH4@0;
PH2 WITH PH3@0 PH4@0;
PH3 with PH4@0;

WR1 on PH1 PH2 PH3 PH4;

Model Scan: ?

Bengt O. Muthen posted on Thursday, December 17, 2009 - 3:16 pm

Is your question how to translate the slopes of

WR1 ON PH1-PH4;

to slopes for

WR1 ON WR VOC RAN PA;

?

I don't understand what "Model Scan:" is. I would expect that you would have Model Constraint here.

Bjarte Furnes posted on Friday, December 18, 2009 - 4:22 am

The "model scan" is the group specific model (Group 1 is Eng, Group 2 is Scan).
The output above is my setup for hierarchical regressions. PH1-PH4 each correspond to one factor residualized given the others. That is, PH1 is equal to WR, PH2 is the residual of VOC after WR has been partialled out, PH3 is the residual of RAN after WR and VOC have been partialled out, and PH4 is the residual of PA after WR, VOC, and RAN have been partialled out.

I have done separate analysis for each group (Eng and Scan). My problem arises when I try to compare the structural paths for WR1 on PH1-PH4 across groups.

As I said, when doing standard regression this is quite straightforward. When doing hierarchical regression I'm not sure if it's even possible...

Bengt O. Muthen posted on Friday, December 18, 2009 - 9:45 am

I don't see why a problem would arise here. First test that the measurement parameters are equal across groups, including the parameters of the PH* BY statements, and if that is not rejected, test if the structural parameters of WR1 ON PH1-PH4 are equal. Group-equality testing is covered in our Topic 1 course on the web.

Yaacov Petscher posted on Thursday, January 07, 2010 - 11:50 am

Greetings,

I have data for three different grades who were assessed on the same instrument in the fall and spring of the academic year. In order to estimate appropriately scaled ability scores across time points (fall/spring) and grade (1,2,3), is it best to run two separate multiple group analyses (one for each time point) or to run a multiple group MIMIC model with time as the covariate? Thank you for any input!

Linda K. Muthen posted on Friday, January 08, 2010 - 9:39 am

I would first test measurement invariance across time for each grade. Once that is established, I would use multiple group analysis to test measurement invariance across grade.

leah lipsky posted on Friday, January 08, 2010 - 11:05 am

Hello, Can you tell me why I'm getting the same estimates & fit statistics regardless of which paths I constrain (trying to do multiple group path analysis)? For example, the 1st model below constrains all paths, and the 2nd I believe frees them all. Thank you!!

MODEL 1--ALL PATHS CONSTRAINED
VARIABLE: NAMES ARE id edu exfreq ageyrs wtchg2y gainer pcap pwtatt pseff
yr1rtrn yr2rtrn return bmichg ploc fvint retain1y partot;
MISSING = ALL (999);
GROUPING IS retain1y (0=no 1=yes);
USEV ARE pseff ploc exfreq fvint wtchg2y;
CATEGORICAL = exfreq fvint;
MODEL: exfreq on pseff ploc;
fvint on pseff ploc;
wtchg2y on pseff ploc;
pseff with ploc;
fvint with exfreq@0;
OUTPUT: standardized modindices(3.84);

MODEL 2- NO CONSTRAINTS
DATA: ...same as above...
MODEL: exfreq on pseff ploc;
fvint on pseff ploc;
wtchg2y on pseff ploc;
pseff with ploc;
fvint with exfreq@0;
MODEL retainer: exfreq on pseff ploc;
fvint on pseff ploc;
wtchg2y on pseff ploc;
pseff with ploc;
fvint with exfreq@0;
OUTPUT: standardized modindices(3.84);

Linda K. Muthen posted on Friday, January 08, 2010 - 11:36 am

The default in Mplus is for regression coefficients to be free across groups, so the models are the same. You can see this by looking at TECH1 or your model results. You need to constrain the parameters to be equal in the second model.
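
A sketch of what a constrained run could look like for this model, assuming the default WLSMV estimator that comes with CATEGORICAL outcomes. The file name deriv.dat is arbitrary. Because a plain chi-square subtraction is not valid with WLSMV, the comparison goes through the DIFFTEST option in two steps:

```
! Step 1: run the model with the paths free across groups and add
!   SAVEDATA: DIFFTEST IS deriv.dat;
! Step 2: run the model below, with each path held equal across groups
ANALYSIS: DIFFTEST IS deriv.dat;
MODEL:
  exfreq  ON pseff (1)
             ploc  (2);
  fvint   ON pseff (3)
             ploc  (4);
  wtchg2y ON pseff (5)
             ploc  (6);
  pseff WITH ploc;
  fvint WITH exfreq@0;
OUTPUT: standardized modindices(3.84);
```

Each number sits on its own line so that a path is equal across the two groups without being equal to the other paths.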

Maren Winkler posted on Tuesday, February 09, 2010 - 6:35 am

Dear Linda,

in my multiple group analysis (testing for metric invariance) I get the following message:

"THE MODEL ESTIMATION TERMINATED NORMALLY

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE
COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL.
PROBLEM INVOLVING PARAMETER 324."

However, I don't have a parameter 324 - I checked TECH1.
My model looks like this:
y1 BY x1 TO x7;
y2 BY x8 TO x13;
y2 ON y1 x7;

I have four groups.

My file contains missing data which I specify by "missing = blank" and I use the "auxiliary" option.
My groups are of different size.
However, this was not a problem when establishing configural invariance.

Since I don't have a parameter 324 - what does the error message mean?

Thanks a lot!

Linda K. Muthen posted on Tuesday, February 09, 2010 - 6:45 am

Go to the beginning of Technical 1 and search for 324. I have never heard of us reporting a parameter number that does not exist. If this does not help, please send the full output and your license number to support@statmodel.com.

Maren Winkler posted on Tuesday, February 09, 2010 - 9:20 am

Dear Linda,

I've just emailed my output.

Additionally, I've estimated the above model testing for metric invariance without the "AUXILIARY" command, getting the following warning:

"THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE CONDITION NUMBER IS -0.693D-17. PROBLEM INVOLVING PARAMETER 82.
THIS IS MOST LIKELY DUE TO HAVING MORE PARAMETERS THAN THE SAMPLE SIZE IN ONE OF THE GROUPS."

My groups are of sample size 42, 100, 72 and 158, respectively. If I'm not mistaken, I estimate fewer parameters than the sample size in one group. Moreover, the parameter that Mplus points to is the psi parameter for the endogenous latent variable in my second largest group.
And I did not get this error message in the model testing for configural invariance where more parameters had to be estimated!

Thanks for your help!

Maren Winkler posted on Tuesday, February 09, 2010 - 10:08 am

Dear Linda,

I've checked my input file for the fifth time and have eventually discovered the reason for both problems mentioned above.
I've specified the model correctly now and it works well.
Sorry for bothering you ...

Maren Winkler posted on Tuesday, February 23, 2010 - 8:00 am

Dear Drs. Muthén,

I have four groups in my SEM. Since subjects in all groups have missing values I have used the "AUXILIARY option" as follows:

AUXILIARY = (m) z1 z2 z3 z4 z5;

My models either did not converge or the SEs could not be estimated. Now I've run the model again without auxiliary variables and everything works fine (i.e. I can establish scalar invariance). How am I to interpret this result? I would have expected a different result: no convergence without auxiliary variables and convergence with auxiliary variables.

Thanks for your help!

Linda K. Muthen posted on Tuesday, February 23, 2010 - 8:59 am

It would be impossible to answer this question without seeing your input, data, and license number at support@statmodel.com.

Claudio Rocha posted on Wednesday, February 24, 2010 - 2:57 pm

I would like to test the structural invariance between three groups.
To produce an unconstrained model (regression weights free among groups), should I fix all parameters (factor loadings, intercepts, and means@0) in my measurement model? Would it be better to fix only factor loadings and intercepts? What should I do with my covariances: fix them or leave them free?
Thank you for your help!

Linda K. Muthen posted on Wednesday, February 24, 2010 - 5:41 pm

The Mplus default is to hold the measurement parameters of factor loadings and intercepts equal across groups. The residual variances are not held equal. To compare structural parameters, measurement invariance is required.

See the end of the discussion of measurement invariance and population heterogeneity to see the models for testing the equality of the structural parameters.

Jennie Jester posted on Friday, February 26, 2010 - 7:28 am

I am running a fairly complicated SEM analysis. Just to give you an idea of the scope of it, here is the syntax

MODEL:
abusemid on abuseearly;
momdrink by maxdrkBMt12 dkpropbmt1BM bingedkt1BM;
maxdrkBMt12 with bingedkt1BM;
daddrink by bingedkt1BF dkpropbmt1BF maxdrkBFt12;
bingedkt1BF with maxdrkBFt12;
daddrink with momdrink;
momdrink on FHg0;
daddrink on FHg0;
abuseearly on momdrink;
abuseearly on daddrink;
adolund by DELINt4 DELINt5 aggret4 aggret5;
earlyund by aggBFt1 aggBFt2 aggBMt1 aggBMt2 delBFt2
delBMt2 ;
school by TRFviii1t5 TRFviii2t5 TRFviii3t5 TRFviii4t5;
abuseearly on FHg0;
crisemft4 on FHg0 momdrink daddrink;
adolund on abusemid crisemft4;
school on abusemid crisemft4;
school on adolund ;
adolund on earlyund tsex;

I have 329 in my sample - 89 are girls and 240 are boys. I would like to address sex differences in this model, but I feel like I do not have enough girls to do this with. Do you think there are enough girls to try to run the 2 group analysis?

Thanks!

Jennie

Linda K. Muthen posted on Friday, February 26, 2010 - 9:33 am

At a minimum you need to have several more observations in a group than you have parameters in the group. If you meet this condition, you would need to do a Monte Carlo study to see if the sample size is large enough.

Maren Winkler posted on Friday, March 26, 2010 - 5:30 am

Dear Linda and Bengt,

I have a question concerning the calculation of CFI in multi-group models. Little et al. (2007) refer to a paper by Widaman and Thompson (2003) who argue that "many applications of SEM require one to specify and estimate an appropriate null model" when one wishes to model variances or means.

Is such an altered null model used as the default in the calculation of CFI in Mplus 5.21 when specifying a grouping variable?

Thanks for your help,
Maren

.......
Little, T. D., Card, N. A., Slegers, D. W. & Ledford, E. C. (2007). Representing contextual effects in multiple-group MACS models. In T. D. Little, J. A. Bovaird & N. A. Card (Eds.), Modeling contextual effects in longitudinal studies. (pp. 121-147). Mahwah, NJ US: Lawrence Erlbaum Associates Publishers.

Widaman, K. F. & Thompson, J. S. (2003). On specifying the null model for incremental fit indices in structural equation modeling. Psychological Methods, 8, 16-37

Linda K. Muthen posted on Friday, March 26, 2010 - 9:18 am

I am not familiar with the articles. The Baseline model used in Mplus is the means and variances of all observed variables and the covariances of the observed exogenous variables.

Maren Winkler posted on Friday, March 26, 2010 - 10:32 am

Widaman and Thompson (2003) describe the modified null model as follows: "First, an acceptable null model must represent covariances among manifest variables as null, or zero. Second, and the key distinction here, if any within-group and / or between-group constraints on estimates of manifest variable means or residual variances are invoked in any substantive models under consideration, these constraints must be included in an acceptable null model. These constraints on means and residual variances will typically be operationalized as constraints on the tau and theta matrices that are the only matrices with parameter estimates in the standard null model."

Is there a way to specify such an alternative baseline model in Mplus?

Linda K. Muthen posted on Friday, March 26, 2010 - 4:11 pm

You can't change the Baseline model that Mplus uses. However, you can run two models, the baseline you want and your H0 model and do a difference test.

We do not fix the observed exogenous variable covariances to zero because the model is estimated conditioned on the observed exogenous variables. Their covariances are not fixed at zero during model estimation. By fixing them at zero in the baseline model, overall model fit depends on how highly the observed exogenous variables correlate in spite of the fact that these correlations are not H0 model parameters.

Maren Winkler posted on Sunday, March 28, 2010 - 11:25 pm

I've thought about running a difference test, too. However, if I did as you've suggested, wouldn't all goodness-of-fit indices (CFI, TLI, RMSEA, ...) for my H0 model still be calculated on the basis of the baseline model that Mplus uses as the default?
If so, and if I want to estimate these fit-indices by using my baseline model, could I estimate these indices by hand by using the chi-square difference value in the formulas?
Thanks for your help!

Linda K. Muthen posted on Monday, March 29, 2010 - 7:00 am

The Baseline model used by Mplus will not change. You would need to calculate all fit statistics by hand using the chi-square difference value if you want to change the Baseline model.

Suzanne Elgendy posted on Wednesday, March 31, 2010 - 10:31 am

Drs. Muthen,

I am conducting a multi-group path analysis and am attempting to compare a model where all parameters are freely estimated to one in which the means are constrained to be equal across groups. I am obtaining the same model fit statistics and parameter estimates in both the freely estimated and constrained models, which does not seem possible. I set up my variables as latent constructs using a single indicator. Below is a portion of my input.

Freely estimated model:
!LATENT VARIABLES MEANS (A = alpha)
[p0_pos]; [p1_pos]; [p2_pos]; [p3_pos];
[t0_agg]; [t1_agg]; [t2_agg]; [t3_agg];

Model with means constrained to be equal:

!LATENT VARIABLES MEANS (A = alpha)
[p0_pos](1); [p1_pos](2); [p2_pos](3); [p3_pos](4); [t0_agg](5); [t1_agg](6); [t2_agg](7); [t3_agg](8);

I would appreciate any suggestions on how to correct this issue. Thank you very much for your help.

Linda K. Muthen posted on Wednesday, March 31, 2010 - 10:57 am

It is not possible to answer your question without more information. Please send the two outputs and your license number to support@statmodel.com.

Wu wenfeng posted on Monday, April 05, 2010 - 5:20 pm

Hello!
I have read some articles about measurement invariance and found that the procedures for using MASC to test multiple groups differ. When testing latent mean equivalence, should item variance equivalence also be tested? If so, should that test come before or after the latent mean equivalence test?

Bengt O. Muthen posted on Tuesday, April 06, 2010 - 7:51 am

I don't know what MASC is.

Wu wenfeng posted on Tuesday, April 06, 2010 - 8:16 am

Sorry, I misspelled it. It should be MACS (means and covariance structures).

Linda K. Muthen posted on Tuesday, April 06, 2010 - 8:26 am

Latent variable means are not measurement parameters. They are structural parameters. See our Topic 1 course handout and video for a discussion of using multiple group analysis to test for measurement invariance and population heterogeneity.

Wu wenfeng posted on Tuesday, April 06, 2010 - 9:32 am

I have read the content you mentioned, but I am still confused. Anyway, thank you!

Pascal Bruno posted on Monday, April 12, 2010 - 7:11 am

Dear Dr. Muthen,

I am conducting a multiple group ESEM analysis of dichotomous data (2 groups) with a high number of cases in each group (170,000 and 50,000 respectively) and 41 variables. The fit indices of our analysis (CFI, RMSEA and TLI) indicate, when testing for measurement invariance (thresholds and loadings equal, scale factors 1 in one group and free in the other), that both groups have a similar structure. We conduct factor analyses in the first place in order to obtain factor values on which our further analyses are based.

Based on the factor values, which are comparable for the two groups in the invariant model, we would like to calculate Euclidean distances (of these factor values for each case across the two groups).

Is it legitimate to constrain the factor means in the invariant model in both groups to zero (contrary to the recommendation that, when holding thresholds and loadings constant, means in the second group should be estimated freely)? With factor means of 0 in both groups, factor values seem to be much more comparable in our case, and Euclidean distances would be calculated for standardized factors instead of subtracting standardized from unstandardized factor values, right?

I would highly appreciate your comments.
Thanks,
Pablo

Linda K. Muthen posted on Tuesday, April 13, 2010 - 11:06 am

I would not recommend this.

PB posted on Tuesday, April 13, 2010 - 12:40 pm

Thank you for your reply.

The idea behind holding factor means invariant was to be able to actually compare factor values by calculating distances between the factor scores of one group and the scores of the other group per item.

Our goal is in fact to have such a proximity measure on which our further analyses are based.

Could you maybe specify what exactly you would not recommend: Holding factor means in this case invariant (although for non-invariant factor means distances between the factor values do not really make sense) or actually calculating distances of the factor scores at all (and if so, why)?

Your help is very much appreciated.
Thanks in advance.

Bengt O. Muthen posted on Friday, April 16, 2010 - 10:27 am

You need to first test if the factor means are equal across the groups. Only if that is not rejected would I work with factor scores from the model where you hold the means at zero in all groups.

Note that factor scores are comparable across groups even when factor means are different. The measurement invariance ensures that. So you could go ahead and calculate your factor score distances under our default model.

PB posted on Friday, April 30, 2010 - 1:57 am

Thank you for your response.
Which way to test for the equality of the factor means across the groups is recommended/sufficient?

When holding means in the reference group constant, while freeing them in the other group, the resulting estimated means (which as far as I understand are the mean differences in comparison to the reference group) are significant. This is not surprising since the dataset is quite comprehensive.

However, when comparing the model with means = 0 in the reference group and freely estimated means in the second group to a model with factor means fixed at zero in both groups, the change in the fit indices is quite small and the indices themselves are good.

Could I therefore assume that I can work with a model with factor means held at zero in both groups (due to the still good fit values)?

Thanks again for your much appreciated help.

Linda K. Muthen posted on Friday, April 30, 2010 - 8:47 am

The differences between what you do in paragraphs 2 and 3 are unclear. Please send the two outputs and your license number to support@statmodel.com.

PB posted on Friday, April 30, 2010 - 9:42 am

Sorry for not being clear. Let me try to clarify.

What I meant in paragraph 3 was: I am comparing model A (Thresholds and Factor Loadings constrained to be equal across groups; residual variances fixed at one in one group and free in the other; factor means fixed at zero in one group and free in the other group) to model B (Factor Loadings and Thresholds held equal in both groups AND Factor Means fixed at zero in BOTH groups).

The fit indices in model B are good and the differences from the fit indices in model A are rather small.

Can I therefore assume (referring to Dr. Muthen's post from April 16, 2010 - 10:27 am), that factor means are equal across the groups (since the invariance model B with equal means shows good fit indices)?

Linda K. Muthen posted on Saturday, May 01, 2010 - 4:11 pm

Yes.

Maren Winkler posted on Friday, May 07, 2010 - 8:10 am

Dear Drs. Muthén,

I would like to specify a multiple group model in Mplus with the following constraints:
factor loadings fixed to zero,
intercepts invariant over groups,
and unique factor covariances are freely estimated.

I have a g-factor model with seven indicators which I specified as follows for all four groups:

F BY a@0 b@0 c@0 d@0 e@0 f@0 g@0;
[a] (1);
[b] (2);
[c] (3);
[d] (4);
[e] (5);
[f] (6);
[g] (7);

I get the following error message:
THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL. PROBLEM INVOLVING PARAMETER 14.
(which is the theta parameter for variable g)

What am I doing wrong in specifying the model?

Thank you so much for your help!

Linda K. Muthen posted on Friday, May 07, 2010 - 9:03 am

When you fix the factor loadings to zero, the factor variance is not identified.

Maren Winkler posted on Friday, May 07, 2010 - 9:44 am

Dear Linda,

I've added the following in each group:

F@1;
[F@0];

a*;
b*;
c*;
d*;
e*;
f*;
g*;

And now I do get estimates.

Do these additional commands alter the meaning of my model?

Thanks for your help!

Bengt O. Muthen posted on Friday, May 07, 2010 - 9:55 am

That specification is the same as not having the factor in the model - no free parameters are associated with it.

Maren Winkler posted on Sunday, May 09, 2010 - 11:35 pm

What I'm trying to do with this model is specifying the "acceptable null model" according to Widaman and Thompson (2003). For my case, this model has to have the following specifications:
factor loadings fixed to zero,
intercepts invariant over groups,
unique factor covariances freely estimated.

I'm not sure whether the additional specifications for mean and variance (which I added in order to identify the model) actually alter the model's meaning.

Thanks for your help!

Linda K. Muthen posted on Monday, May 10, 2010 - 8:14 am

You should specify a model of means held equal across groups and variances and covariances not held equal across groups.

Lotta Tynkkynen posted on Monday, June 14, 2010 - 7:13 am

hello!

Why are the estimates a little bit different if I use multigroup analysis (let's say for boys and girls) than if I use separate data for boys and girls and run exactly the same model (no constraints included)?

Thank you!

Linda K. Muthen posted on Monday, June 14, 2010 - 7:47 am

They should not be different. You may not have relaxed all of the default equality constraints in Mplus. If you can't see the problem, please send the relevant outputs and your license number to support@statmodel.com.

Nadia posted on Wednesday, June 23, 2010 - 9:53 am

hi, i am trying to run a simple logistic regression with a categorical variable as predictor, how do i run this?
I have put in grouping is... but it still comes up with an error message:
ERROR in ANALYSIS command
ALGORITHM = INTEGRATION is not available for multiple group analysis.
Try using the KNOWNCLASS option for TYPE = MIXTURE.

and if i try to do mixture it tells me i don't have the mixture option

Linda K. Muthen posted on Wednesday, June 23, 2010 - 10:59 am

The GROUPING option is not available with maximum likelihood and categorical outcomes. You need to use KNOWNCLASS and MIXTURE for that. You can use probit regression and the GROUPING option.

Nadia posted on Friday, June 25, 2010 - 9:12 am

Sorry Linda,
I am using a binary outcome but a categorical predictor, not a categorical outcome.

Linda K. Muthen posted on Friday, June 25, 2010 - 10:09 am

A binary outcome is a categorical outcome.

Nadia posted on Sunday, June 27, 2010 - 6:40 am

thanks,
is there any way i can buy just the mixture add on? although i have the basic option?

Linda K. Muthen posted on Sunday, June 27, 2010 - 9:22 am

Yes. Please contact Michelle at Mplusadmin@statmodel.com for more information.

Nadia posted on Tuesday, June 29, 2010 - 1:30 am

hi linda, this is really driving me bonkers!
I am trying this out on our institution computer which has the mixture add on
I have tried the type=mixture with knownclass and i keep getting an error message
*** ERROR in Variable command
CLASSES option not specified. Mixture analysis requires one categorical
latent variable.
but i don't want a categorical latent variable, all i want is a straightforward logistic regression with a categorical predictor
How can it be so complicated to do this!!

Linda K. Muthen posted on Tuesday, June 29, 2010 - 6:16 am

A logistic regression is shown in Example 3.5. If you want multiple group analysis also, you need to use the KNOWNCLASS option along with the CLASSES option and TYPE=MIXTURE. Example 8.8 shows the way to specify this.

Prathiba posted on Tuesday, June 29, 2010 - 9:22 am

Dear Drs. Muthen: If I conduct a multigroup CFA with sample sizes N=550, 3261, and 2103, do you think the sample size disparity would cause inflation/deflation of any estimates? No parameters are constrained across groups.

Bengt O. Muthen posted on Wednesday, June 30, 2010 - 10:04 am

If no parameters are constrained equal across groups the results are the same as if analyzing each group separately.

min soo kim posted on Tuesday, July 13, 2010 - 6:51 am

Hello, Dr. Muthen

I'm conducting a SEM including an interaction effect.
I want to conduct MSEM with an interaction effect.
As I know, Mplus does not provide chi-square statistic when an interaction term is included in the model.
How can I examine the group differences?

Linda K. Muthen posted on Tuesday, July 13, 2010 - 7:53 am

Using loglikelihood difference testing where -2 times the loglikelihood difference is distributed as chi-square. Or use MODEL TEST.
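The loglikelihood difference test mentioned above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Mplus code: the function names `chi2_sf` and `lr_difference_test` and all numbers are hypothetical, and the loglikelihoods are assumed to come from plain ML. With a robust estimator such as MLR the difference needs a scaling correction, which is not shown here.

```python
import math


def chi2_sf(x, df):
    """Upper-tail chi-square probability via the series for the incomplete gamma."""
    if x <= 0.0:
        return 1.0
    a, z = df / 2.0, x / 2.0
    term, total, n = 1.0, 1.0, 0
    # Terms may grow at first when z is large, but they eventually shrink.
    while term > 1e-15 * total:
        n += 1
        term *= z / (a + n)
        total += term
    p_lower = math.exp(a * math.log(z) - z - math.lgamma(a + 1.0)) * total
    return max(0.0, 1.0 - p_lower)


def lr_difference_test(ll_restricted, ll_general, k_restricted, k_general):
    """-2 times the loglikelihood difference, referred to chi-square."""
    stat = -2.0 * (ll_restricted - ll_general)
    df = k_general - k_restricted  # difference in number of free parameters
    return stat, df, chi2_sf(stat, df)


# Hypothetical values: the restricted model holds two parameters equal across groups.
stat, df, p = lr_difference_test(-1000.0, -997.0, k_restricted=10, k_general=12)
print(stat, df, round(p, 4))
```

The loglikelihood values and the number of free parameters for each model are read from the two Mplus outputs.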

Anna Nagy posted on Tuesday, August 10, 2010 - 12:41 pm

Dr. Muthén,

I have math outcome data at two time points (pretest and post test) for students in two conditions: Treatment and Control.
Pretest and post test score measure 4 different aspects of math. Therefore I created a latent variable.

My question is are there significant differences between treatment and control group in math. To address that question my plan was to conduct multiple group analysis. However because of the small sample size (N = 78) I couldn't conduct the analysis.
Is there another way to address my question? The only idea that comes to my mind is to save the factor scores and conduct an ANCOVA.
Can you recommend some other ways to analyze my data?

Thank you,
Anna

Linda K. Muthen posted on Tuesday, August 10, 2010 - 12:58 pm

If you can estimate a model and obtain factor scores, I'm not sure why you were unable to conduct the analysis.

Anna Nagy posted on Tuesday, August 10, 2010 - 1:15 pm

I was only able to conduct the CFA and create two latent variables measuring math at time 1 and time 2.
Following that step I was planning to conduct the MGA, but the model blew up right at the configural invariance level.
I blamed it on the small sample size.

Linda K. Muthen posted on Tuesday, August 10, 2010 - 2:13 pm

Please send the files and your license number to support@statmodel.com.

Sonja Nonte posted on Monday, August 23, 2010 - 8:45 am

We are trying to test factorial invariance in a multigroup CFA (categorical data). We would like to specify the baseline model with free thresholds, factor
loadings, and means. We've already found out that we have to fix the factor mean at 0 and the residual variances at 1 for identification.
But our question is one step before that: how can we free the thresholds and factor loadings?
Until now, we have the following statements:

VARIABLE: ...
grouping is S1sex (1=girls 2=boys);

ANALYSIS: PARAMETERIZATION=THETA;

MODEL:
SpoSeko by S1Sp1r S1Sp2r
S1Sp3r S1Sp4r;
SpoSeko@0;
S1Sp1r@1;
S1Sp2r@1;
S1Sp3r@1;
S1Sp4r@1;
And in the next step (equal thresholds and factor loadings) do we keep the restrictions concerning the mean and the residual variances? If we do not keep
those, how can we still perform a diff test, even though we changed the baseline model?

Linda K. Muthen posted on Monday, August 23, 2010 - 8:58 am

See the Topic 2 course handout under multiple group analysis. Here the measurement invariance models are shown for the Delta parametrization. The only difference between this and the Theta parametrization is that scale factors are parameters in Delta and residual variances are parameters in Theta.

Laura Lysenko posted on Tuesday, September 07, 2010 - 7:54 am

Dr. Muthen,

I'm trying to run a cross-lagged model with second order factors. The simplified model is shown below:

model:
f1 by x1-x3;
f2 by x4-x6;
f3 by x7-x9;
f4 by x10-x12;

f5 by f1-f2;
f6 by f3-f4;

f6 ON f5;

This model runs fine, but when I run the model for multiple groups:

MODEL male:

f6 ON f5;

this model is not identified. Any advice would be helpful.

Thanks

Bengt O. Muthen posted on Tuesday, September 07, 2010 - 8:29 am

The intercepts of the 1st order factors in their regression on the 2nd order factors need to be fixed at zero in both groups for identification.

Regan posted on Tuesday, September 14, 2010 - 12:48 am

Hello! A while ago, someone had this question:

"...I ran subsequent multiple groups analyses for each of the 3 race/ethnicities...The model fit was excellent for two of the groups, but unacceptable for the third group....Should I accept the omnibus model for the two races that have good fit and develop a different model for the third race?"

Dr. Linda Muthen's advice to him:

"...It does not make sense to put groups together if the same model does not fit the data well for each group.... Only then does it makes sense to combine the groups..."

My questions:

1) I wanted to confirm that if in attempting to do a multiple-group path model, we first test the model in each group separately, and if you have good model fit in two groups and poor fit in one group, one should stop and just present a separate model for each group and not attempt the multiple-group approach? (If there may be a plausible and interesting reason as to the finding that the third model did not fit the data, can we also present this model?)

2) Am I correct that with a non-significant chi-2 diff test, your interpretation is that the H1 and Ho models are not significantly different from each other and it is okay to combine the data into one group--perhaps allowing for invariance in certain paths? (and that separate models are necessary if the chi-2 diff test IS significantly different)?

Thank you!

Linda K. Muthen posted on Tuesday, September 14, 2010 - 10:54 am

1. A first step in a multiple group analysis is to analyze each group separately. Only groups for which the same model fits well should be compared. That a different model fits well for one group can be of interest.

2. I don't know what you mean by combine into one group because if you do this, you cannot allow for invariance.

Regan posted on Tuesday, September 14, 2010 - 12:02 pm

Hello again,
I was referring to your response to the gentleman above. When you say that:

'it does not make sense to put groups together if the same model does not fit the data well...'

By this, do you mean that we should not try to compare these groups?

If my model fits well for non-hispanic caucasians and non-hispanic african-americans for instance, but not for hispanics/latinos, I believe what I should do after having run separate models is just do a multiple group analysis with the caucasian and african-american groups and either explain the lack of fit in the hispanic group or develop a separate model altogether for them. Is this understanding correct?

Thanks again in advance!

Linda K. Muthen posted on Tuesday, September 14, 2010 - 12:46 pm

You should not compare groups for which the same model does not fit well. There is no basis for comparison. These groups should not be included in the multiple group analysis.

Michael S. Businelle, Ph.D. posted on Thursday, October 07, 2010 - 1:57 pm

I have a good fitting omnibus structural equation model that includes three racial/ethnic groups (N=424). When I run the model that includes all participants, the fit indices are all good and I have no error messages. When I run the model separately for each group, the fit indices are still good, but I get the following error message:
"THE MODEL ESTIMATION TERMINATED NORMALLY WARNING: THE LATENT VARIABLE COVARIANCE MATRIX (PSI) IS NOT POSITIVE DEFINITE. THIS COULD INDICATE A NEGATIVE VARIANCE/RESIDUAL VARIANCE FOR A LATENT VARIABLE, A CORRELATION GREATER OR EQUAL TO ONE BETWEEN TWO LATENT VARIABLES, OR A LINEAR DEPENDENCY AMONG MORE THAN TWO LATENT VARIABLES. CHECK THE TECH4 OUTPUT FOR MORE INFORMATION. PROBLEM INVOLVING VARIABLE EDU3."

My question is, is the Mplus output for each racial group interpretable, or does the error message negate interpretability?

Linda K. Muthen posted on Thursday, October 07, 2010 - 2:30 pm

This message means the model is not admissible. You probably have a negative residual variance or variance for edu3.

haxha posted on Saturday, October 30, 2010 - 11:26 pm

Dear Dr. Mullen. I am using MPLUS to conduct a multi group analysis. I have a question with regard to validating the model. I have a model that I created for a large sample of 800 women. I was told that to validate it I need to test this model on one half of the population and then test it again on the other half. I do this using the MLM estimator because my data are not normal regardless of the transformations I have undertaken. Most estimates are similar (the direction, significance, chi square significance) but one parameter loses significance when I test the model in one half of the data. Should I respecify the model? Is it essential that all parameters are significant in all models tested? Also, since I can't use bootstrapping with MLM, is there any other simulation method I am able to use? Haxha

Linda K. Muthen posted on Sunday, October 31, 2010 - 9:59 am

I think typically one randomly divides the sample as a first step and fits the model first in one sample and then in the other. If key parameters are not significant in both, the model may not be robust.
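The random division described above can be done outside Mplus before the two runs. The following is a minimal Python sketch under assumptions: the case IDs 1 to 800, the seed, and the function name `split_half` are hypothetical, and the resulting ID lists would still need to be used to write two data files (or a half-indicator variable) for Mplus.

```python
import random


def split_half(case_ids, seed=2010):
    """Randomly divide case IDs into two halves of (nearly) equal size."""
    rng = random.Random(seed)  # fixed seed so the split can be reproduced
    shuffled = list(case_ids)
    rng.shuffle(shuffled)
    mid = len(shuffled) // 2
    return sorted(shuffled[:mid]), sorted(shuffled[mid:])


# Hypothetical sample of 800 cases numbered 1..800.
first_half, second_half = split_half(range(1, 801))
print(len(first_half), len(second_half))
```

The model is then fitted to the first half and re-fitted, unchanged, to the second half.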

haxha posted on Sunday, October 31, 2010 - 10:18 am

Thanks so much Dr. Muthen. I apologize for a typo earlier. Just one more follow up question if you don't mind. I have transformed the data but they are still not normal; I am using MLM but I am doing so with the already transformed data....is that ok? OR must I go back to using the data in their original form? Data in the original form are severely skewed. Also is there any simulation method instead of bootstrapping I can use with MLM? Many many thanks! Haxha.

Linda K. Muthen posted on Sunday, October 31, 2010 - 10:52 am

In general I would not transform variables. I would use the MLR estimator.

haxha posted on Sunday, October 31, 2010 - 11:15 am

Thank you so much!

Kai Savi posted on Thursday, November 04, 2010 - 12:51 pm

Hello,

I am working on a multiple group analysis and am looking for MLR output so I can use chi-square difference testing. I know I can't do MLR with grouping, but my data is in two data sets. I can do a multigroup analysis, but not with MLR.

Because I am using two data sets, I do not have a single variable to use to differentiate classes. Is it possible to use KNOWNCLASS and get a MLR output with two data sets?

Thanks.

Linda K. Muthen posted on Thursday, November 04, 2010 - 1:55 pm

You should be able to do this with MLR if you are using TYPE=GENERAL. Have you received an error message or are you just assuming this? If you have received an error message, please send your output and license number to support@statmodel.com.

Kai Savi posted on Thursday, November 04, 2010 - 2:09 pm

Thanks Linda,

I am assuming it, because I am not clear on how to describe KNOWNCLASS with two datasets (as opposed to a class variable). It seems like it should be simple enough, but I was not able to find anything in the manual on how to write that into the syntax.

Thanks,

Linda K. Muthen posted on Thursday, November 04, 2010 - 2:13 pm

It is not clear to me why you think you need the KNOWNCLASS option or if you do. If you do, the two data sets must be in the same file with a grouping variable.

Sofie Henschel posted on Tuesday, November 16, 2010 - 9:36 am

Hello,
I run a multi group analysis with strong invariance that fits fine. When i try to test for measurement invariance (configural or weak) I receive the message:

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE
FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE CONDITION NUMBER IS -0.929D-17. PROBLEM INVOLVING PARAMETER 34.

I checked parameter 34 (psi between two latent var.) but didn't see what's the problem. The model also runs fine when I try every group in a single model (only girls or boys). I'm wondering because it doesn't make sense to me that both single models work well and a model with strong invariance shows a good fit too, while more liberal models don't work. Is there any explanation for this phenomenon? Thanks a lot.

Linda K. Muthen posted on Tuesday, November 16, 2010 - 10:03 am

Please send the output and your license number to support@statmodel.com.

Amy Tobler posted on Friday, November 19, 2010 - 1:04 pm

I am running a multi-group clustered path analysis. When the model is run I get the following warning:

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE
TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE
FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING
VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE
CONDITION NUMBER IS -0.291D-15. PROBLEM INVOLVING PARAMETER 29.

THIS IS MOST LIKELY DUE TO HAVING MORE PARAMETERS THAN THE NUMBER
OF CLUSTERS MINUS THE NUMBER OF STRATA WITH MORE THAN ONE CLUSTER.

My question is, does this mean that the chi-square values for model fit are not reliable as well or just the individual parameter standard errors?

Thanks

Bengt O. Muthen posted on Saturday, November 20, 2010 - 7:33 am

Both chi-square and SEs are somewhat questionable in this case where you have more parameters than clusters. They could be fine, or they may be poor. It depends on several factors, including how many parameters refer to the between level - if few, you may be ok. Only a Monte Carlo simulation study could tell more.

aprile benner posted on Wednesday, December 08, 2010 - 1:30 pm

I am running a multiple group analysis for a path analysis with multiple mediators using ML, and I am getting a negative chi square value. I know that this can happen with MLR, and that you can't interpret the test. What are your recommendations when this happens with a model using ML?

Linda K. Muthen posted on Thursday, December 09, 2010 - 11:58 am

Please send the files that show this and your license number to support@statmodel.com.

Sylia Wilson posted on Saturday, February 05, 2011 - 12:09 am

Hello,

I am using multigroup models to compare across mothers and fathers in our sample. In order to fit the measurement model for our first latent construct (not yet testing invariance across groups), I need to constrain 1 of 4 indicators to be equal to 1, and the other 3 indicators to be equal to one another:
Model:
latent by obs1@1
obs2 (1)
obs3 (1)
obs4 (1);

I would now like to test invariance across mothers and fathers, but I'm not sure of the code for setting the loadings equal across groups, taking into consideration the constraints on the latent variable. What I would like to do is something like the following, where the * means equal across groups, but I know you cannot put 2 () in the same line.
Model for mothers:
latent by obs1@1
obs2 (1) (1*)
obs3 (1) (2*)
obs4 (1) (3*);
Model for fathers:
latent by obs1@1
obs2 (1) (1*)
obs3 (1) (2*)
obs4 (1) (3*);

Do you have any suggestions? Thank you very much for your time.

Linda K. Muthen posted on Sunday, February 06, 2011 - 10:57 am

The following specification holds parameters equal across variables and across groups. See TECH1 or your results to see that these equalities hold.

Philipp Th�ne posted on Wednesday, February 09, 2011 - 3:03 am

Hello,

I'd like to do a multigroup model with two groups. The first group should be composed of all observations with a value from 2 to 4 for the variable "SNB" and the other group should be composed of all observations with the value 1 for this variable "SNB". I tested several options for the grouping, such as:

GROUPING IS SNB (1 = NO_SNB 2 3 4 = SNOWB);

or

GROUPING IS SNB (1 = NO_SNB 2, 3, 4 = SNOWB);

but neither option runs.

Would be great if you could help me!

Thx

Linda K. Muthen posted on Wednesday, February 09, 2011 - 9:33 am

You need to use DEFINE to create a variable that combines the values of 2, 3, and 4.

Veronique Eicher posted on Tuesday, March 08, 2011 - 12:17 am

Dear Dr. Muthen,
I am running a multiple group analysis with two groups with very different sample sizes.
n of group 1 = 213
n of group 2 = 70
The unconstrained path coefficients are in the case of 2 paths very different for the two groups. As an example:
Group 1:
beta = -.04 (p = .674)
Group 2:
beta = -.39 (p = .006)
However, when I constrain all path coefficients for the two groups to be equal in order to test if the paths differ for the groups, the constrained model is not significantly worse than the unconstrained model (p = .355).
Is it possible that this nonsignificant difference is due to the unequal sample sizes? And if so, is there a way to circumvent the problem of the unequal sample sizes?
I thank you very much for any advice you could give me!
Veronique

Linda K. Muthen posted on Tuesday, March 08, 2011 - 9:13 am

I think the problem is lack of power.
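To put rough numbers on the power issue, the following Python sketch backs out approximate standard errors from the estimates and p-values quoted in the question and forms a Wald z-test for the difference in that single path. It rests on assumptions that are not in the original posts: that the reported p-values are two-sided z-tests and that the two group estimates are independent. The function names are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def se_from_p(estimate, p_two_sided):
    """Approximate SE implied by an estimate and its two-sided z-test p-value."""
    z = STD_NORMAL.inv_cdf(1.0 - p_two_sided / 2.0)
    return abs(estimate) / z


def wald_difference(est1, se1, est2, se2):
    """z and two-sided p for the difference of two independent estimates."""
    z = (est1 - est2) / sqrt(se1 ** 2 + se2 ** 2)
    p = 2.0 * (1.0 - STD_NORMAL.cdf(abs(z)))
    return z, p


# Values quoted in the question: group 1 (n = 213) and group 2 (n = 70).
se1 = se_from_p(-0.04, 0.674)
se2 = se_from_p(-0.39, 0.006)
z, p = wald_difference(-0.04, se1, -0.39, se2)
print(round(se1, 3), round(se2, 3), round(z, 2), round(p, 3))
```

Under these assumptions the smaller group has the noticeably larger standard error, and the single-path difference comes out at a z of around 2. An omnibus test that constrains all paths at once spreads that one difference over several degrees of freedom, which is one way such a test can fail to reject.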

Veronique Eicher posted on Wednesday, March 09, 2011 - 8:58 am

Thank you for your response Dr Muthen, I was afraid that could be the problem.
Veronique

Sofie Henschel posted on Wednesday, March 16, 2011 - 12:47 pm

Dear Linda,
I'm trying to run a multiple group analysis with three groups and imputed data. My model contains a latent variable which is regressed on four other latent variables. Furthermore, I added three covariates (following the Mplus user's guide example 5.14). The model shows good fit; however, the standard errors of the latent means for group 2 and group 3 seem to be extremely large. Running the model without the covariates leads to acceptable standard errors, so I assume there might be some problem with the covariates. Do you have any idea why the standard errors of the latent means increase when I add covariates?
Thanks in advance!

Linda K. Muthen posted on Wednesday, March 16, 2011 - 2:15 pm

Please send the full outputs and your license number to support@statmodel.com.

Richard E. Zinbarg posted on Tuesday, April 12, 2011 - 8:39 pm

Hi Linda,
Is there a limit to the number of groups that Mplus can accommodate in a multiple group analysis? In the Mplus manual, I can only find examples involving 2 groups but saw mention of 6 groups in an earlier post in this topic. I am analyzing data from a study involving 8 groups and am wondering if I can include all 8 groups in the same analysis?
Thanks!
P.S. I am very excited to see that there will be a Mac version of Mplus at some point soon (I can stop spending money on Parallels and Windows at that point). Will those of us who switch have to buy a new license or will we be able to get the Mac Version as part of our annual renewal?

Linda K. Muthen posted on Wednesday, April 13, 2011 - 6:35 am

There is no explicit limit to the number of groups.

Those who want to change from Windows to Mac will be able to do so if their upgrade and support contract is current. I am working out the details on how that will happen.

Gabriel Schlomer posted on Thursday, May 05, 2011 - 12:59 pm

Dear Dr. Muthen,

I am conducting a multiple group analysis (by gender) on a model wherein we have separate hypothesized models for men and women. Essentially we have a theoretical model for men and theoretical model for women. Both models contain the same latent variables but paths are specified to be different between the genders.

What I would like to know is if it is possible in Mplus to empirically show that the male model fits the data better for males than it does for females and that the female model fits the data better for females than it does for males.

Bengt O. Muthen posted on Thursday, May 05, 2011 - 6:44 pm

If you use the same observed variables for the two genders, yes.

Jessie Dezutter posted on Monday, May 09, 2011 - 2:44 am

Dear Drs. Muthen,

I'm trying to test a mediation model with latent variables in three religious groups in a large dataset (N = 10,000). I also want to control for the influence of another grouping variable (university), so I am using multigroup analysis as well as TYPE=COMPLEX. My mediation model runs perfectly in the whole group with the TYPE=COMPLEX statement. However, when I try to run the multigroup model I receive this warning: THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE
COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL.
PROBLEM INVOLVING PARAMETER 89.
I do not know how to interpret this warning or how to solve this problem.
Thanks for any advice!
Kind regards

Linda K. Muthen posted on Monday, May 09, 2011 - 6:12 am

Please send your output and license number to support@statmodel.com.

Jessie Dezutter posted on Tuesday, May 10, 2011 - 3:05 am

Dear Dr. Muthen,

Thanks for your quick reply. A colleague noticed in the output that one item has residual variances above 1. So, I will rerun the analyses without this item and see whether the model is identified. Thanks anyway!
Kind regards,
Jessie

rebecca lazarides posted on Friday, May 13, 2011 - 6:24 am

Dear Drs. Muthen,

I did a single group analysis as first step of a multiple group analysis (structural equation model with 4 latent factors). results indicated that in the second group one observed indicator of a latent factor had to be removed for getting a good model fit.
this probably indicates that measurement models of this factor differ, right?
my problem is now:
1) how should I proceed, if I want to test moderation in both groups?
2) may I just continue with the mga and leave this observed indicator out?
3) ai8 is the observed indicator which has to be removed in the second group (male) and if i try something like:

model:

f1 by ai8 ai9 ai10 ai11;

model male:

ai8@0;

i get the following message:

WARNING: THE LATENT VARIABLE COVARIANCE MATRIX (PSI) IN GROUP M.M. IS NOT POSITIVE DEFINITE. THIS COULD INDICATE A NEGATIVE VARIANCE/RESIDUAL VARIANCE FOR A LATENT VARIABLE, A CORRELATION GREATER OR EQUAL TO ONE BETWEEN TWO LATENT VARIABLES, OR A LINEAR DEPENDENCY AMONG MORE THAN TWO LATENT VARIABLES. CHECK THE TECH4 OUTPUT FOR MORE INFORMATION.
PROBLEM INVOLVING VARIABLE AI8.

thank you!!

rebecca lazarides posted on Friday, May 13, 2011 - 6:39 am

Sorry, I only now saw that ai8@0 fixes the variance at 0, so that is not the solution to my problem. My question is how to analyse moderation effects if the factor structure differs in the two groups because of one observed indicator.

Bengt O. Muthen posted on Sunday, May 15, 2011 - 8:33 pm

Deleting an item is done by not including it on the USEV list. But if you have a model where in one group an item makes it not fit the data, then multiple-group invariance for the remaining items seems unlikely. If you want to model moderation of structural relations by group membership, you need measurement invariance.

Kiana Johnson posted on Monday, May 16, 2011 - 9:15 am

Professors Muthen,
I am trying to conduct a multiple group (4 groups) SEM with seven latent variables and one single indicator. This model also initially had two higher order factors.

1st I conducted 4 single group CFAs. The model fit was good for three of four groups. there were two errors for the last group, one for an observed variable and the other for a second order latent factor- both I believe were linear dependence related.

When I removed the observed variable I only received one non positive definite error. I removed the second order factor and repeated the analysis in all four groups and the model fit was acceptable for all. Can you please help me understand why this helped since its basically the same model?

Also, in testing configural invariance, I have two questions about single indicators and second-order factors. Are single indicators as well as the first first-order factor omitted in the group-specific model statements?

Bengt O. Muthen posted on Monday, May 16, 2011 - 8:49 pm

A 2nd-order factor model usually puts constraints on the covariance matrix of the 1st-order factors so it isn't the same model as without a 2nd-order structure.

Regarding configural invariance - I don't think they should be.

magushi monepa posted on Monday, May 30, 2011 - 9:06 am

Hi, I need your help. I want to model the survival of birds as a function of weight and sex using logistic regression. The probability of survival for females is given by pf(x) = exp(b0+b1x)/(1+exp(b0+b1x))
and the probability for males is given by pm(x) = pf(x+b2/b1). Can someone help me verify this? How do I verify it?

Bengt O. Muthen posted on Monday, May 30, 2011 - 9:35 am

Do males and females have the same slope for weight?

Did you mean to write

pm(x) = pf(x+b2/b1)?

What does the right-hand side mean?
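
One way to read the right-hand side, sketched here in Python as an editorial illustration (the coefficient values are hypothetical, and this assumes b1 is nonzero and the weight slope is the same for both sexes): shifting the weight by b2/b1 in the female curve is algebraically the same as adding b2 to the intercept.

```python
import math

def logistic(z):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-z))

def p_female(x, b0, b1):
    """Female survival probability at weight x."""
    return logistic(b0 + b1 * x)

def p_male(x, b0, b1, b2):
    """Male survival probability written as a shifted female curve."""
    return p_female(x + b2 / b1, b0, b1)

# Hypothetical coefficients, chosen only for illustration.
b0, b1, b2 = -2.0, 0.15, 0.6
x = 20.0

shifted = p_male(x, b0, b1, b2)
intercept_shift = logistic((b0 + b2) + b1 * x)
print(abs(shifted - intercept_shift) < 1e-9)
```

Under that reading, b2 is simply the male versus female difference in the intercept on the logit scale.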

HwaYoung Lee posted on Wednesday, June 22, 2011 - 1:12 pm

Dear Dr. Muthen

I am conducting Multi-Group SEM to do cross-cultural research for my dissertation.
I tested my final model using MPLUS and I could obtain fine fit indexes.

Measurement fixed, Structure fixed (Strict invariance)(using ONLY Correlation matrix and STD as the dataset)
CFI = 0.93, TLI = 0.90, SRMR = 0.09, SRMR = 0.09

However, I could not figure out how to compare factor means using this correlation matrix as the data set.

Therefore, I tried to add mean (means of observed variables (Ys)) in this data set.
Syntax: type is correlation and std and means

And I got pretty different results.
All syntax is same except including means in the data set.
CFI = 0.85, TLI = 0.81, RMSEA = 0.13, SRMR = 0.10

(1) Why is it different? And is it okay to use only the correlation matrix to conduct multi-group SEM?

(2) When I used raw data (or correlations, means and std), I found that one group has consistently higher observed means (Y's) than the other cultural group, so I think it is probably weak invariance. However, everything else is very similar (the structure is similar); only the observed means differ by group. Can I use multi-group SEM?

Thank you so much in advance.

Byungbae Kim posted on Wednesday, June 22, 2011 - 6:33 pm

Hello, and thank you as always for your support.
I have a question on multiple group comparison. I would like to test for the moderating effect of a drug crime (drug vs. non-drug offenders). I am especially interested in how a drug crime conditions the effect of being a racial minority on sentence length in courts. I wonder if it is OK for me to impose constraints on one or two related variables of interest and do chi-square difference tests against the unconstrained model. Some of my old material told me that I first have to do a structural invariance test (?), which is the chi-square difference test between the unconstrained model and the fully constrained model, and that only in situations where there is no statistical difference in that chi-square test can I proceed to the path-by-path tests using chi-square difference tests, as in the former approach. Which one is correct? If the latter approach is correct, then I wonder why we do the path-by-path tests even though we do not find any difference when we constrain all the paths. I am confused. So, my question is: "Do we need to do a structural invariance test even though I am doing just a path analysis with only observed variables, not SEM?"

Thank you very much in advance!

Linda K. Muthen posted on Thursday, June 23, 2011 - 11:11 am

HwaYoung:

1. The differences are due to adding means to the data. The model then constrains the intercepts to be equal across groups.

You cannot use only the correlations. You must also use the standard deviations as you mentioned above. Mplus turns these into a covariance matrix.

2. The high observed variable means may result in high factor means. It is the intercepts for which you are testing measurement invariance.
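
As an editorial illustration of point 1, the conversion from correlations and standard deviations to a covariance matrix can be sketched as follows. This is a plain Python sketch with made-up numbers, not Mplus code; the function name is hypothetical.

```python
def corr_to_cov(corr, sds):
    """Rescale a correlation matrix by the standard deviations: cov[i][j] = r[i][j] * sd[i] * sd[j]."""
    n = len(sds)
    return [[corr[i][j] * sds[i] * sds[j] for j in range(n)] for i in range(n)]

# Made-up two-variable example.
corr = [[1.0, 0.5],
        [0.5, 1.0]]
sds = [2.0, 3.0]

cov = corr_to_cov(corr, sds)
print(cov)  # [[4.0, 3.0], [3.0, 9.0]]
```

Without the standard deviations the variances cannot be recovered, which is why correlations alone are not enough.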

Linda K. Muthen posted on Thursday, June 23, 2011 - 11:13 am

Byungbae:

I think it would be fine to test only the coefficients for which you have a substantive hypothesis. You can do this using MODEL TEST.

HwaYoung Lee posted on Thursday, June 23, 2011 - 4:28 pm

Dear Dr. Muthen,
I really appreciate your comments.
So, does this mean that it is okay to use a correlation matrix and std for multigroup SEM? What is the limitation of using a correlation matrix and std?

I have one more question. When I use raw data for multigroup SEM,
I got the decent fit indexes for each cultural group when I conducted CFA for each cultural group (measurement model).
When I conducted multigroup SEM, fit indexes were low, so I released some of intercepts (observed means) for one group and then I got a good result.

MODEL :
F1 BY Y2* Y5 Y11 Y12;
F2 BY Y4 y7 Y10;
F3 BY Y3 Y6 Y9;
F1@1;

F2 on F1;
F3 on F1;
F2 with F3;
Y17 on F1(4);
Y17 on F2(5);
Y17 on F3(6);

Y7 with Y10;
Y7 with Y3;
Y6 with Y4;
Y6 with Y3;
Y10 with Y3;

Model G:
[Y7 Y9 Y12];

-->CFI 0.933
TLI 0.911
RMSEA 0.088
SRMR 0.096

Is this partial measurement invariance?
I can't compare factor means, right?

Thank you so much for your help.

Linda K. Muthen posted on Thursday, June 23, 2011 - 5:25 pm

I would not use only the correlations and standard deviations. I would use also the means which is the default with raw data.

Please see multiple group analysis in the Topic 1 course handout on the website. It discusses testing for measurement invariance in addition to testing of factor means across groups. There is also a video that you can watch.

peter pitt posted on Friday, July 01, 2011 - 1:26 pm

Dear professors,

I have some questions with respect to the factor variances in multigroup EFA (ESEM). (a) Suppose that the variables are standardized per group and that I didn't constrain the factor variances to be equal, for example to one (but instead I constrained some loadings to one to solve the identification problems), are these factor variances then subject to any constraint (e.g., the sum of factor variance of a factor in group A + factor variance of the same factor in group B = 1)? What would be the influence of standardizing the concatenated data instead of standardizing for each group separately? (b) Is it possible to find a solution with the same factor loadings, but with factors that have different factor variances in each group (and if so, what does this mean then)?

Thank you very much!

Bengt O. Muthen posted on Saturday, July 02, 2011 - 8:30 am

(a) You don't want to standardize variables in a multi-group analysis because then you cannot study group diffs in means and variances.

(b) Multi-group ESEM has the default of group-invariant loadings and intercepts and group-varying factor variances and means. The goal of multi-group analysis is to be able to study population (people) diffs in factors when measurement (variable) par's are the same.

Jan-Henning Ehm posted on Friday, August 05, 2011 - 6:55 am

Hi,

I would like to compare standardized paths in a multi-group analysis. My model is the following (three latent dependent variables and three exogenous manifest variables):

SR BY y1-y5;
SW BY y6-y10;
SC BY y11-y15;
SR ON A B C;
SW ON A B C;
SC ON A B C;
A with B C;
C with B;
SR with SW SC;
SC with SW;

I have two groups and want to compare the standardized path from C to SC across these two groups. I don't know how to create the standardized coefficients in MODEL CONSTRAINT. A similar question was posted on Wednesday, August 04, 2010 - 11:34 am by Simon O. F. at
http://www.statmodel.com/discussion/messages/11/16.html?1309783320

I think I can compare the two standardized paths with this equation:
beta_CSC1 = beta_CSC2*(sqrt(sdC2)/sqrt(sdSC2))/(sqrt(sdC1)/sqrt(sdSC1)). But how can I define beta_CSC2 and beta_CSC1, as well as the variance of SC, as NEW parameters in MODEL CONSTRAINT and test the difference using MODEL TEST?

Thanks a lot in advance,
Jan-Henning Ehm

Bengt O. Muthen posted on Saturday, August 06, 2011 - 9:15 am

The standardized beta is

beta*SD(x)/SD(y).

Your SD(x) is the standard deviation of C and your SD(y) is the standard deviation of the dependent factor SC. By regular regression expectation algebra you compute SD(SC) as the sqrt of

V(SC) = beta1^2*V(A)+beta2^2*V(B)+beta3^2*V(C)+ 2*beta1*beta2*Cov(A,B)+ 2*beta1*beta3*Cov(A,C)+2*beta2*beta3*Cov(B,C)+resvar(SC),

where resvar(SC) is the residual variance of SC that you get in the output.
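
As an editorial illustration, the arithmetic can be sketched in Python for one group. All numbers are hypothetical stand-ins for values read from the output, and the variance terms use squared coefficients, as in the usual variance formula for a linear combination; the function names are made up for this sketch.

```python
import math

def model_implied_variance(betas, cov_x, resvar):
    """V(y) = sum_i sum_j beta_i * beta_j * Cov(x_i, x_j) + residual variance."""
    k = len(betas)
    explained = sum(betas[i] * betas[j] * cov_x[i][j]
                    for i in range(k) for j in range(k))
    return explained + resvar

def standardized_beta(beta, var_x, var_y):
    """beta * SD(x) / SD(y)."""
    return beta * math.sqrt(var_x) / math.sqrt(var_y)

# Hypothetical estimates for one group: slopes of SC on A, B, C.
betas = [0.2, 0.1, 0.5]
cov_x = [[1.0, 0.3, 0.2],   # covariance matrix of A, B, C
         [0.3, 1.0, 0.4],
         [0.2, 0.4, 1.0]]
resvar_sc = 0.6

v_sc = model_implied_variance(betas, cov_x, resvar_sc)
std_c_to_sc = standardized_beta(betas[2], cov_x[2][2], v_sc)
print(round(v_sc, 3), round(std_c_to_sc, 3))
```

Repeating the same computation with the second group's estimates gives the two standardized coefficients to compare.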

Jeffrey Duong posted on Saturday, October 08, 2011 - 2:17 pm

Thank you Drs. Muthen & Muthen for taking the time to answer our questions.

I was wondering if someone could please explain to me the difference between fitting a full SEM where we do not specify that there are two different groups, as opposed to a model where we specify GROUP IS but constrain parameters to be equal.

As an example, for a study I am working on, I am performing two multiple group analyses. I first fit a full SEM. I then performed a Multiple Group Analysis comparing the parameters between two groups of teachers based on their teaching experience. The parameters of the constrained model differ from the parameters obtained in the full SEM. I then performed another Multiple Group Analysis comparing the parameters between two groups of teachers based on school level. I found that parameters of the constrained model also differed from the full SEM as well as from the constrained model of the first Multiple Group Analysis.

Is this supposed to happen? If so, I also wonder why some journal articles I have read do not report the results of their constrained models.

Thank you!

Linda K. Muthen posted on Sunday, October 09, 2011 - 9:29 am

Looking at the sample of males and females together is not the same as a multiple group analysis where the coefficients are held equal between males and females. The first analysis is a mixture. See the following paper which is available on the website for further information:

Muthén, B. (1989). Latent variable modeling in heterogeneous populations. Psychometrika, 54:4, 557-585.

Jeffrey Duong posted on Sunday, October 09, 2011 - 2:15 pm

Fantastic. Thank you very much for this.

Siran Zhan posted on Sunday, October 09, 2011 - 11:26 pm

Dear Dr. Muthen,

I'm trying to establish measurement equivalence between 2 groups. The problem I'm facing is that one of the groups has no data at all on one of the manifest variables. Mplus did not run my script because of that and returned the error message "One or more variables in the data set have no non-missing values". Can you suggest a way for me to get around this?

Thank you very much for your help in advance!

Linda K. Muthen posted on Monday, October 10, 2011 - 7:38 am

See the FAQ on the website about having a different number of variables in different groups.

sam posted on Monday, October 10, 2011 - 2:44 pm

Hi,

I'm interested in comparing whether the path coefficients are different across groups. My model consisted of 2 latent variables and 3 observed variables. I came across some articles stating that the comparison would be more meaningful if measurement invariance can be established before comparing the path coefficients. Is the measurement invariance necessary? If yes, how could I establish the measurement invariance where some of my variables are observed variables?

Thank you very much.

Linda K. Muthen posted on Monday, October 10, 2011 - 3:00 pm

Measurement invariance applies to latent variables not observed variables. It is necessary to establish that the latent variables have the same meaning in different groups if comparisons are going to be made across groups. See the Topic 1 course handout on the website under Multiple Group Analysis.

Anto John Verghese posted on Monday, October 31, 2011 - 9:21 am

Dear Dr. Muthen,

I am testing for invariance across two groups and I want to know if it is possible to constrain only the factor loadings across groups without constraining the error terms across the groups.

For example when I use

Model yes:F1 by y1(1);
Model No: F1 by y1(1);

This seems to constrain both the error terms and the factor loadings across groups.

Thanks

Linda K. Muthen posted on Monday, October 31, 2011 - 2:01 pm

I don't know why the error terms would be constrained with what you show. Please see the Topic 1 course handout under multiple group analysis for the inputs for testing measurement invariance. If these don't help, please send your output and license number to support@statmodel.com.

ellen posted on Monday, November 07, 2011 - 11:37 pm

Dear Drs. Muthen,
I have a question about how to test invariance of path coefficients for structural paths using Mplus. I am not testing measurement invariance. I am comparing SEM models for 3 groups, and see some paths seem to be comparable across groups.
I read something about examining the "completely standardized common metric solution" in LISREL. However, I use Mplus. What section of an Mplus output would suggest that significant group differences are present for some specific relationships (e.g., between A & B)?
Here is what I read:
"We used LISREL to examine the invariance of path coefficients for structural paths in the SEM model by conducting multiple-group comparison for boys and girls. To compare the two models, we conducted a model in which the relations among A, B, C, D variables were freely estimated and a model in which the relations were set to be equal for boys and girls. We then used the chi-square difference test to examine whether these models were equivalent. Results showed there was a significant chi-square difference... Examination of the completely standardized common metric solution suggested that significant group differences were present for the relationship between A and B ... To confirm this, we compared a model in which the relationships among A, B, C, D, E were set to be equal for boys and girls with a model in which A and B were freely estimated. There was a significant chi-square difference between the models..."

Linda K. Muthen posted on Tuesday, November 08, 2011 - 6:31 am

They did a chi-square difference test where they estimated two models. One where regression coefficients were free across groups, for example,

MODEL:
y1 ON x1;
y2 ON x2;

and one where they were constrained to be equal across groups, for example,

MODEL:
y1 ON x1 (1);
y2 ON x2 (2);

Then they did a chi-square difference test as described on pages 434-435 of the Mplus User's Guide.

ellen posted on Tuesday, November 08, 2011 - 10:26 am

Dr. Muthen,

Thanks for your prompt response! I know how to conduct a difference test for two models, but my question is more about how to find a "justification" in the Mplus output for suspecting that some (but not all) path coefficients may be equivalent across groups.
The article I described above (Nov. 7) uses the "completely standardized common metric solution" in LISREL to justify testing a model where only some paths were set to be equal while other paths were freely estimated across groups. I am wondering whether some part of the Mplus output could provide a justification for setting certain paths equal, rather than relying only on my subjective view.

Please help! Thanks so much!

ellen posted on Wednesday, November 09, 2011 - 1:42 pm

Dear Drs. Muthen,

Could you respond to my question (posted above; 11/8) and restated below?

How can I find a "justification" in the Mplus output for suspecting that some (but not all) path coefficients may be equivalent across groups?
Some researchers use the "completely standardized common metric solution" in LISREL to justify testing a model where only some paths were set to be equal while other paths were freely estimated across groups. I am wondering whether some part of the Mplus output could provide a justification for setting certain paths equal.

Linda K. Muthen posted on Wednesday, November 09, 2011 - 5:32 pm

I am unclear what "completely standardized common metric solution" means. If it means you are comparing standardized coefficients across groups, I would not recommend this. I would compare raw coefficients. You should have a theory about which coefficients you expect to be different across groups. If you are in an exploratory setting, I would hold all raw coefficients equal across groups and look at their modification indices.

ellen posted on Wednesday, November 09, 2011 - 9:04 pm

Thanks! Could I ask 2 more questions? (sorry new to Mplus!) How do I "hold all raw coefficients equal across groups"? Is below the right way to write the commands?

Also, how do I interpret "M.I." and "E.P.C."? I read the User's Guide (pp. 646-647) but still don't understand it...
......

GROUPING = race ( 1 = Black 2 = Asian 3 = White);

ANALYSIS:
ESTIMATOR = MLR ;
MODEL:
A by A1 A2 A3 ;
T by T1 T2 T3 ;
O by O1 O2 O3 ;
S by S1 S2 S3 ;

T on A (1);
O on A T (2);
S on A T O (3);

OUTPUT:
sampstat; standardized sampstat; Modindices (0) ;

Linda K. Muthen posted on Thursday, November 10, 2011 - 6:26 am

T on A (1);
O on A (4)
T (2);
S on A (5)
T (6)
O (3);

Chapter 14 has a discussion of multiple group analysis.

Linda K. Muthen posted on Thursday, November 10, 2011 - 6:30 am

I would concentrate on modification indices. The value given is the decrease in chi-square if the equality is removed. The value 3.84 is the chi-square critical value at the .05 level for one degree of freedom. Any MI over this value means that fit would improve significantly if the equality is removed, that is, that the coefficients are not equal across groups.
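
As an editorial illustration of where 3.84 comes from, a modification index can be converted to an approximate p-value with the chi-square distribution on one degree of freedom. This is a Python sketch; the function name is made up here.

```python
import math

def chi2_1df_pvalue(x):
    """Upper-tail probability of a chi-square variable with 1 df.

    With 1 df, P(X > x) = P(|Z| > sqrt(x)) for a standard normal Z.
    """
    return math.erfc(math.sqrt(x / 2.0))

p_at_cutoff = chi2_1df_pvalue(3.84)
print(round(p_at_cutoff, 3))  # 0.05
```

An MI of 10, for example, corresponds to a much smaller p-value than an MI of 4, so larger values are stronger evidence against the equality.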

ellen posted on Thursday, November 10, 2011 - 10:58 pm

Dr. Muthen,
Thank you! This is helpful! May I ask a couple follow-up questions:

There was a path (e.g., A->T) that was significant (p< .01) only in the Asian group, but not in the Black or White groups when estimated freely. However, when it was fixed to be equal across groups, the M.I. was not greater than 3.84? Does that mean we can consider this specific coefficient equal across groups? ... this does not seem to make sense-- because when estimated freely, it was significant at p< .01 in one group, while in the other two groups it was not significant. How to explain this?

Also, could you tell me how to interpret an "E.P.C."? I read the user guide but still am confused...

(Thanks SO MUCH! ...& sorry about the basic questions.)

Linda K. Muthen posted on Friday, November 11, 2011 - 11:18 am

You are looking at two different types of tests. One coefficient can be significantly different from zero and the other not even though the two coefficients may not be significantly different from each other.

EPC is the value the parameter would take if it is free.
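
As an editorial illustration of the two types of tests, the following Python sketch uses hypothetical estimates and standard errors. It assumes independent groups, so that the variance of the difference is the sum of the two squared standard errors; MODEL TEST in Mplus works from the estimated parameter covariance matrix instead.

```python
import math

def wald_z(estimate, se):
    """z statistic and two-sided normal p-value for H0: parameter = 0."""
    z = estimate / se
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p

def difference_test(b1, se1, b2, se2):
    """Test H0: b1 = b2 for estimates from two independent groups."""
    return wald_z(b1 - b2, math.sqrt(se1 ** 2 + se2 ** 2))

# Hypothetical path estimates in two groups.
z_a, p_a = wald_z(0.30, 0.10)          # group A versus zero
z_b, p_b = wald_z(0.12, 0.10)          # group B versus zero
z_d, p_d = difference_test(0.30, 0.10, 0.12, 0.10)

print(p_a < 0.01, p_b > 0.05, p_d > 0.05)
```

In this made-up case one coefficient differs significantly from zero, the other does not, and yet the difference between them is not significant.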

Jiyeon So posted on Tuesday, November 22, 2011 - 11:40 pm

I have a hypothesis that predicts : The model will receive stronger support from sexually active group (group A) than sexually inactive group (group B).

To test this hypothesis, I think I should compare model fit across the two groups. However, since the model is the same (and only the sample is different), the chi-square difference test does not apply here because it is only for nested models.

Is there some sort of significance test for comparing model fit across two groups? I understand this may not be a specific Mplus question but I'm using Mplus and find this board very helpful. Please advise me what to do! I would really appreciate it!!

Linda K. Muthen posted on Wednesday, November 23, 2011 - 10:23 am

I can't think of any way to test that.

Ryan Johnson posted on Friday, January 06, 2012 - 7:45 am

Dr. Muthen,

I am testing a model for measurement invariance by gender, and receive the following error message: THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL. PROBLEM INVOLVING PARAMETER 43.

When I run the analyses for the entire sample everything goes smoothly. The error only comes up when I attempt a multi-group analyses. My syntax is below, and parameter 43 corresponds to the Alpha value for zmomhrs in the female sample. Can you advise on what might be causing this error and how I might be able to work around it? Thanks so much!

Variable:
(variables removed)
MISSING = .;
GROUPING = childgender (1=male 2=female)
Analysis:
Model NOCOVARIANCES;
Model:
f1 by;
f1 on zfamilyinc@1 zedusum;
f1@0;
f2 by zroleambig_r zdeclat_r;
f3 by zchildh_r zmchildh_r zbmi_r;
zmompa on f2;
zmompa on zmomhrs;
zmompa on f1;
zmchildpa on zmompa (1);
zmchildpa on f1;
f3 on zmchildpa (2);
f3 on f1;
f2 on f1;
zmomhrs on f1;
f2 with zmomhrs;
Output:
stdyx tech1;

Linda K. Muthen posted on Friday, January 06, 2012 - 10:08 am

Please send the full output and your license number to support@statmodel.com.

Steven John posted on Wednesday, January 18, 2012 - 5:09 am

Hi

I'm currently doing an MGA comparing the correlation between A and B for two primary school grades. The correlation between A and B is different, but significant, in the separate analyses I ran earlier for both grades. I now want to compare the correlations to see if the difference between grades is significant. To me it seems appropriate to run a totally relaxed MGA with both grades and then another where the correlation under investigation is constrained to be equal. Thereafter I use the chi-square difference test for nested models to calculate whether the models differ. Am I correct?

BR

Linda K. Muthen posted on Wednesday, January 18, 2012 - 8:07 am

You can do this. Or you can do it in one step using MODEL TEST. See the user's guide for further information.

Steven John posted on Thursday, January 19, 2012 - 1:03 am

Thanks! However, I got a message in the output and therefore calculated the TRd according to the formula on the Mplus website. (Probably because I run the MLR estimator?)

If I run the model totally constrained and thereafter relax the correlation under investigation, would this also be correct? This seems to be the common way of doing it. However, it seems a bit strange to impose equal factor loadings across grades instead of assuming that they theoretically measure the same construct (factor loadings vary across grades but follow the same pattern). The totally constrained model also fits the data poorly.

Many thanks.

Linda K. Muthen posted on Thursday, January 19, 2012 - 10:24 am

Holding factor loadings and intercepts equal represents measurement invariance. You must first establish measurement invariance before coefficients related to latent variables can be compared across groups. See the Topic 1 course handout and video for a discussion of this topic.

Marie-Helene Veronneau posted on Wednesday, January 25, 2012 - 9:18 am

Hi!
I am comparing nested models that were estimated using MLR. I applied the chi-square difference test formula presented on your website (http://www.statmodel.com/chidiff.shtml).

I want to check that I am using the correct information from the output to do the computation.

In the following formula...
cd = (d0 * c0 - d1*c1)/(d0 - d1)

...I took the degrees of freedom (d0 and d1) in the "Chi-Square Test of Model Fit" section from each output. Is that correct? (There is another section in the output called "Chi-Square Test of Model Fit for the Baseline Model" and I want to be sure which values to use.)

Then, for the correction factor (c0 and c1), I used the following : in the "Loglikelihood" section, "H0 Scaling Correction Factor for MLR". Is that correct?

Thank you very much.

Linda K. Muthen posted on Wednesday, January 25, 2012 - 11:29 am

Yes, that is correct. You don't want the Baseline Model.

Yes, you use the H0 scaling correction factor.
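
As an editorial illustration, the computation can be sketched in Python. The cd formula is the one quoted above; the final step, TRd = (T0*c0 - T1*c1)/cd, is how the chi-square version is usually stated and should be checked against the web page. All input values are hypothetical, and c0 and c1 must be the scaling correction factors that belong to the statistic being differenced.

```python
def scaled_chi2_difference(t0, c0, d0, t1, c1, d1):
    """Scaled chi-square difference test for nested models.

    t0, c0, d0: chi-square, scaling correction factor, df of the nested (more restricted) model
    t1, c1, d1: the same quantities for the comparison (less restricted) model
    Returns (TRd, df difference, cd).
    """
    cd = (d0 * c0 - d1 * c1) / (d0 - d1)
    trd = (t0 * c0 - t1 * c1) / cd
    return trd, d0 - d1, cd

# Hypothetical output values.
trd, df_diff, cd = scaled_chi2_difference(
    t0=120.0, c0=1.20, d0=50,
    t1=100.0, c1=1.25, d1=45,
)
print(round(trd, 2), df_diff, round(cd, 2))
```

TRd is then referred to a chi-square distribution with df_diff degrees of freedom.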

KASIM YILDIRIM posted on Sunday, February 19, 2012 - 6:55 am

I am a novice user of multiple group analysis. I did all the analyses, but I do not know how to report and interpret the results for the manuscript, or which model (unconstrained, structural weights, structural covariances, or structural residuals) I should use for reporting.

thanks

Linda K. Muthen posted on Sunday, February 19, 2012 - 8:20 am

Please see multiple group analysis in the Topic 1 course handout and video on the website. See also how results are reported in the journal you plan on submitting to. If this does not help, I suggest posting this on a general discussion forum like SEMNET.

KASIM YILDIRIM posted on Sunday, February 19, 2012 - 9:18 am

thank you for your attention

Bellinda King-Kallimanis posted on Wednesday, February 29, 2012 - 8:46 am

Hello, I have a question about a negative residual variance. It is very small and not significant, so I want to fix it to zero. That is not a problem, but now I don't have the correct degrees of freedom. I cannot find any reference to this anywhere. Is there a way that I can make Mplus estimate only positive residual variances so that my degrees of freedom are correct?

Thank you!
Bellinda

Linda K. Muthen posted on Wednesday, February 29, 2012 - 6:04 pm

Use MODEL CONSTRAINT to constrain the parameter to be greater than zero.

Richard E. Zinbarg posted on Friday, April 27, 2012 - 9:36 am

Hi,
We are trying to run a multiple group version of a Cole & Maxwell Trait-State-Occasion model. When we run it in our entire sample, the model converges just fine. When we try to run a multiple group, metric invariant version of the model we get a message saying that standard errors could not be computed and the model may not be identified and the problem appears to be with a parameter related to the mean structure. We believe this may be due to the fact that the occasion factors in the Cole & Maxwell model are actually the residual variances in the State factors (after accounting for the Trait factor) and therefore are not independent of the state and trait factors but that Mplus is trying to estimate group differences on all three sets of factors (trait, state and occasion) as if they were independent parameters. We are not certain though. Does this make sense? If so, any thoughts on how we fix the identification problem in the multiple group model?

Linda K. Muthen posted on Friday, April 27, 2012 - 1:48 pm

Maybe the occasion factors cannot have free means. With free means, the intercepts need to be constrained across groups. You may need to ask the authors for help.

emmanuel bofah posted on Tuesday, September 04, 2012 - 1:51 pm

How many groups can Mplus test per analysis in ESEM, e.g., 2, 3, or 4? What is the maximum?

Bengt O. Muthen posted on Tuesday, September 04, 2012 - 2:18 pm

There is no limit except your computer's. I have done 34.

ellen posted on Monday, September 10, 2012 - 6:58 pm

if I want to test whether two parameters are equal across three groups, is it accurate to write the Mplus language in the following way? (knowing the overall multigroup comparison chi-square difference is significant across groups.)

MODEL African:
Sg ON De (p1);
Rm WITH Ot (p2) ;

MODEL Asian:
Sg ON De (p3);
Rm WITH Ot (p4);

MODEL Hispanic:
Sg ON De (p5);
Rm WITH Ot (p6);

MODEL TEST:
p1 = p3;
p1= p5;
p2=p4;
p2 = p6;

Is this the correct way to test whether the parameters of "Sg ON De" and "Rm WITH Ot" are equal across the 3 groups?:

Linda K. Muthen posted on Tuesday, September 11, 2012 - 12:29 pm

MODEL TEST:
0 = p1 - p3;
0 = p1 - p5;
0 = p2- p4;
0 = p2 - p6;

I am removing your other post. It is not necessary to post the same question more than once.

Sofie Henschel posted on Thursday, September 20, 2012 - 8:43 am

Hello,
I am running a multiple group (male/female) SEM model with categorical indicators. I freely estimated the latent means in both groups (by fixing one threshold of the indicators to 0), but I am not sure which metric these latent means now have and how to interpret them. Some means in both groups are negative (e.g. -0.25 vs. -0.39), but my categorical indicators only range from 1 to 4. It seems to me that the latent means are in some way centered at 0 and what I get are deviations from zero. But I don't understand what the reference (0) is here: the whole sample, or the female or male group? And do you suggest reporting the standardized or the unstandardized means?
Thanks, Sofie

Linda K. Muthen posted on Thursday, September 20, 2012 - 2:53 pm

There is no gain to freeing the factor mean and fixing one threshold. Using the standard approach, the factor mean is zero in the reference group and other group means are deviations from zero. When you fix one threshold to zero, the factor mean is in the metric of the threshold. Thresholds are in the metric of z-scores, not of the categories of the variable.

Herb Marsh posted on Monday, September 24, 2012 - 3:15 am

I have a large data set with 26 groups (13 countries x 2 age cohorts) with 3000+ cases for each group. I began by showing reasonable invariance of factor loadings across the 26 groups. In the multi-group SEM I have several key path coefficients. I would like to do something like an ANOVA to determine how much of the differences in a path coefficient across the 26 groups can be explained by country, age-cohort, and their interaction. Can I do this with either "Model Test" or "Model Constraint"? I did something along these lines previously when there were 4 groups (with a 2 x 2 design) with model constraint where the main and interaction effects were df=1 contrasts.

Tihomir Asparouhov posted on Monday, September 24, 2012 - 11:56 am

You can do it with Model Constraint. You have 26 coefficients and you can write the two-way ANOVA sum of squares decomposition in Model Constraint.
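
As an editorial illustration, the arithmetic of such a decomposition can be sketched in Python for a small table of coefficients (3 hypothetical countries by 2 cohorts; the values and names are made up). This is only the descriptive, unweighted decomposition of the point estimates, one coefficient per cell; it ignores sampling variability, and in Mplus the same quantities would have to be written as NEW parameters.

```python
def two_way_ss(table):
    """Unweighted two-way sum-of-squares decomposition with one value per cell."""
    rows, cols = len(table), len(table[0])
    grand = sum(sum(r) for r in table) / (rows * cols)
    row_means = [sum(r) / cols for r in table]
    col_means = [sum(table[i][j] for i in range(rows)) / rows for j in range(cols)]

    ss_rows = cols * sum((m - grand) ** 2 for m in row_means)   # country main effect
    ss_cols = rows * sum((m - grand) ** 2 for m in col_means)   # cohort main effect
    ss_inter = sum((table[i][j] - row_means[i] - col_means[j] + grand) ** 2
                   for i in range(rows) for j in range(cols))   # interaction
    ss_total = sum((table[i][j] - grand) ** 2
                   for i in range(rows) for j in range(cols))
    return ss_rows, ss_cols, ss_inter, ss_total

# Hypothetical path coefficients: rows are countries, columns are age cohorts.
coefs = [[0.20, 0.30],
         [0.25, 0.45],
         [0.10, 0.15]]

ss_country, ss_cohort, ss_interaction, ss_total = two_way_ss(coefs)
print(round(ss_country / ss_total, 3),
      round(ss_cohort / ss_total, 3),
      round(ss_interaction / ss_total, 3))
```

The three proportions sum to one, which gives the share of the between-group variation in the coefficient attributable to country, cohort, and their interaction.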

ellen posted on Tuesday, September 25, 2012 - 12:39 am

Hi,
I am running a multigroup SEM model (African, Asian, Latino). The structural parameters initially showed very different results across groups. For example, one structural parameter was -.22** for the Latino group, .11 (not significant) for the African group, and -.12 (not significant) for the Asian group. However, when I used MODEL TEST to examine parameter equalities, the results showed the parameter was NOT significantly different across groups. This is puzzling to me because the parameter was initially only significant for the Latino group (-.22**), was not significant and in the opposite direction (positive .11) for the African group, and was not significant for the Asian group (-.12). How could the three parameters not be significantly different? Does it mean that statistically they are considered equivalent? If they are considered equivalent statistically, do I have to constrain the parameters to be equal across groups and claim there is structural invariance?

When I constrained it to be equal across the three groups, I got a result that shows this parameter was significant across ALL groups. How do I interpret the results here?

I am just confused about why three parameters that seemed so different initially (e.g., in opposite directions, with only one statistically significant) would turn out to be statistically equivalent.

Linda K. Muthen posted on Tuesday, September 25, 2012 - 8:46 am

What happens when you test -.22 versus .11?

ellen posted on Tuesday, September 25, 2012 - 9:11 am

When I tested -.22 versus .11, it showed no statistical difference either. It is very puzzling to me...
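
A coefficient can be significant in one group and not in another without the two differing significantly, because the equality test compares the difference between the estimates with the standard error of that difference rather than each estimate with zero. A minimal Python sketch of that arithmetic follows. The standard errors are hypothetical (none were given in the post), and the groups are assumed independent so that the two estimates are uncorrelated.

```python
import math

def wald_z(est_a, se_a, est_b, se_b):
    """z statistic for H0: est_a == est_b, assuming independent estimates."""
    se_diff = math.sqrt(se_a ** 2 + se_b ** 2)
    return (est_a - est_b) / se_diff

def two_sided_p(z):
    """Two-sided p-value under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2))

# Estimates from the post; the standard errors are made up for illustration.
latino, se_latino = -0.22, 0.08
african, se_african = 0.11, 0.20

# Test of the Latino coefficient against zero.
z_latino = latino / se_latino
# Test of the Latino coefficient against the African coefficient.
z_diff = wald_z(latino, se_latino, african, se_african)

print("Latino vs 0:      z = %.2f, p = %.3f" % (z_latino, two_sided_p(z_latino)))
print("Latino vs African: z = %.2f, p = %.3f" % (z_diff, two_sided_p(z_diff)))
```

With these assumed standard errors the Latino coefficient differs from zero, yet the imprecise African estimate makes the difference between the two coefficients non-significant.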

Linda K. Muthen posted on Tuesday, September 25, 2012 - 1:53 pm

Please send the relevant outputs and your license number to support@statmodel.com.

Herb Marsh posted on Friday, September 28, 2012 - 10:39 pm

Tihomir:
Thank you for your assistance. However, I have not been able to work out how to follow your suggestion.

In model constraint I:
1. computed age cohort differences for each of the 13 countries, and then took deviations of these from the mean cohort difference over all countries. I then used 'model test' to test whether 13-1 country deviations were simultaneously equal to zero. I guess that this is a test of the country-by-cohort interaction.
2. I computed country means (averaged across the two cohorts) for each country, and then took deviations of these from the grand mean. However, I could not use 'model test' to test whether these were simultaneously equal to zero without a separate analysis. As I have a LOT of coefficients to test, this would require 1000s of lines of code and many separate analyses.

More importantly, these did not really give me the two-way ANOVA sum of squares decomposition that I wanted. Obviously I have missed something. Can you give me a bit more guidance about how to translate the 26 (13 countries x 2 age cohorts) coefficients into ANOVA-style SS?

HERB

Tihomir Asparouhov posted on Monday, October 01, 2012 - 11:35 am

Here is sample code using 5 countries x 2 age cohorts. It gives the decomposition of the sum of squares; see page 83 in

http://www.stat.ufl.edu/~dksparks/sta3024/chapter-6.pdf

model constraint:

new(b_g1-b_g5);
do(1,5) b_g#=(b1g#+b2g#)/2;

new(b1g_ b2g_);
do(1,2) b#g_=(b#g1+b#g2+b#g3+b#g4+b#g5)/5;

new(b_g_);
b_g_=(b1g_+b2g_)/2;

new(ss1 ss2 ss3);

ss1=5*((b1g_-b_g_)**2+(b2g_-b_g_)**2);

ss2=2*((b_g1-b_g_)**2+(b_g2-b_g_)**2+(b_g3-b_g_)**2+
(b_g4-b_g_)**2+(b_g5-b_g_)**2);

ss3=
(b1g1+b_g_-b1g_-b_g1)**2+
(b1g2+b_g_-b1g_-b_g2)**2+
(b1g3+b_g_-b1g_-b_g3)**2+
(b1g4+b_g_-b1g_-b_g4)**2+
(b1g5+b_g_-b1g_-b_g5)**2+
(b2g1+b_g_-b2g_-b_g1)**2+
(b2g2+b_g_-b2g_-b_g2)**2+
(b2g3+b_g_-b2g_-b_g3)**2+
(b2g4+b_g_-b2g_-b_g4)**2+
(b2g5+b_g_-b2g_-b_g5)**2;
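
The same decomposition can be checked outside Mplus. The following is a minimal Python sketch, assuming a 2 x 5 table with one coefficient per cell; the coefficient values are hypothetical. Rows play the role of age cohorts and columns the role of countries, and the three sums of squares should add up to the total sum of squares around the grand mean.

```python
def ss_decomposition(b):
    """Two-way sum-of-squares decomposition with one value per cell.

    b[i][j] is the coefficient for cohort i (row) and country j (column).
    Returns (ss_rows, ss_cols, ss_interaction, ss_total).
    """
    n_rows, n_cols = len(b), len(b[0])
    row_means = [sum(row) / n_cols for row in b]
    col_means = [sum(b[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    grand = sum(row_means) / n_rows

    ss_rows = n_cols * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n_rows * sum((m - grand) ** 2 for m in col_means)
    ss_inter = sum((b[i][j] + grand - row_means[i] - col_means[j]) ** 2
                   for i in range(n_rows) for j in range(n_cols))
    ss_total = sum((b[i][j] - grand) ** 2
                   for i in range(n_rows) for j in range(n_cols))
    return ss_rows, ss_cols, ss_inter, ss_total

# Hypothetical path coefficients: 2 cohorts x 5 countries.
coefs = [
    [0.30, 0.25, 0.40, 0.35, 0.20],
    [0.22, 0.28, 0.31, 0.45, 0.18],
]

ss1, ss2, ss3, total = ss_decomposition(coefs)
print("cohort SS:", ss1)
print("country SS:", ss2)
print("interaction SS:", ss3)
print("total SS:", total)
```

Because every cell holds a single value, there is no within-cell term: the total equals the sum of the cohort, country, and interaction components.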

Tihomir Asparouhov posted on Monday, October 01, 2012 - 11:38 am

You will also need to label the parameters b1g1, b1g2, etc.

Herb Marsh posted on Monday, October 01, 2012 - 5:19 pm

Tihomir: Sorry for being so dense and not thinking through what I want more carefully.

Yes, your suggestion gives me the SS decomposition, like a two-way ANOVA with one case per cell so that there is no within-cell variation. This is what I asked for.

However, what I really want (with hindsight) is to be able to say that the variation explained by each effect is trivial, small, large, etc. To do this (hazardous though it is) I need some measure of SSerror or SStotal.

I cannot do this with a single value for each cell. However, what I do have is a standard error for each of the cells and the number of cases in each cell. Can I use that to construct SSerror? Naively, I am thinking I can use the SEs to create a SD (mult by N) and then compute a wted-avg of these. I doubt if I could use this to construct a legitimate F-test, but it might suffice for my descriptive purposes.

I would value your thoughts.

HERB

Tihomir Asparouhov posted on Tuesday, October 02, 2012 - 4:36 pm

Herb

Two thoughts from me.

1. You can get SE for any parameter you can construct in model constraints.

2. Take a look at this BSEM design: page 130

https://www.statmodel.com/download/handouts/MuthenV7Part1.pdf

I think it is pretty impressive and should catch on in other places such as ML estimation ... but of course you are already on the fringes of BSEM

Tihomir

Kathrin Urban posted on Friday, November 02, 2012 - 3:23 am

I am running a path model with 5 continuous latent variables. This works well and now I am interested in testing for differences between groups (grouping variable: dichotomous 1,2). I already did multigroup analysis before but in this case I got the following warning:

*** WARNING
Data set contains unknown or missing values for GROUPING,
PATTERN, COHORT, CLUSTER and/or STRATIFICATION variables.
Number of cases with unknown or missing values: 503

I already rechecked the dataset but there are no missing values. I also tried to start with testing for configural and metric invariance and got the same warning. What am I doing wrong?
Thank you!

Input:

VARIABLE:
Names are
[...]

Usevariables are
PrAttA1 PrAttA2 PrAttA3
PrAttB1 PrAttB2 PrAttB3
PoAttA1 PoAttA2 PoAttA3
PoAttB1 PoAttB2 PoAttB3
AttCo_1 AttCo_2 AttCo_3
FiltPF;

Grouping is FiltPF (1= low 2= high);

MODEL:
PrAttA by PrAttA1 PrAttA2 PrAttA3;
PrAttB by PrAttB1 PrAttB2 PrAttB3;
AttCo by AttCo_1 AttCo_2 AttCo_3;
PoAttA by PoAttA1 PoAttA2 PoAttA3;
PoAttB by PoAttB1 PoAttB2 PoAttB3;

AttCo on PrAttA PrAttB;
PoAttA on PrAttA AttCo PoAttB;
PoAttB on PrAttB AttCo PoAttA;

PoAttA with PoAttB;
PrAttA with PrAttB;

Output: stdyx;

Linda K. Muthen posted on Friday, November 02, 2012 - 6:26 am

Please send the output, data set, and your license number to support@statmodel.com.

Thomas Eagle posted on Saturday, November 03, 2012 - 4:12 pm

I am having a problem setting up a multigroup analysis where I have two groups. One group answered every question. The other group skipped all the items of one complete factor plus three additional variables. I used the example as in the post dated April 29, 2004. I still get an error message. Below is my code. What am I doing wrong?

GROUPING IS teen (1 = NonTeen 2 = Teen);

MODEL: BP by nq24_1-nq24_15;
PV by nq24_16-nq24_20;
HB by nq24_21-nq24_35;
NAT by nq24_36 nq24_40 q24_41;
Q by nq24_42-nq24_44;
T by nq24_37-nq24_39 nq24_45-nq24_50;
SP_EX by nq24_51-nq24_60;
SOC_ENV by nq24_61-nq24_64;
MODEL Teen: BP by nq24_1-nq24_5 nq24_6@0 nq24_7-nq24_15;
PV by nq24_16@0 nq24_17@0 nq24_18@0 nq24_19@0 nq24_20@0;
HB by nq24_21-nq24_30 nq24_31@0 nq24_32@0 nq24_33 nq24_34 nq24_35@0;
NAT by nq24_36 nq24_40 nq24_41;
Q by nq24_42-nq24_44;
T by nq24_37-nq24_39 nq24_45-nq24_50;
SP_EX by nq24_51-nq24_60;
SOC_ENV by nq24_61-nq24_64;

Sunny Duerr posted on Sunday, November 04, 2012 - 6:22 am

Hello,

I have a multiple-group latent variable model and I would like to verify that I am interpreting the output correctly.

My analysis has one dependent variable with continuous indicators, which is regressed on each of three latent independent variables with ordinal indicators. Here is my question:

Does the regression coefficient in the output for each group represent the relationship between the independent and dependent variables for only that group, or is the regression coefficient for all groups beyond the first representing a degree of difference between the first group and another group?

As an example, if I have the following regression coefficients:

Group 1: 0.895
Group 2: -0.105
Group 3: 0.063
Group 4: 0.102

would I interpret this as the variables have a stronger relationship for Group 1 than the other groups (0.895 compared with absolute values smaller than 0.2), or as the relationship is relatively strong for all groups and the regression coefficient ranges between 0.790 and 0.997 depending on group membership?

Thanks in advance for any insight or advice you have!

Linda K. Muthen posted on Sunday, November 04, 2012 - 11:07 am

Thomas:

Please send your output and license number to support@statmodel.com.

Linda K. Muthen posted on Sunday, November 04, 2012 - 11:18 am

Sunny:

The results are for each group.

Thomas Eagle posted on Thursday, November 08, 2012 - 10:38 am

Hi Linda, I am back. I tried fixing the items that are missing for one group to zero, as you recommended. The model does not converge. Here is the essence of my code:

DEFINE: IF (teen eq 2) THEN NQ24_6 = 0;
IF (teen eq 2) THEN NQ24_16 = 0;
IF (teen eq 2) THEN NQ24_17 = 0; ... etc...

USEVARIABLES nq24_1-nq24_64;
MISSING = .;
GROUPING IS teen (1 = NonTeen 2 = Teen);

ANALYSIS: COVERAGE = 0.0;
MODEL: BP by nq24_1-nq24_15;
PV by nq24_16-nq24_20;
HB by nq24_21-nq24_35;
NAT by nq24_36 nq24_40 nq24_41;
Q by nq24_42-nq24_44;
T by nq24_37-nq24_39 nq24_45-nq24_50;
SP_EX by nq24_51-nq24_60;
SOC_ENV by nq24_61-nq24_64;

Is there a fix I can try?

Tom

Linda K. Muthen posted on Thursday, November 08, 2012 - 11:39 am

Please send your output and license number to support@statmodel.com.

Danyel A.Vargas posted on Friday, November 30, 2012 - 2:09 pm

Hello,

I want to test whether my model differs by boys and girls by using a multigroup model where all parameters are equal and then another where all parameters are free. However, Mplus won't constrain one of my variables to be equal across groups. This variable is a dummy coded variable. Can you please help me with this?

Thank you!

Danyel

Linda K. Muthen posted on Friday, November 30, 2012 - 3:14 pm

Please send the output and your license number to support@statmodel.com.

Kelly Woodall posted on Thursday, December 06, 2012 - 11:12 am

Hi,

I am new to Mplus and SEM so I apologize in advance if you have answered this question elsewhere.
I want to do a multiple group comparison by sex in which the first model is free across all parameters and the other is equal across all parameters.

The model free across parameters seems to be working:

model:
PTSD by rx* an hyp;
PTSD@1;

large on PTSD;
sumcomp on PTSD;
locw2 on PTSD;
contwt on large sumcomp locw2;
contwt on contbmi;

When I run the equal parameter model (below) the tau, theta, alpha, and psi matrices in the Tech1 output are still being estimated for the female model. How do I equate these parameters?
PTSD by rx* an hyp;
PTSD@1;
large on PTSD (1);
sumcomp on PTSD (2);
locw2 on PTSD (3);
contwt on large (4);
contwt on sumcomp (5);
contwt on locw2 (6);
contwt on contbmi (7);

Model Female:
large on PTSD (1);
sumcomp on PTSD (2);
locw2 on PTSD (3);
contwt on large (4);
contwt on sumcomp (5);
contwt on locw2 (6);
contwt on contbmi (7);

Thank you!

Linda K. Muthen posted on Friday, December 07, 2012 - 9:41 am

WITH statements are used to specify parameters in Theta and Psi. Bracket statements are used to specify parameters in Tau and Alpha.

y1 WITH y2;
[y1 y2];

Ram Manohar Singh posted on Thursday, December 27, 2012 - 5:19 am

Hi,

I am testing a multilevel mediation model.

Fit of the model increases by adding an insignificant path. Which model should be selected: lesser fit model with all significant paths or better fit model with insignificant path included?

How can including insignificant path increase fit of the model??

Bengt O. Muthen posted on Thursday, December 27, 2012 - 10:03 am

This is a matter of choosing one of two chi-square tests: Wald or Likelihood-ratio. They are different but asymptotically the same. Wald is the same as the z test you see when judging significance of a path and LR is the chi-square test of model fit.

I would add the path if there was theoretical reason to consider it. The fact that it is then insignificant is a finding of subject-matter interest.

Jo Brown posted on Thursday, December 27, 2012 - 12:13 pm

Dear Drs,

I am running a multiple group analysis to explore mediation. As I am using imputed data, I need to specify the direct and indirect effects using the MODEL CONSTRAINT option.

However, when I do so I only receive one output for the direct and indirect effects if I simply specify:

model:

Y on M (p1);
Y on X (c1);
M on X (m1);

MODEL CONSTRAINT:
new(ind dir);
indF = p1*m1;
dirF = c1;

Should I repeat the same lines after this as in:

model male:
Y on M (p1);
Y on X (c1);
M on X (m1);

MODEL CONSTRAINT:
new(ind dir);
indF = p1*m1;
dirF = c1;

model female:
Y on M (p1);
Y on X (c1);
M on X (m1);

MODEL CONSTRAINT:
new(ind dir);
indF = p1*m1;
dirF = c1;

to obtain ind and dir for boys and girls separately.

I'd be grateful if you could advise me on the best way to proceed.

Many thanks

Linda K. Muthen posted on Thursday, December 27, 2012 - 2:05 pm

You need to use different labels for male and female and specify an indirect effect for each using these labels.

Jo Brown posted on Thursday, December 27, 2012 - 5:47 pm

Thanks Linda, I am sorry but do you mean something like this?

model:

Y on M (p1);
Y on X (c1);
M on X (m1);

MODEL CONSTRAINT:
new(ind dir);
indF = p1*m1;
dirF = c1;

model male:
Y on M (p2);
Y on X (c2);
M on X (m2);

MODEL CONSTRAINT:
new(indM dirM);
indF = p2*m2;
dirF = c2;

model female:
Y on M (p3);
Y on X (c3);
M on X (m3);

MODEL CONSTRAINT:
new(indF dirF);
indF = p3*m3;
dirF = c3;

I have never done this before so I am really unsure on the best way...

Thanks again

Linda K. Muthen posted on Thursday, December 27, 2012 - 6:46 pm

Yes, but have only one MODEL CONSTRAINT command, and do not intersperse it in the MODEL command. Put MODEL CONSTRAINT either before or after the MODEL command, not inside it. And don't use the same names for the direct and indirect effects.

model:

Y on M (p1);
Y on X (c1);
M on X (m1);

model male:
Y on M (p2);
Y on X (c2);
M on X (m2);

model female:
Y on M (p3);
Y on X (c3);
M on X (m3);

MODEL CONSTRAINT:
new(ind dir indm dirm indf dirf);
ind = p1*m1;
dir = c1;

indm = p2*m2;
dirm = c2;

indf = p3*m3;
dirf = c3;

Jo Brown posted on Thursday, December 27, 2012 - 11:42 pm

Thank you Linda!

Gabriel Schlomer posted on Friday, January 11, 2013 - 10:30 am

I have, perhaps, a simple question. I am running a two-group MLR model with two latent variable predictors and one latent variable dependent variable. I would like to graph, what is effectively an interaction, of the difference in the coefficients between groups in an Aiken and West style model (e.g. -1SD, 0,+1SD). The output gives me the slopes for each predictor for each group, however to properly graph the difference in the slopes I need the intercept. Can I use the group specific intercept for the DV that is printed in the output as my anchor for graphing the slopes? I noticed that one intercept seems to be fixed at zero while the other is freely estimated.

Bengt O. Muthen posted on Friday, January 11, 2013 - 4:31 pm

The answer to your question is yes.

You can also try to do your full plot for the range [-1 SD, +1 SD] using the Version 7 "LOOP" plot. See Part 1 of the handouts and videos from the Utrecht course in August on our web site - or see the version 7 UG ex 3.18 and modify to two-group analysis.
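
For a quick plot outside Mplus, the points for each group's line can be computed by hand from that group's intercept and slope. The following is a minimal Python sketch, assuming hypothetical intercepts, slopes, and predictor standard deviation; the second latent predictor is assumed to be held at zero.

```python
def simple_slope_points(intercept, slope, sd_x, mean_x=0.0):
    """Predicted DV values at -1 SD, the mean, and +1 SD of the predictor."""
    xs = [mean_x - sd_x, mean_x, mean_x + sd_x]
    return [(x, intercept + slope * x) for x in xs]

# Hypothetical values: one group's intercept is fixed at zero,
# the other group's intercept is freely estimated.
groups = {
    "group 1": {"intercept": 0.00, "slope": 0.40},
    "group 2": {"intercept": 0.25, "slope": 0.10},
}
sd_predictor = 1.2  # hypothetical SD of the latent predictor

lines = {}
for name, par in groups.items():
    lines[name] = simple_slope_points(par["intercept"], par["slope"], sd_predictor)
    print(name, lines[name])
```

Each group contributes three (x, y) pairs, which can be passed to any plotting tool to draw the two lines.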

Rachel Navarro posted on Tuesday, March 12, 2013 - 9:04 am

hello,

I am using type=imputation, estimator = MLR, and running a multiple group analysis comparing two racial groups.

Can I conduct a chi-square difference test between the unconstrained and constrained models? Will this give valid results in terms of potential moderation?

Linda K. Muthen posted on Tuesday, March 12, 2013 - 9:20 am

Difference testing has not been developed for multiple imputation. You can make comparisons using a Wald test using MODEL TEST.

Rachel Navarro posted on Tuesday, March 12, 2013 - 9:53 am

Thank you for your response. Can you point me to an example of MODEL TEST used in multiple group analysis?

Linda K. Muthen posted on Tuesday, March 12, 2013 - 11:44 am

I don't have such an example. Just label the parameters using the group-specific MODEL commands and use the labels in MODEL TEST.

shumail paracha posted on Monday, April 01, 2013 - 11:23 am

Hello,

I have checked my model for three socioeconomic statuses and built a separate file for each.

For lower SES, four paths are non-significant. When I placed constraints on them, the chi-square value increased; the model fit indices changed too, but not by much. When I deleted all those paths, I got good model fit. Could you please advise: should I delete all the non-significant paths (which improves model fit) or place constraints on them (which gives only marginal model fit)?

Bengt O. Muthen posted on Monday, April 01, 2013 - 3:18 pm

This question is more general and basic and is therefore more suitable for SEMNET.

Ke Anne Zhang posted on Thursday, April 18, 2013 - 3:59 pm

Dear Drs. Muthen and Muthen,

I am conducting a multiple-group analysis. I have 6 categorical indicators loading onto 3 factors (2 indicators on each). I'm using CLASSES instead of GROUPING (and have specified 8 classes), TYPE = MIXTURE, MLR estimator, and ALGORITHM = INTEGRATION.

I am testing configural invariance first (i.e., covariance invariance), and then measurement invariance (i.e., factor loadings, thresholds/factor means). Since I'm using MLR estimator, I can't test the invariance of the residual variances, but that is fine with me. And I believe that when thresholds are free, factor means have to be fixed at 0, and vice versa.

However, I'm having trouble getting some of my models to converge. I've set up 16 models to compare (by combining free or invariant model specifications for each of the 4 parameters: factor correlations, factor variances, factor loadings, thresholds/factor means).

In my model, I'm freeing the loading of the first indicator on each factor. In models where factor variances are meant to vary across groups, I've fixed variances in group 1 to 1 and allowed variances in other groups to vary freely.

My question is: Do you have any suggestions about why some models are not converging? Are there model combinations (out of my 16 combinations above) that are just not going to be identified?

Thank you very much for your help!

Bengt O. Muthen posted on Thursday, April 18, 2013 - 4:29 pm

I hear nothing wrong in what you are saying.

Models where you don't set the metric in the loadings and instead fix a factor variance at 1 in one group and have them free in other groups need to rely on holding the loadings equal across groups for identification.

The way to figure out the source of the non-identification is to check the parameter number in the error message against Tech1 to see which parameter that is.

Ke Anne Zhang posted on Friday, April 19, 2013 - 9:45 am

Thank you for your help, Dr. Bengt Muthen!

I do have a follow-up question. In one of my models, I made factor correlations invariant, factor loadings free, factor variances free (with variances in first group fixed to 1), factor means free, and thresholds invariant across groups. This model converged successfully. If I need to rely on holding loadings equal across groups for identification, why might this model have successfully converged?

Moreover, I have models in which that requirement (if variances are free, loadings must be equal) is satisfied that did not converge. For example, I have a model in which factor correlations are invariant, factor loadings are invariant, factor variances are free (with variances in first group fixed to 1), factor means are free, and thresholds are invariant. Allowing for 2000 iterations, the model still did not terminate normally. The message is "THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY DUE TO A NON-ZERO DERIVATIVE OF THE OBSERVED-DATA LOGLIKELIHOOD. THE MCONVERGENCE CRITERION OF THE EM ALGORITHM IS NOT FULFILLED. CHECK YOUR STARTING VALUES OR INCREASE THE NUMBER OF MITERATIONS. ESTIMATES CANNOT BE TRUSTED. THE LOGLIKELIHOOD DERIVATIVE FOR PARAMETER 7 IS -0.73857449D-02."

Again, thank you very much for your help! It's invaluable.

Bengt O. Muthen posted on Friday, April 19, 2013 - 2:52 pm

Factor loadings not held equal across groups makes it meaningless to compare factor variances across groups. Generally speaking, this model is not identified. Non-identified models can converge. Why your model converged I can only tell by looking at your output which you can send to support.

For the question in your second paragraph I also would have to see your output to be able to tell.

Note also that you talk about invariant factor correlations. I would think you mean factor covariances since the factor variances are not in all your models it sounds like.

Jenny L. posted on Tuesday, April 23, 2013 - 9:43 am

Dear Drs. Muthen and Muthen,

I'm doing a multi-group analysis on two groups. There are 5 exogenous variables and I would like to correlate them both within and across groups. I know that within-group correlation is a default, but I'm not sure about the code for cross-group correlation (i.e. f1 of Group 1 is correlated with f1 of Group 2) and I can't seem to find it in the user's guide. Could you please give me some clue? Thank you in advance for your help.

Linda K. Muthen posted on Tuesday, April 23, 2013 - 10:05 am

You can't correlate variables across groups.

Jenny L. posted on Tuesday, April 23, 2013 - 10:27 am

Thank you for your reply, Dr. Muthen.

What I have is actually a longitudinal data set with 2 time points. I was trying to test whether the associations among 8 variables (5 exogenous, 2 mediators, 1 outcome) would vary across time. I thought I could treat them as two different groups but it violated the independence assumption of multi-group analyses. What analysis would you suggest? Thank you again for your advice.

Linda K. Muthen posted on Tuesday, April 23, 2013 - 11:30 am

You can do a single-group analysis where you relate the variables across the two time points.

Sung Joon Jang posted on Saturday, May 04, 2013 - 9:25 am

When I conduct multi-group analysis, say, for males and females, I understand equality constraint should be used to directly test whether a coefficient of interest is statistically different between the two groups, using chi-square difference. Sometimes, however, I have a situation, where significant chi-square difference is found (i.e., statistically significant difference between male and female coefficient) when both coefficients are statistically NOT significant. In such case, should I report the coefficient is different between males and females (based on the chi-square difference test) or not (because both coefficients are not significant, that is, not different from zero; and thus comparing two non-significant coefficients is pointless)?

Linda K. Muthen posted on Saturday, May 04, 2013 - 9:29 am

I can't see any value of reporting this.

Sung Joon Jang posted on Wednesday, May 08, 2013 - 2:53 pm

Thanks. I'd like to ask a follow-up question. What if I found a structural coefficient to be significant in one group but not significant in the other when its chi-square test showed non-significant difference? Should I report significant difference between the two groups (based on significant vs. non-significant coefficient) or not (because the test showed non-significant chi-square difference with equality constraint)? This is not a hypothetical question, but I often have such case. Thanks in advance.

Linda K. Muthen posted on Wednesday, May 08, 2013 - 3:00 pm

This is really not related to Mplus. You can probably get a more thorough response by posting this on a general discussion forum like SEMNET.

Claire posted on Wednesday, May 29, 2013 - 2:11 am

I have a question (probably a stupid one!) about doing path analysis, which I was hoping you might be able to answer.

I'm thinking about running a path analysis (perhaps going onto a SEM after) but want to compare path coefficients using the same model between groups (in my case countries).

Could this be done by just running separate path models for each country and comparing the coefficients? Or do you have to use multigroup path analysis? What is the difference between the two methods? Do you have any detailed examples (including syntax) of multigroup path analysis if this is what I should be doing or can you point me in the direction of materials that explain the difference between the two?

Many thanks.

Linda K. Muthen posted on Wednesday, May 29, 2013 - 7:47 am

You should use multiple group analysis so that the testing can be done by the program using either chi-square difference testing or the Wald test using MODEL TEST. If you analyze the groups separately, you would need to do the same type of testing by hand which could be difficult.

Tait Medina posted on Tuesday, June 18, 2013 - 7:08 pm

Hello Dr. Muthen. I've been struggling with how best to approach multiple group factor analysis when there are many groups (my "group" is typically "country" and I usually have about 15-20 groups). I have been working through the paper: "General random effect latent variable modeling: Random subjects, items, contexts, and parameters" but I've been worried that the random effect approach is going to "force" (for lack of a better word) invariant loadings to be non-invariant since a random effect is estimated for each group whether or not it is "needed", and that this could bias the structural (latent mean) parameter estimates. I'm just wondering if I am completely off the mark.

Thank you for your time.

Linda K. Muthen posted on Wednesday, June 19, 2013 - 8:30 am

See the new ALIGNMENT option in the Version 7.1 Language Addendum on the website with the user's guide. See also Web Note 18 and Bengt's UCONN Keynote address which discusses random versus fixed factor loadings.

Tait Medina posted on Wednesday, June 19, 2013 - 8:38 am

Thank you for these resources! The ALIGNMENT option is VERY interesting. I am looking forward to giving it a go.

Andrew Burton Jones posted on Wednesday, June 19, 2013 - 5:33 pm

Dear Professors

A colleague of mine is using Mplus to examine how associations among constructs differ in two contexts.

Rather than constrain each path at a time, and then compare the Chi square difference between the constrained and unconstrained models to see if it's significant, he has used pairwise t-tests with pooled standard errors. He wrote that he did this because he used the mean-adjusted maximum likelihood method in Mplus and that the chi-square values from this test in Mplus cannot be used for chi-square tests.

I have not used Mplus before and would like to confirm whether this is a good handling of the issue. Could you please let me know? I looked for other posts on this issue before posting this, but couldn't find anything.

Sorry for taking up your time, but your advice would be really helpful.

Andrew.

Linda K. Muthen posted on Thursday, June 20, 2013 - 7:55 am

One can do difference testing using MLM. It requires using a scaling correction factor. I think that would find the same results as what your colleague did as long as the values from TECH3 are used in the computations.

Andrew Burton Jones posted on Thursday, June 20, 2013 - 1:10 pm

Many thanks Linda!

Marie-Helene Veronneau posted on Wednesday, July 31, 2013 - 9:17 am

Greetings,
As a follow-up on my question posted here on January 25, 2012, I would like to know what is the purpose of the "scaling correction factor for MLR" value under the section named "Chi-square test of model fit". Because your answer to my previous post says that I need to use the "H0 Scaling Correction Factor for MLR" found under the "Loglikelihood" section, I'm wondering why I also have this other correction factor available.
For your information, I am using the correction factors in the following formula:
cd = (d0 * c0 - d1*c1)/(d0 - d1)
Thanks for your assistance.

Linda K. Muthen posted on Wednesday, July 31, 2013 - 11:30 am

For difference testing you need the scaling correction factor which is related to the degree of non-normality. You can do difference testing using either chi-square values or loglikelihood values. You would use the scaling correction factor that is for the test statistic you decide to use.
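
The two routes can be sketched in Python as follows. This is an illustration under assumptions, not Mplus output: all numbers are hypothetical, and subscript 0 denotes the nested (more restrictive) model while subscript 1 denotes the comparison model. The chi-square route uses the scaled chi-square values, their scaling correction factors, and the degrees of freedom; the loglikelihood route uses the loglikelihood values, the H0 scaling correction factors, and the numbers of free parameters.

```python
def scaled_chi2_difference(t0, c0, d0, t1, c1, d1):
    """Scaled difference test from chi-square values.

    t: scaled chi-square, c: scaling correction factor, d: degrees of freedom.
    Returns (test statistic, df of the difference, difference scaling factor).
    """
    cd = (d0 * c0 - d1 * c1) / (d0 - d1)
    trd = (t0 * c0 - t1 * c1) / cd
    return trd, d0 - d1, cd

def scaled_loglik_difference(l0, c0, p0, l1, c1, p1):
    """Scaled difference test from loglikelihood values.

    l: loglikelihood, c: H0 scaling correction factor, p: free parameters.
    Returns (test statistic, df of the difference, difference scaling factor).
    """
    cd = (p0 * c0 - p1 * c1) / (p0 - p1)
    trd = -2.0 * (l0 - l1) / cd
    return trd, p1 - p0, cd

# Hypothetical values for two nested models.
chi_stat, chi_df, chi_cd = scaled_chi2_difference(120.5, 1.20, 50, 100.2, 1.15, 46)
ll_stat, ll_df, ll_cd = scaled_loglik_difference(-5000.0, 1.30, 20, -4990.0, 1.25, 24)

print("chi-square route: TRd = %.3f on %d df (cd = %.3f)" % (chi_stat, chi_df, chi_cd))
print("loglikelihood route: TRd = %.3f on %d df (cd = %.3f)" % (ll_stat, ll_df, ll_cd))
```

Either statistic is then referred to a chi-square distribution with the difference in degrees of freedom. The correction factors must come from the same section of the output as the statistic being used.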

Hannah Lee posted on Friday, September 06, 2013 - 9:05 am

Hi, I am trying to conduct a multigroup analysis (4 groups). It seems I can only compare two paths at a time with MODEL TEST. So here was my input:

usevariables= REOadd36 COMP1-COMP5 LENG PERC DREadd36 DPUadd36;
Grouping= RCR4split (0=LOBC 1=LSE 2=HSE 3=HOBC);

ANALYSIS: ESTIMATOR=MLMV;

MODEL: comp BY COMP1-COMP5;
comp ON REOadd36 perc leng DREadd36 DPUadd36;
MODEL HSE:
comp BY COMP1-COMP5;
comp ON REOadd36 (HSEb1)
perc leng DREadd36 DPUadd36;
MODEL HOBC:
comp BY COMP1-COMP5;
comp ON REOadd36 (HOBCb1)
perc leng DREadd36 DPUadd36;
MODEL TEST:
HSEb1=HOBCb1;

OUTPUT: TECH1 STDYX;

Although I get the model estimates, I get the following message:

THE MODEL ESTIMATION TERMINATED NORMALLY

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE
COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL.
PROBLEM INVOLVING PARAMETER 49.

THE CONDITION NUMBER IS -0.317D-19.

I am not sure where to look for "parameter 49." Could this have to do with individual group sample sizes?

Linda K. Muthen posted on Friday, September 06, 2013 - 9:43 am

You find the parameters and their numbers in TECH1.

You should not mention the first factor indicator, comp1, in the group-specific MODEL commands. When you do this, its loading is freed in those groups, making the model not identified.

marie posted on Wednesday, October 02, 2013 - 3:05 pm

Hi,

I am running a multiple group analysis with gender as the grouping variable. I understand that I cannot just compare the paths without establishing measurement invariance. I first checked whether the final structural regression model was a good fit across groups ("no constraint across groups"). I have 10 latent constructs. There were twice as many females as males. Three questions:

- How do I test whether the drop in fit is significant or not (I am using MLR)? Numerically speaking there was a drop in the CFI and an increase in the RMSEA and SRMR.
- All the indirect effects became non significant in the male group. Five of my direct paths in both groups became non significant. Is this due to the small sample size after I divided them into two groups?
- Is it fair to say that the final model could not hold in both groups so I have to stop my multiple group analysis there?

Thank you

Linda K. Muthen posted on Thursday, October 03, 2013 - 11:02 am

You should test for measurement invariance using a model with only the ten latent variables. You should not include paths among the ten latent variables until you have established measurement invariance. See the Version 7.1 Mplus Language Addendum on the website with the user's guide. There is a new feature that automatically tests for measurement invariance.

Jennifer Clark posted on Monday, October 07, 2013 - 7:57 am

Hi,

I have a SEM model with the WLSMV estimator for categorical items. In Mplus version 6 I was able to stratify my final model using the GROUPING option by levels of a variable I made in the DEFINE command. However, I have imputed data and now when I try to re-run the same input file in Mplus version 7 it says that I cannot do this because the GROUPING option requires the same number of participants per group in each imputation (and this is not the case since the imputation created some variation across the defined variable). Is this a bug in version 7 because in version 6 it just took the average number of people per group across all the imputations in order to do it? Is there another way I could stratify my model?

Linda K. Muthen posted on Tuesday, October 08, 2013 - 9:46 am

Please send the Version 6 and 7 outputs and your license number to support@statmodel.com.

Ebrahim Hamedi posted on Sunday, December 01, 2013 - 6:58 pm

Hi
How can I analyze a single data file with a grouping variable defining two or more groups, but only include one of the groups in the analysis?

thanks,Ebi

Linda K. Muthen posted on Monday, December 02, 2013 - 9:57 am

Use the USEOBSERVATIONS option without the GROUPING option.

C posted on Tuesday, December 10, 2013 - 1:19 pm

Hello,

I am trying to conduct a multiple group path analysis (with 4 groups) using the following syntax:

GROUPING = welfare (1=Med 2=Social 3=Post_Com 4=Cons);

Model:

educ ON books rooms occ feat age ;

wealth ON main income age books rooms occ feat ;

income ON main age ;

main ON educ occ age ;

Y ON wealth income main educ age books rooms occ feat ;

rooms WITH books occ feat ;
books WITH occ feat ;
w3_occ WITH feat ;

Model indirect:
Y IND books ;

I want to then test whether the path from books to Y is different between groups.

I have tried different ways to do this but keep getting errors probably due to my syntax, so I was wondering if there is simple syntax to do this.

Thanks.

Bengt O. Muthen posted on Tuesday, December 10, 2013 - 5:46 pm

You would have to use Model Constraint to do this yourself. Label the slope parameters involved in the indirect effects for each group and use those labels in Model Constraint to express the indirect effects, like for 2 groups:

Model Constraint:
new(ind1 ind2 diff);

ind1 = b11*b12+ b13*b14;
ind2 = b21*b22+ b23*b24;
diff = ind1-ind2;
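
For the labels themselves, a minimal sketch for two of the groups might look like the following. It assumes a single mediated path (books to educ to Y); the labels a1, b1, a2 and b2 are hypothetical, and a full indirect effect would add one product per mediated path, as in the expressions above.

```
MODEL:
  educ ON books;
  Y ON educ books;
MODEL Med:
  educ ON books (a1);   ! books -> educ slope in group Med
  Y ON educ (b1);       ! educ -> Y slope in group Med
MODEL Social:
  educ ON books (a2);   ! same paths, group Social
  Y ON educ (b2);
MODEL CONSTRAINT:
  NEW(ind1 ind2 diff);
  ind1 = a1*b1;         ! indirect effect in Med
  ind2 = a2*b2;         ! indirect effect in Social
  diff = ind1 - ind2;   ! group difference
```

The estimate and standard error printed for diff among the new parameters then give the test of the group difference.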

C posted on Wednesday, December 11, 2013 - 2:31 am

Thanks. Just to check - is the above to test whether the indirect effect from books to Y is different between groups?

If I just wanted to test whether the direct effect from books to Y is different between groups, is there a simpler way?

Linda K. Muthen posted on Wednesday, December 11, 2013 - 7:00 am

Yes, this is the test. I don't know of a simpler way.

thanoon younis posted on Saturday, December 14, 2013 - 5:58 pm

hi
how can i apply multiple group nonlinear structural equation models in this program .
regards
Thanoon

Christopher Bratt posted on Sunday, December 15, 2013 - 1:08 am

I am trying to compare different approaches to testing for measurement invariance across 29 countries. I test a factor with only three indicators, applying both multigroup CFA (with maximum likelihood) and the new alignment method in Mplus (with Bayesian estimations, aiming at a model with approximate measurement invariance).

Such data can also be analyzed with multilevel CFA, which would seem advantageous if the aim is to later use the factor in a multilevel regression analysis.

Leaving aside configural invariance (which is not tested with only one factor and three indicators) and scalar invariance (which is unlikely with many countries involved): when metric invariance (invariant factor loadings) is supported for nearly all countries, a multilevel model with random intercepts for these countries should be justified. Is this correct?

Or, put differently: if metric invariance is not supported, the multilevel factor analysis is strictly speaking not justified, even though frequently used. Correct? One might define factor loadings as random, but that seems to lead to a rather complex model and I haven't seen this in applied research.

I would be thankful for guidance.

Christopher Bratt

Bengt O. Muthen posted on Sunday, December 15, 2013 - 10:47 am

I think your statements are correct. The paper

http://www.statmodel.com/download/PolAn.pdf

describes 3 different cases of invariance in the multilevel factor analysis case.

Linda K. Muthen posted on Sunday, December 15, 2013 - 11:13 am

Thanoon:

What kind of nonlinear model do you refer to?

thanoon younis posted on Sunday, December 15, 2013 - 9:07 pm

Thank you for your help
I am using quadratic effects on an endogenous latent variable, like x1^2, x2^2, and x1x2.
Regards

Bengt O. Muthen posted on Monday, December 16, 2013 - 10:54 am

That means that you use XWITH, so multiple-group analysis has to be done using TYPE= MIXTURE RANDOM and the KNOWNCLASS option. See the User's Guide.
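
A minimal sketch of such a setup, assuming hypothetical names throughout (indicators y1-y9, factors f1-f3, an observed group variable g coded 1 and 2, and a class variable cg):

```
VARIABLE:
  NAMES = y1-y9 g;                ! hypothetical names
  USEVARIABLES = y1-y9;
  CLASSES = cg (2);               ! one known class per group
  KNOWNCLASS = cg (g = 1 g = 2);  ! class membership is observed
ANALYSIS:
  TYPE = MIXTURE RANDOM;
  ALGORITHM = INTEGRATION;
MODEL:
  %OVERALL%
  f1 BY y1-y3;
  f2 BY y4-y6;
  f3 BY y7-y9;
  f1xf2 | f1 XWITH f2;            ! latent interaction
  f1sq | f1 XWITH f1;             ! quadratic term
  f3 ON f1 f2 f1xf2 f1sq;
  %cg#2%
  f3 ON f1 f2 f1xf2 f1sq;         ! slopes free in the second group
```

Mentioning the regression again in the class-specific part is what lets the slopes differ between the two groups; leaving that part out keeps them equal.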

thanoon younis posted on Tuesday, December 24, 2013 - 7:51 am

dear dr. Muthen
I need your help to correct this program for SEM because I cannot run it and I don't see any errors. I put the term X*Z in as a nonlinear effect on W; is this correct or not?
regards
TITLE: multiple group SEM group 1
DATA: FILE = C:\Users\hp\Desktop\path (2).dat;
VARIABLE:
NAMES = Y1-Y11;
USEVARIABLES = Y1-Y11;
ANALYSIS: ESTIMATOR = ML;
MODEL:
X BY Y1 Y2 Y3 Y4 Y5;
Y BY Y6 Y7;
Z BY Y8 Y9;
W BY Y10 Y11;
W on X Y Z X*Z;
X with Y;
Y with Z;
OUTPUT:TECH1 TECH4 STDYX;

Linda K. Muthen posted on Tuesday, December 24, 2013 - 9:27 am

You need to define the latent variable interaction using the XWITH option. See Example 5.13 in the user's guide.
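
Applied to the input above, a minimal sketch could look like this. The interaction name XZ is hypothetical, and the OUTPUT request is trimmed to TECH1 as a cautious assumption, since not every output option is available with TYPE=RANDOM.

```
ANALYSIS:
  TYPE = RANDOM;
  ALGORITHM = INTEGRATION;
MODEL:
  X BY Y1 Y2 Y3 Y4 Y5;
  Y BY Y6 Y7;
  Z BY Y8 Y9;
  W BY Y10 Y11;
  XZ | X XWITH Z;      ! defines the latent interaction
  W ON X Y Z XZ;       ! XZ replaces X*Z
OUTPUT: TECH1;
```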

thanoon younis posted on Tuesday, December 24, 2013 - 4:55 pm

thank you so much for your help
i need also effect of X^2 AND Y^2 on W (nonlinear effect)how can i write this command.
regards

thanoon younis posted on Wednesday, December 25, 2013 - 8:31 am

dear dr. linda
i want to ask you question regarding data type in each group in multiple group SEM "the data should be independent or correlated in each group.
regards

Linda K. Muthen posted on Wednesday, December 25, 2013 - 9:47 am

For a quadratic effect, use XWITH with the same variable on both sides:

int | x XWITH x;

Subjects in each group should be independent.

thanoon younis posted on Thursday, December 26, 2013 - 6:19 am

dear dr. linda
I saw in some references that the observed variables, not only the observations (subjects), should be independent in multiple group SEM. Is this statement correct? I ask because I want to simulate data to conduct multiple group SEM.
regards

Linda K. Muthen posted on Thursday, December 26, 2013 - 9:44 am

Observed variables should not be independent. It is the relationship among the observed variables that the analysis tries to explain.

marie_l posted on Tuesday, December 31, 2013 - 8:46 am

Hello

I tested a theoretical model in which I have indirect effect. Now I`d like to see if gender has a moderating effect. So, I am running a multiple group analysis. First, I tried to establish measurement invariance. To establish measurement equivalence, I am using the following language (see code below)

1) Does the code look OK?
2) May I use the language with Mplus 6?

VARIABLE: NAMES ARE mediaexp1 mediaexp2 mediaexp3 mediaexp4 alcohol1 alcohol2 alcohol3 alcohol4 alcohol5 hseek1 hseek2 hseek3 hseek4 interact1 interact2 interact3 interact4 norm1 norm2 norm3 norm4 norm5 r_male;

MISSING ARE ALL (9);
GROUPING IS r_male (0= male 1=female);

Model:
mediaexp by mediaexp1 mediaexp2 mediaexp3 mediaexp4;
alcohol by alcohol1 alcohol2 alcohol3 alcohol4 alcohol5;
hseek by hseek1 hseek2 hseek3 hseek4;
interact by interact1 interact2 interact3 interact4;
norm by norm1 norm2 norm3 norm4 norm5;

ANALYSIS:
MODEL = CONFIGURAL METRIC SCALAR;
ESTIMATOR IS MLR;
ITERATIONS = 1000;
CONVERGENCE = 0.00005;

Thanks and happy holidays

Linda K. Muthen posted on Tuesday, December 31, 2013 - 8:57 am

MODEL = CONFIGURAL METRIC SCALAR; is not available in Version 6. See the Topic 1 course handout on the website for the inputs to test measurement invariance.

thanoon younis posted on Saturday, January 04, 2014 - 6:46 am

dear dr. muthen
i need your help to conduct this example in msem and as follow:
TITLE: Configural CFA model
DATA: FILE = C:\Users\hp\Desktop\BSI_18.dat;
VARIABLE:
NAMES = X1-X18 GENDER WHITE AGE EDU CRACK SITE ID;
MISSING = ALL (-9);
USEVARIABLES ARE X1-X18;
GROUPING = SITE (1=OH 2=KY);
!ANALYSIS: ESTIMATOR = ML;!default;
MODEL:
SOM BY X1 X4 X7 X10 X13 X16; !Somatization;
DEP BY X5 X2 X8 X11 X14 X17; !Depression;
ANX BY X3 X6 X9 X12 X15 X18; !Anxiety;
[SOM@0 DEP@0 ANX@0];
X8 WITH X5;
MODEL OH:
X9 WITH X12;
MODEL KY:
SOM BY X1@1 X4 X7 X10 X13 X16; !Somatization;
DEP BY X5@1 X2 X8 X11 X14 X17; !Depression;
ANX BY X3@1 X6 X9 X12 X15 X18; !Anxiety;
[X1-X18*];
X11 WITH X14;
X9 WITH X18;
OUTPUT: TECH1 TECH4;

Linda K. Muthen posted on Saturday, January 04, 2014 - 1:09 pm

If you are trying to specify a configural model, it looks correct.

thanoon younis posted on Thursday, January 09, 2014 - 6:11 am

dear dr. muthen
I cannot get any results in Mplus. What is the problem? Even when I take an example, I cannot get results, for example with this code:
TITLE: Test invariance of marker item factor loadings
DATA: FILE ='C:\Users\hp\Desktop\BSI_18.dat';
VARIABLE:
NAMES = X1-X18 GENDER WHITE AGE EDU CRACK SITE ID;
MISSING = ALL (-9);
USEVARIABLES ARE X1-X18;
GROUPING = SITE (1=OH 2=KY);
!ANALYSIS: ESTIMATOR = ML;!default;
MODEL:
SOM BY X1* X4@1 X7 X10 X13 X16; !Somatization;
DEP BY X5* X2@1 X8 X11 X14 X17; !Depression;
ANX BY X3* X6@1 X9 X12 X15 X18; !Anxiety;
[SOM@0 DEP@0 ANX@0];
X5 with X8;
MODEL OH:
X9 WITH X12;
MODEL KY:
SOM BY X1 X4@1 X7 X10 X13 X16; !Somatization;
DEP BY X5 X2@1 X8 X11 X14 X17; !Depression;
ANX BY X3 X6@1 X9 X12 X15 X18; !Anxiety;
[X1-X18];
X5 with X8;
X11 WITH X14;
X9 WITH X18;
OUTPUT: TECH1;

This is an example, but I don't know where the errors are. Please help me.
regards

Linda K. Muthen posted on Thursday, January 09, 2014 - 6:43 am

What kind of message do you get?

thanoon younis posted on Thursday, January 09, 2014 - 4:29 pm

"The input setup produced syntax warnings/errors that caused Mplus to abort. Please refer to the output file for these warnings/errors and fix the input setup accordingly."

Linda K. Muthen posted on Friday, January 10, 2014 - 7:20 am

Please send your output and your license number to support@statmodel.com.

thanoon younis posted on Friday, January 10, 2014 - 10:52 pm

I am sorry I dont have any output because of errors. I dont have license number because I am using demo version, so how can I solve this problem.

Linda K. Muthen posted on Saturday, January 11, 2014 - 6:29 am

Send the input and data to support@statmodel.com.

thanoon younis posted on Monday, January 13, 2014 - 9:24 pm

hi dr. linda
If I have dichotomous data, how do I choose the type of variable? I don't see a dichotomous type among the variable types in Mplus. Does the categorical type cover both ordered categorical and dichotomous data?
thanks in advance

Linda K. Muthen posted on Tuesday, January 14, 2014 - 6:29 am

We check the variables on the CATEGORICAL list to see how many categories they have and treat them accordingly.

thanoon younis posted on Tuesday, January 14, 2014 - 6:55 am

Hi dr. Linda
If I have two categories like male, ,female how can I treat with this variable in mplus.
Regards

thanoon younis posted on Tuesday, January 14, 2014 - 6:58 am

hi dr. linda
if i have variables with two categories such (male, female) how can i treat with it please explain to me.

Linda K. Muthen posted on Tuesday, January 14, 2014 - 11:27 am

Usually gender is either a grouping variable or a covariate. If it is a covariate, it is treated as a continuous variable in regression and the model is estimated conditioned on it. The scale does not matter as no distributional assumptions are made about it. Only the scale of dependent variables is an issue.

thanoon younis posted on Thursday, January 16, 2014 - 11:52 pm

dear dr. linda
i am working on multi group structural equation models and all my observed variables are dichotomous "you mean dependent variables as observed variables" and i received errors in variable command when i wrote "GROUPING ARE X1-X10;"
HOW CAN I SOLVE THIS PROBLEM PLEASE.??
regards

Linda K. Muthen posted on Friday, January 17, 2014 - 5:53 am

The GROUPING option is to name the grouping variable for multiple group analysis. If x1-x10 are binary dependent variables, you would say

CATEGORICAL ARE x1-x10;
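
Put together with a grouping variable, a minimal sketch might look like the following. The grouping variable name group, its codes, and the two-factor model are all hypothetical.

```
VARIABLE:
  NAMES = x1-x10 group;               ! group holds the group codes
  USEVARIABLES = x1-x10;
  CATEGORICAL = x1-x10;               ! binary dependent variables
  GROUPING = group (1 = g1 2 = g2);   ! names the grouping variable
MODEL:
  f1 BY x1-x5;
  f2 BY x6-x10;
  f2 ON f1;
```

GROUPING takes the single variable that identifies the groups, while CATEGORICAL lists the binary or ordered dependent variables.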

thanoon younis posted on Friday, January 17, 2014 - 6:20 am

Thanks alot drs. Linda this is my question so categorical option represents dichotomous and ordered categorical data. Right?
Thanks again

Linda K. Muthen posted on Friday, January 17, 2014 - 8:55 am

Yes, the program counts the number of categories and treats the variables accordingly.

thanoon younis posted on Friday, January 17, 2014 - 8:56 am

dear drs. linda
can you explain to me how can i prepare my data in multi group structural equation models data file? and i hope to give me example?
regards

Linda K. Muthen posted on Friday, January 17, 2014 - 8:58 am

You have all data in one data set that contains the grouping variable. See Example 5.15.

thanoon younis posted on Friday, January 17, 2014 - 9:17 am

dear drs. linda
i want just to see the data "ex5.15" to see how can i manage this type of data to multi group can you tell me where can i find this file please.
regards

Linda K. Muthen posted on Friday, January 17, 2014 - 10:15 am

It is installed with Mplus and it is also available on the website with the user's guide.

S Gomez posted on Sunday, January 19, 2014 - 12:55 pm

Drs. Muthen,

I am running a multiple groups SEM from a single data file. My plan is to:

1) Run a model with measurement invariance and all structural paths held constant across the two groups.

2) Relax some equality constraints based on modification indices.

The grouping is working fine, and I have been able to run models with structural paths varying freely or held constant.

The problem is the factor loadings: I understand from the user guide these are held constant by default. However, when I run the model without specifying constraints for the BY statements, the output shows group differences in factor loadings,

Even when I specify the loadings to be held constant, the same group differences show up.

Any idea what I might be missing?

Thanks!

Linda K. Muthen posted on Sunday, January 19, 2014 - 5:19 pm

Please send the relevant outputs and your license number to support@statmodel.com.

thanoon younis posted on Monday, January 20, 2014 - 4:54 am

hi drs. linda
i want to conduct multiple group nonlinear structural equation models by using the commands below and i have error.

TITLE: MULTI SEM WITH CONTINUOUS DATA

DATA:
FILE IS "C:\Users\hp\Desktop\normal.dat";
TYPE IS COVARIANCE MEANS;
NGROUPS = 2;
NOBSERVATIONS = 500 500;

VARIABLE:
NAMES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
USEVARIABLES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
GROUPING IS (0= GROUP1 1=GROUP2);

ANALYSIS:
TYPE IS GENERAL BASIC;
ESTIMATOR IS BAYES;
ITERATIONS = 1000;
CONVERGENCE = 0.00005;

MODEL
TYPE=RANDOM
X BY X1 X2 X3 X4;
Y BY X5 X6;
Z BY X7 X8;
W BY X9 X10;
W ON X Y Z;
int | X XWITH Z;

OUTPUT: SAMPSTAT MODINDICES RESIDUAL STANDARDIZED CINTERVAL FSCOEFFICIENT
FSDETERMINACY TECH3 TECH4 TECH5;

SAVEDATA:
RESULTS IS TT;
TECH3 IS YY;
TECH4 IS UU;

and the error is
*** ERROR in ANALYSIS command
Unknown option:
Y

Linda K. Muthen posted on Monday, January 20, 2014 - 6:51 am

Try putting a semicolon after TYPE=RANDOM. If that does not work, please send the full output and your license number to support@statmodel.com.

thanoon younis posted on Monday, January 20, 2014 - 8:27 am

after put semicolon also i have error
*** ERROR in ANALYSIS command
Unknown option:
X
i am so sorry i dont have license number

regards

Linda K. Muthen posted on Monday, January 20, 2014 - 11:01 am

You don't give a variable name in the GROUPING option. See the user's guide to see how the GROUPING option is specified.

thanoon younis posted on Monday, January 20, 2014 - 5:46 pm

thank you so much for your help
i saw in user guide the grouping is
GROUPING IS group (1 = g1 2 = g2); i put it in my program and still same problem
*** ERROR in ANALYSIS command
Unknown option:
X

DATA:
FILE IS "C:\Users\hp\Desktop\normal.dat";
TYPE IS COVARIANCE MEANS;
NGROUPS = 2;
NOBSERVATIONS = 500 500;

VARIABLE:
NAMES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
USEVARIABLES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
GROUPING IS group (1 = g1 2 = g2);

ANALYSIS:
TYPE IS GENERAL BASIC;
ESTIMATOR IS BAYES;
ITERATIONS = 1000;
CONVERGENCE = 0.00005;

MODEL
TYPE=RANDOM;
X BY X1 X2 X3 X4;
Y BY X5 X6;
Z BY X7 X8;
W BY X9 X10;
W ON X Y Z;
int | X XWITH Z;
int | X XWITH Y;
int | Y XWITH Z;

OUTPUT: SAMPSTAT MODINDICES RESIDUAL STANDARDIZED CINTERVAL FSCOEFFICIENT
FSDETERMINACY TECH3 TECH4 TECH5;

SAVEDATA:
RESULTS IS TT;
TECH3 IS YY;
TECH4 IS UU;

Linda K. Muthen posted on Monday, January 20, 2014 - 8:08 pm

I can't help without seeing your output and license number at support@statmodel.com.

thanoon younis posted on Wednesday, January 22, 2014 - 6:19 am

this is my output

Mplus VERSION 6.12
MUTHEN & MUTHEN
01/22/2014 5:17 PM

INPUT INSTRUCTIONS

DATA:
FILE IS "C:\Users\hp\Desktop\normal.dat";
TYPE IS COVARIANCE MEANS;
NGROUPS = 2;
NOBSERVATIONS = 500 500;

VARIABLE:
NAMES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
USEVARIABLES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
GROUPING IS group (1 = g1 2 = g2);

ANALYSIS:
TYPE IS GENERAL BASIC;
ESTIMATOR IS BAYES;
ITERATIONS = 1000;
CONVERGENCE = 0.00005;

MODEL
TYPE=RANDOM;
X BY X1 X2 X3 X4;
Y BY X5 X6;
Z BY X7 X8;
W BY X9 X10;
W ON X Y Z;
int | X XWITH Z;
int | X XWITH Y;
int | Y XWITH Z;

OUTPUT: SAMPSTAT MODINDICES RESIDUAL STANDARDIZED CINTERVAL FSCOEFFICIENT
FSDETERMINACY TECH3 TECH4 TECH5;

SAVEDATA:
RESULTS IS TT;
TECH3 IS YY;
TECH4 IS UU;

*** ERROR in ANALYSIS command
Unknown option:
X

MUTHEN & MUTHEN
3463 Stoner Ave.
Los Angeles, CA 90066

Tel: (310) 391-9971
Fax: (310) 391-8971
Web: www.StatModel.com
Support: Support@StatModel.com

Copyright (c) 1998-2011 Muthen & Muthen

Linda K. Muthen posted on Wednesday, January 22, 2014 - 6:24 am

TYPE=RANDOM should be in the ANALYSIS command, not the MODEL command.

thanoon younis posted on Wednesday, January 22, 2014 - 8:19 am

after this change same error
the output is:

INPUT INSTRUCTIONS

DATA:
FILE IS "C:\Users\hp\Desktop\normal.dat";
TYPE IS COVARIANCE MEANS;
NGROUPS = 2;
NOBSERVATIONS = 500 500;

VARIABLE:
NAMES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
USEVARIABLES ARE X1 X2 X3 X4 X5 X6 X7 X8 X9 X10;
GROUPING IS group (1 = g1 2 = g2);

ANALYSIS:
TYPE IS GENERAL BASIC;
ESTIMATOR IS BAYES;
ITERATIONS = 1000;
CONVERGENCE = 0.00005;
TYPE=RANDOM;

MODEL
X BY X1 X2 X3 X4;
Y BY X5 X6;
Z BY X7 X8;
W BY X9 X10;
W ON X Y Z;
int | X XWITH Z;
int | X XWITH Y;
int | Y XWITH Z;

OUTPUT: SAMPSTAT MODINDICES RESIDUAL STANDARDIZED CINTERVAL FSCOEFFICIENT
FSDETERMINACY TECH3 TECH4 TECH5;

SAVEDATA:
RESULTS IS TT;
TECH3 IS YY;
TECH4 IS UU;

*** ERROR in ANALYSIS command
Unknown option:
Y

Linda K. Muthen posted on Wednesday, January 22, 2014 - 10:06 am

Put a colon after MODEL.

thanoon younis posted on Saturday, January 25, 2014 - 4:41 am

hi dr.linda
when i conducted sem in mplus i received this message in output page
*** WARNING in OUTPUT command
TECH4 option is not available for TYPE=RANDOM.
Request for TECH4 is ignored.
*** WARNING
Data set contains unknown or missing values for GROUPING,
PATTERN, COHORT, CLUSTER and/or STRATIFICATION variables.
Number of cases with unknown or missing values: 437
2 WARNING(S) FOUND IN THE INPUT INSTRUCTIONS

why i cannot get on these results and why "not available"

Linda K. Muthen posted on Saturday, January 25, 2014 - 8:44 am

When random slopes are estimated, the dependent variable variances vary as a function of the covariate so there is not a single TECH4 value to be printed.

thanoon younis posted on Monday, January 27, 2014 - 2:19 am

hi dr. linda
i want to ask you some questions:
1- which method is suitable to conduct multiple group structural equation models with continuous dependent variables (just ML).
2- which method is suitable to conduct multiple group structural equation models with ordered categorical dependent variables.
3- which method is suitable to conduct multiple group structural equation models with dichotomous dependent variables.

regards

Linda K. Muthen posted on Monday, January 27, 2014 - 8:44 am

See page 601 of the user's guide where there is a summary table of estimators available for different types of variables.

thanoon younis posted on Tuesday, February 11, 2014 - 5:04 am

dear dr. linda
i want to use bayes method with multiple group sem with categorical data and i want to use uniform distribution. my question is this procedure correct?thanks in advance

Bengt O. Muthen posted on Tuesday, February 11, 2014 - 3:01 pm

You can do multiple group SEM with categorical data using Bayes and the Knownclass approach. A uniform prior can be specified where appropriate.

thanoon younis posted on Wednesday, February 12, 2014 - 5:28 am

dear dr. muthen
as you know the basic assumptions of SEM the observed variables must distributed as a normal when i have continuous data but most of scientists like(lee 2007) depended on normal distribution on categorical data my question why he doesnt use uniform distribution with categorical data.
thanks alot

Bengt O. Muthen posted on Wednesday, February 12, 2014 - 3:00 pm

I think you are still talking about Bayes. With categorical outcomes, Bayes MCMC procedures have been developed using probit, generating underlying latent continuous normal variables that are then categorized. See work by Albert and Chib for instance as referred to in our technical reports on the Mplus Bayes implementation.

Elizabeth Barrett-Cheetham posted on Thursday, February 13, 2014 - 10:45 pm

Hello,
I am currently testing a structural equation model from 4 emotions to distinct subtypes of well-being via 2 different mediators. The social emotions were induced in a lab via 4 different conditions. I have run the structural equation model within a multigroup design, using the condition as a grouping variable.

I would like to run a manipulation check to see whether participants were in fact reporting higher levels of the emotion if they were in that condition. I understand that you can do a manipulation check by assessing whether the means of the emotions are higher in each group/condition. What syntax would I use to get the model to compute these? And to see if there any significant differences?

I've had a look online and in the user manual, but I can't seem to find anything.

Many thanks for your assistance,
Elizabeth

Linda K. Muthen posted on Friday, February 14, 2014 - 11:23 am

I am assuming emotions are observed variables not factors. You can use MODEL TEST or chi-square difference testing to determine if the means are different from each other.

thanoon younis posted on Friday, February 14, 2014 - 5:07 pm

dear dr. muthen
in multiple group analysis with bayes method when you have categorical data with 4 categories which distribution is suitable for this type of data (normal or uniform)?
thanks alot

Bengt O. Muthen posted on Sunday, February 16, 2014 - 10:41 am

If the variable is ordinal and the distribution reasonably symmetric, it might be ok to approximate it as normal.

Milena Batanova posted on Monday, February 17, 2014 - 10:47 am

Dear Dr. Muthen,

I'm testing a 2 group (by gender) mediation model (N=499), where I have 7 exogenous variables (and 2 baseline variables of my outcomes), one mediator variable, and two outcomes. All variables are indicators and continuous.

Although my output statements say "terminated normally" they also say,

THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE
TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE
FIRST-ORDER DERIVATIVE PRODUCT MATRIX. THIS MAY BE DUE TO THE STARTING
VALUES BUT MAY ALSO BE AN INDICATION OF MODEL NONIDENTIFICATION. THE
CONDITION NUMBER IS 0.175D-15. PROBLEM INVOLVING PARAMETER 162.

When I check the parameter, it's simply one variance for one of the baseline variables.

When I bootstrap the model, however, my output statements produce no errors. Thus, is it ok to report all results from the bootstrapped model? Otherwise, what can I do with my non-bootstrapped model?

thanks!

Vera Skalicka posted on Monday, February 17, 2014 - 1:03 pm

I am interested in whether there are differences in means between groups (manova or anova). How do I write the correct syntax for this? After specifying the groups, I wrote Model: [varname] (1) ;
But the output for the restricted and the non-restricted model was the same, with the same number of df.

Linda K. Muthen posted on Monday, February 17, 2014 - 2:15 pm

Milena:

Please send the output with the error message and your license number to support@statmodel.com.

Linda K. Muthen posted on Monday, February 17, 2014 - 2:16 pm

Vera:

Please send the outputs and your license number to support@statmodel.com.

Elizabeth Barrett-Cheetham posted on Monday, February 17, 2014 - 5:59 pm

Hello,

Thanks for your response. I am hoping to test the differences in the means using chi square testing in the model constraint command. I'm unsure about what syntax to use. As an example, how would I constrain the self-report measure of pride to be equal across the neutral and pride group?

Many thanks,
Elizabeth

Linda K. Muthen posted on Tuesday, February 18, 2014 - 6:09 am

See pages 478-480 of the user's guide.

thanoon younis posted on Saturday, February 22, 2014 - 1:23 am

HI DR. MUTHEN
i want to define a new weight matrix regarding WLS OR WLSMV in multiple group SEM how can i write this matrix in mplus??
thanks

Linda K. Muthen posted on Saturday, February 22, 2014 - 6:20 am

You cannot read a weight matrix with WLS or WLSMV.

ehrbc1 posted on Tuesday, March 04, 2014 - 6:07 pm

Hello Linda,

I have read through pages 478-480 of the user's guide and I am still experiencing some difficulties. As mentioned above I want to see whether there are any differences between a self-report measure of pride across the neutral and pride group. In other words, I am expecting that the pride measure will be significantly higher in the pride group compared to the neutral group.

Extracts of my syntax are as follows.

VARIABLE:
NAMES ARE g POSREPORT SEX AGENTIC COMMUNAL SELFWB PRIDEREPORT COMPREPORT GRATREPORT OTHERWB;
USEVARIABLES ARE g POSREPORT AGENTIC COMMUNAL SELFWB PRIDEREPORT COMPREPORT
GRATREPORT OTHERWB;
Missing are all (999);
GROUPING is g (1 = neutral 2 = positivity 3 = gratitude 4 = compassion 5 = pride);

MODEL:

COMMUNAL on POSREPORT;
COMMUNAL on GRATREPORT;
COMMUNAL on COMPREPORT;
COMMUNAL on PRIDEREPORT;

MODEL neutral: PRIDEREPORT (1);
MODEL pride: PRIDEREPORT (1);

After running the analysis, I have examined the "chi-square test of model fit" and it is not significant. This is not what I expected given that the ANOVA I ran in SPSS showed that there was a difference between the self-report measure of pride in the pride and neutral group.

Could you please assist?

Many thanks,
E

Linda K. Muthen posted on Wednesday, March 05, 2014 - 10:41 am

The following holds the variance of pridereport, an independent variable, equal across groups. Is this what you intend?

MODEL neutral: PRIDEREPORT (1);
MODEL pride: PRIDEREPORT (1);

ehrbc1 posted on Wednesday, March 05, 2014 - 8:07 pm

Hi Linda,

Thanks for your response.

No, that is not what I intend. My intention is to see whether the mean of the variable PRIDEREPORT is significantly different in the neutral and pride group. So a t-test.

Many thanks,
E

Linda K. Muthen posted on Thursday, March 06, 2014 - 1:59 pm

The mean of an observed exogenous variable is not an estimated parameter in a regression model. You refer to a mean by placing the variable name in brackets:

MODEL neutral: [PRIDEREPORT] (1);
MODEL pride: [PRIDEREPORT] (1);

In a regression, it is most common to test the equality of a regression coefficient across groups.
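
As an alternative to the chi-square difference test, the MODEL TEST option mentioned earlier can be sketched as follows. This assumes PRIDEREPORT is the only analysis variable, so that its mean is a model parameter; the labels m1 and m5 are hypothetical, and the DATA command and NAMES list are omitted.

```
VARIABLE:
  USEVARIABLES ARE PRIDEREPORT;
  GROUPING IS g (1 = neutral 2 = positivity 3 = gratitude
                 4 = compassion 5 = pride);
MODEL:
  [PRIDEREPORT];         ! mean
  PRIDEREPORT;           ! variance
MODEL neutral:
  [PRIDEREPORT] (m1);    ! mean in the neutral group
MODEL pride:
  [PRIDEREPORT] (m5);    ! mean in the pride group
MODEL TEST:
  0 = m1 - m5;           ! Wald test of equal means
```

Here the means carry different labels, so they stay free, and the Wald test in the output evaluates whether they differ.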

thanoon younis posted on Sunday, March 09, 2014 - 4:58 am

hi
i want to ask you which method is better to estimate parameters in multi group SEM with categorical outcomes wls or bayes and which one gives less SE.
thanks in advance

Linda K. Muthen posted on Monday, March 10, 2014 - 8:28 am

I don't think you will find much difference in the standard errors between the two methods. Try it out and see.

Lindsay Bell posted on Thursday, March 13, 2014 - 7:11 am

Hello -

I am doing a multigroup comparison between groups from two different cultures, and I need to control for a binary variable in one culture, but not the other. When I try to run the model, I get an error message because one group has no variance on that variable.

The only way I can figure out to get the model to run is to falsely change the value on one of the observations so that both groups have some variance on that variable, and then constrain all parameters with that variable to be zero in the group where it doesn't apply. Is that the best way to do that? Does constraining the parameters to zero keep the variable from having any effects in the group where the value has been falsely changed?

Thank you,
Lindsay

Linda K. Muthen posted on Thursday, March 13, 2014 - 11:23 am

See the following FAQ on the website:

Different number of variables in different groups

Tait Medina posted on Thursday, March 20, 2014 - 2:48 pm

I am estimating a multiple group CFA model with 2 groups, 6 observed continuous variables, and one factor. I have constrained 2 loadings to be invariant across groups, in addition to the loading that has been fixed to 1 in each group. I am freely estimating the intercepts in both groups. The residual variances are freely estimated as well. The factor means are fixed at 0. Now I would like to compare the substantive implications of extracting factor scores and using them as outcomes in a regression analysis in each group (a two-step approach), compared to estimating the effects of covariates on the factor in each group in a single step. However, I am having difficulty setting up my syntax for the single step approach and am hoping that I can receive some guidance. This example syntax leads to a nonidentified model. I thought that I had met the minimum number of constraints, but obviously I am missing something. Thank you.

MODEL:
f1 BY y1-y6;
f1 ON x1;
MODEL 2:
f1 BY y5 y6;
[y1-y6];

Bengt O. Muthen posted on Thursday, March 20, 2014 - 3:29 pm

This must not be the full model since the factor means are not fixed at zero. Please send your output and license number to Support.

Tait Medina posted on Friday, March 21, 2014 - 8:10 am

I'm so embarrassed. That is exactly what was missing from the single-step model (factor means fixed at 0 in both groups). This syntax runs just fine:

MODEL:
f1 BY y1-y6;
[f1@0];
f1 ON x1;
MODEL 2:
f1 BY y5 y6;
[y1-y6];
[f1@0];
f1 ON x1;

Thank you, and sorry for the bother.

Sherilynn Chan posted on Monday, March 24, 2014 - 3:35 pm

I am trying to run a multiple group analysis in a censored regression. All of my variables are observed. The Mplus user guide states that for censored outcomes with maximum likelihood estimation, multiple group analysis is specified using the KNOWNCLASS option of the VARIABLE command in conjunction with the TYPE=MIXTURE option of the ANALYSIS command. However, I receive the following error:

*** ERROR in VARIABLE command
CLASSES option not specified. Mixture analysis requires one categorical
latent variable.

My input is below:

VARIABLE:

USEVAR = x1 x2 x3 x4 x5 y1;

CENSORED ARE y1 (b);

Missing ARE all .;

KNOWNCLASS IS x1(0 = male 1 = female);

ANALYSIS:
TYPE = MIXTURE;
Algorithm = INTEGRATION;
ESTIMATOR = MLR;

MODEL:
y1 ON x2 x3 x4;
y1 ON x5(1);

Are the input commands incorrect? How do I test for moderation (categorical variable) for a censored outcome, when all variables are observed? Your help is much appreciated - thanks!

Linda K. Muthen posted on Monday, March 24, 2014 - 4:11 pm

You also need the CLASSES option. See Example 7.21. It shows how all of these options work together.

Sherilynn Chan posted on Tuesday, March 25, 2014 - 12:44 pm

Thanks for your quick response. I looked at Example 7.21 but am still unclear on how to apply this to my model as I do not have a categorical latent variable with known class membership. Instead I have a categorical observed variable - I am examining sex (0 = male 1 = female) as a moderator.

Linda K. Muthen posted on Tuesday, March 25, 2014 - 1:30 pm

The CLASSES option names a categorical latent variable which the KNOWNCLASS option makes equivalent to your observed variable.
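
For the input above, a minimal sketch could look like the following. The class variable name cg and the labels b1 and b2 are hypothetical, x1 is dropped from USEVARIABLES on the assumption that it serves only to define the known classes, and the MISSING option and NAMES list are omitted.

```
VARIABLE:
  USEVARIABLES = x2 x3 x4 x5 y1;
  CENSORED ARE y1 (b);
  CLASSES = cg (2);                  ! categorical latent variable
  KNOWNCLASS = cg (x1 = 0 x1 = 1);   ! made equivalent to observed sex
ANALYSIS:
  TYPE = MIXTURE;
  ALGORITHM = INTEGRATION;
  ESTIMATOR = MLR;
MODEL:
  %OVERALL%
  y1 ON x2 x3 x4 x5;
  %cg#1%
  y1 ON x5 (b1);                     ! slope for x1 = 0
  %cg#2%
  y1 ON x5 (b2);                     ! slope for x1 = 1
MODEL TEST:
  0 = b1 - b2;                       ! test of moderation by sex
```

Giving the x5 slope a different label in each class frees it across the two groups, and the MODEL TEST line then tests whether the two slopes are equal.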

ehrbc1 posted on Tuesday, April 01, 2014 - 7:34 pm

Hi Linda,

I have been able to run some pre-planned ANOVA contrasts in SPSS but I would now like to run them in Mplus for my multi-group structural equation model.

I understand that to compare a DV (the mean self-reported feeling of pride) between a neutral group and a pride group, I would do the following:
MODEL neutral: [PRIDEREPORT] (1);
MODEL pride: [PRIDEREPORT] (1);

How would I compare the pride group to the combined average of the other groups in the model (not including neutral) on [PRIDEREPORT]? The other groups are compassion, positivity and gratitude.

Many thanks for your assistance,
E

Linda K. Muthen posted on Wednesday, April 02, 2014 - 11:04 am

Let's say you have three means that you have labelled:

MODEL 1:
[y] (p1);
MODEL 2:
[y] (p2);
MODEL 3:
[y] (p3);

You can use MODEL CONSTRAINT as follows:

MODEL CONSTRAINT:
NEW (mean diff);
mean = (p2 + p3)/2;
diff = mean - p1;
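
As a rough check on what this contrast computes, here is a minimal Python sketch of the same arithmetic. It is not Mplus code. The function name and the numbers are hypothetical, and the standard error assumes the three group means are estimated independently (no cross-group constraints linking them), which may not hold in every model.

```python
import math

def contrast_of_means(p1, p2, p3, se1, se2, se3):
    """Return (diff, se, z) for diff = (p2 + p3)/2 - p1.

    Assumption: the three means are independent estimates, so the
    sampling covariances between them are taken to be zero.
    """
    mean = (p2 + p3) / 2.0          # average of the two comparison groups
    diff = mean - p1                # same contrast as in MODEL CONSTRAINT
    se = math.sqrt(se2 ** 2 / 4.0 + se3 ** 2 / 4.0 + se1 ** 2)
    return diff, se, diff / se

# Hypothetical estimates, for illustration only
diff, se, z = contrast_of_means(p1=3.0, p2=4.0, p3=5.0,
                                se1=0.2, se2=0.2, se3=0.2)
print(diff, se, z)
```

With more comparison groups the same pattern applies: average their means, subtract the target mean, and weight each squared standard error by the square of its contrast coefficient.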

thanoon younis posted on Sunday, April 20, 2014 - 7:35 am

Hi Dr. Muthen,

I have a multiple group SEM with ordered categorical variables and I want to use an inverse normal to solve the identification problem. Is that correct?

thanks in advance

Bengt O. Muthen posted on Monday, April 21, 2014 - 8:19 am

I don't know which identification problem you refer to or what inverse normal you refer to.

thanoon younis posted on Monday, April 21, 2014 - 8:31 am

Identification problem for the distribution of thresholds in ordered categorical data. Can I use inverse normal as a distribution for thresholds?
Thanks a lot, doctor.

Bengt O. Muthen posted on Tuesday, April 22, 2014 - 9:09 am

Are you talking about a Bayes prior? If so, we don't have an inverse normal prior. I still don't understand what you are asking.

thanoon younis posted on Wednesday, April 23, 2014 - 12:42 am

Yes, I want to use Bayesian analysis in SEM with ordered categorical and dichotomous data.

Linda K. Muthen posted on Wednesday, April 23, 2014 - 10:20 am

See MODEL PRIORS in the user's guide to see the priors available in Mplus.

thanoon younis posted on Tuesday, June 03, 2014 - 7:41 pm

Hi,
I would like a real data example for multiple group SEM with ordered categorical and dichotomous data. Can you help me get one?

many thanks in advance

Thanoon

Linda K. Muthen posted on Wednesday, June 04, 2014 - 11:31 am

We don't have one that we can share. Perhaps you should ask on SEMNET.

Eric Deemer posted on Tuesday, July 29, 2014 - 9:54 am

Hi,
Does version 7.11 still provide scaling correction factors for chi-square difference testing using MLR? I'm trying to do a chi-square difference test and I don't see the correction factors in my output. Thanks.

Eric

Linda K. Muthen posted on Tuesday, July 29, 2014 - 10:19 am

They should be there. Check to be sure the estimator being used is MLR.
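
For readers wondering how the scaling correction factors enter the difference test, below is a minimal Python sketch of the commonly cited Satorra-Bentler scaled chi-square difference computation from the scaled chi-square values, their correction factors, and degrees of freedom. The function name and the input values are hypothetical; check the formula against the Mplus documentation before relying on it.

```python
def scaled_chi_square_difference(t0, c0, d0, t1, c1, d1):
    """Scaled chi-square difference test from printed MLR/MLM output.

    t0, c0, d0: chi-square, scaling correction factor, and df of the
                nested (more restricted) model.
    t1, c1, d1: the same quantities for the comparison (less restricted) model.
    Returns (trd, df_diff, cd).
    """
    if d0 <= d1:
        raise ValueError("the nested model must have more degrees of freedom")
    # Difference-test scaling correction
    cd = (d0 * c0 - d1 * c1) / (d0 - d1)
    if cd <= 0:
        raise ValueError("non-positive scaling correction; test not usable")
    # Scaled difference statistic, referred to chi-square with d0 - d1 df
    trd = (t0 * c0 - t1 * c1) / cd
    return trd, d0 - d1, cd

# Hypothetical output values, for illustration only
trd, df_diff, cd = scaled_chi_square_difference(
    t0=120.0, c0=1.2, d0=50, t1=100.0, c1=1.1, d1=45)
print(trd, df_diff, cd)
```

The point of the sketch is that the two scaled chi-square values cannot simply be subtracted; each is first multiplied back by its own correction factor.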

Eric Deemer posted on Tuesday, July 29, 2014 - 10:28 am

Ah, there are the scaling correction factors! I thought MLR was the default estimator?

eric

Bengt O. Muthen posted on Tuesday, July 29, 2014 - 4:43 pm

Not in all cases.

Eric Deemer posted on Wednesday, July 30, 2014 - 3:34 am

I see. Okay, I specified MLR estimation and I got the correction factors. Thanks so much!

Eric

Melissa Lopez posted on Tuesday, August 26, 2014 - 1:44 pm

Hello,

I'm unfamiliar with the Wald's test for model comparison. I've been researching it and believe I understand how to do it, but I have a question. When conducting the comparison, do you test all parameters or only the ones that differ between the nested and comparison models?

Your advice is greatly appreciated. Thank you in advance.

Bengt O. Muthen posted on Tuesday, August 26, 2014 - 3:11 pm

It's your choice - you can test any set of parameters.

thanoon younis posted on Sunday, August 31, 2014 - 9:11 pm

Hello,

I need your help getting some information on this data set, which I found on your website for multiple group analysis: wmimicd.dat

many thanks in advance

Linda K. Muthen posted on Monday, September 01, 2014 - 6:50 am

Where do you find this data set?

thanoon younis posted on Monday, September 01, 2014 - 8:23 am

Thank you for your reply.

I found it in Mplus Examples - Categorical Outcome - wmimicd.dat.

many thanks in advance

Linda K. Muthen posted on Monday, September 01, 2014 - 10:43 am

There is no information about this data set available. It may be simulated.

Melissa Lopez posted on Thursday, September 18, 2014 - 8:53 am

Hello,

I have another question regarding Wald's test. In the testing portion I began with a fully constrained model (parameters, means, covariances that had been added for better data fit). I am wondering if some of the effects can appear to be significant if they are not if the overall model shows significance. To account for that do you drop constraints based on the modification indices and make decisions based on how the other fit indices change?

I apologize for all the questions, but greatly appreciate your help. I have found a great deal of references, but none that have answered these questions.

Thank you again for providing this service.

Bengt O. Muthen posted on Thursday, September 18, 2014 - 4:51 pm

I don't understand your first paragraph.

Daniel Kopala-Sibley posted on Friday, October 17, 2014 - 12:21 pm

Dear Drs Muthen,

I am trying to run power analyses using Monte Carlo estimation for a moderated mediation model. All variables, including the moderator, are continuous. The code below generates power estimates for the indirect effect of X->X2->Y, and for detecting the effect of interaction between X and M (XM) on X2. My question is should I report power separately for the interaction term and for the indirect effect, or is there a way to test for power by combining the two?

Right now just for ease, I've specified all means = 0 and variances = 1, and regression effects are various levels of possible effect sizes.

Thank you so much in advance

MONTECARLO:
names are x x2 y m xm;
nobs = 70; ! sample size
nreps = 1000;
seed = 2222; ! random number generator seed
DEFINE: xm = x*m;
ANALYSIS: TYPE=meanstructure;
MODEL POPULATION:
[x @ 0]; !mean of x set to 0;
[y @ 0];
[x2 @ 0];
[m @ 0];
[xm @ 0];
y @ 1.0;
x @ 1.0;
x2 @ 1.00;
m @ 1.00;
xm @ 1.00;
x2 on X @ 1.02;
x2 on xm @ .283;
x2 on m @ .283;
y on x2 @ 1.02;
y on x @ .283 x @ .283;
MODEL:
x * 1.00;
x2 * 1.00;
y * 1.0;
xm * 1.0;
x2 on x * 1.02(gamma1);
x2 on m * 1.02;
x2 on xm * 1.02 (gamma2);
y on x2 * .283 (b);
y on x * .283 x * .283;
MODEL INDIRECT:
y IND x;

Bengt O. Muthen posted on Friday, October 17, 2014 - 4:42 pm

Perhaps you get power for the full effect - not just using the slope of the main effect and the slope of the interaction effect separately - by using Model Constraint to express the moderated mediation effect in line with the "indirect" expression in the pdf called

Loop plot for ex 3.18

on our Mediation web page

http://www.statmodel.com/Mediation.shtml

Daniel Kopala-Sibley posted on Sunday, October 19, 2014 - 9:54 am

Thank you for your very prompt response.

When I use the Loop commands:

MODEL CONSTRAINT:
LOOP(m,-2,2,0.1);
I receive the following error message:

*** ERROR in MODEL CONSTRAINT command
A parameter label or the constant 0 must appear on the left-hand side
of a MODEL CONSTRAINT statement. Problem with the following:
LOOP(M,-2,2,0.1) =

Would you happen to know what this means and/or how to adjust the code?

Thanks again

Linda K. Muthen posted on Sunday, October 19, 2014 - 10:34 am

The LOOP plot came out in Version 7. It sounds like you may be using an older version of the program where it is not available.

Daniel Kopala-Sibley posted on Sunday, October 19, 2014 - 10:54 am

Oh, I see, yes, I'm using version 6. Do you know if there's a way to test my question (see two posts above)/modify the code I wrote in version 6?

Thanks again

Linda K. Muthen posted on Sunday, October 19, 2014 - 1:05 pm

I think Kris Preacher has a website for creating this type of plot.

Daniel Kopala-Sibley posted on Sunday, October 19, 2014 - 4:03 pm

Thank you for the suggestion. He doesn't seem to have any utilities for calculating power for a moderated mediation model, but he did post some code to do it, which I modified for my purposes. Just to be sure I'm understanding my output correctly, the following code within a Monte Carlo power simulation yields power analyses for both the regression estimate of XM on X, as well as to detect the newly defined IND effect. If I want to calculate power for detecting moderated mediation, am I primarily interested in the power for the IND effect or for detecting the effect of X on XM?

MODEL:
y on m (b1)
x
xm (b2);
m on x (a1);
xm with m;
MODEL CONSTRAINT:
new (ind xmodval);
xmodval = -1;
ind = a1*(b1+b2*xmodval);

Thanks again, this has been quite helpful.

Bengt O. Muthen posted on Monday, October 20, 2014 - 7:44 am

I think you could be interested in the power for both "b2" and "ind".

But I am not sure you want to have

xm with m;

in the model.
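
To make the "ind" expression from the MODEL CONSTRAINT block above concrete, here is a minimal Python sketch that evaluates a1*(b1 + b2*xmodval) at several moderator values. The coefficient values are hypothetical, and this only illustrates the arithmetic; it says nothing about standard errors or power.

```python
def conditional_indirect(a1, b1, b2, xmodval):
    """Indirect effect at a chosen moderator value: a1 * (b1 + b2 * xmodval)."""
    return a1 * (b1 + b2 * xmodval)

# Hypothetical coefficients, for illustration only
a1, b1, b2 = 0.5, 0.4, 0.2

# Evaluate at low, average, and high moderator values
effects = {v: conditional_indirect(a1, b1, b2, v) for v in (-1.0, 0.0, 1.0)}
for value, effect in effects.items():
    print(value, effect)
```

When b2 is zero, the indirect effect is the same at every moderator value, which is why power for "b2" speaks to the moderation and power for "ind" speaks to the indirect effect at one chosen value.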

Rachel posted on Thursday, October 23, 2014 - 9:24 pm

Hello,

I am doing SEM mediation analysis (observed continuous variables) comparing two ethnic groups. I compared my baseline model to my fully constrained model using the Satorra-Bentler chi-square difference test, which showed a degradation in model fit. This means that the two groups are different, correct? How would I find out which individual paths they differ on? Would I remove one constraint at a time from my more restricted model and do the Satorra-Bentler calculations again?

Linda K. Muthen posted on Friday, October 24, 2014 - 8:01 am

You can look at modification indices.

Rachel posted on Friday, October 24, 2014 - 8:54 am

So if there is a degradation in model fit then the two groups differ, yes? And I should request modification indices for my baseline model?

Bengt O. Muthen posted on Friday, October 24, 2014 - 10:23 am

You want the modindices for the fully constrained model.

When there is a degradation in model fit, that is, large modindices, the groups differ.

Steven A. Miller posted on Friday, November 21, 2014 - 7:33 am

Is there a way to have Mplus read data in from two different files? I'm trying to do multigroup analysis and I have only summary data for one group but the actual data set for the second group.

Thanks,
Steve

Bengt O. Muthen posted on Friday, November 21, 2014 - 7:55 am

You can create summary data for the second group and read in summary data for both groups.

Steven A. Miller posted on Friday, November 21, 2014 - 8:19 am

Thank you for your quick response, Bengt. Is there an example of multigroup analysis based on summary data of this sort anywhere? How do I tell Mplus that part of the summary data is for group 1 and part is for group 2?

Thanks,
Steve

Linda K. Muthen posted on Friday, November 21, 2014 - 10:09 am

See pages 483-484 of the user's guide under Summary Data, One Data Set.

Nazli Baydar posted on Sunday, January 11, 2015 - 2:46 am

I have a "missing by design" problem. I have two groups, just about 50% of the sample in each. I get the error:

THE MISSING DATA EM ALGORITHM FOR THE H1 MODEL
HAS NOT CONVERGED WITH RESPECT TO THE LOGLIKELIHOOD
FUNCTION.
The question may be framed more generally as how to have different model structures for two groups. Essentially, for group 1: age3 --> age 4 --> age 5;
and for group 2: age3 --> age 4 --> age 5 --> age 6.

The discussion boards suggest that this error emerges because the coverage is low for one group. But of course, that is the point of "missing by design".

I would appreciate your suggestions. The inp file was as follows:
DATA:
FILE IS 'missing by design test mplus data v01.dat';
VARIABLE:
NAMES ARE anketno
a3ecbi a3pun a3homep
a4ecbi a4pun a4homep
a5ecbi a5pun a5homep
a6ecbi a6pun a6homep
schpat;
USEVARIABLES ARE
a3ecbi a4ecbi a5ecbi a6ecbi schpat ;
PATTERN is schpat(1=a3ecbi a4ecbi a5ecbi a6ecbi
2 = a3ecbi a4ecbi a5ecbi );
missing=all (-999);
model:
!structural
! Age 4;
a4ecbi on a3ecbi ;
!Age 5;
a5ecbi on a3ecbi a4ecbi ;
!Age 6 ;
a6ecbi on a3ecbi a4ecbi a5ecbi ;

Bengt O. Muthen posted on Monday, January 12, 2015 - 10:56 am

Try H1iterations = 5000; in the Analysis command. You can also say NoCHI in the Output command to suppress H1 calculations but then you won't get chi-square test of model fit.

You can also run this as a 2-group run to make H1 computations easier, but then you have to handle the difference in the number of variables for the two groups (see FAQ on that).

Nazli Baydar posted on Monday, January 12, 2015 - 11:11 am

Yes, I got a FAQ sheet on that a few minutes ago but that was not much help because it is referring to a model but that model is not specified.

Here is what I got:
"Note that
this output shows only one missing data pattern for both groups whereas
***females have two patterns***. If you do the analysis in two steps, saving the
data and then analyzing it, you see ** two missing data patterns for females **
and get the same results as doing it in one step. "

Bengt O. Muthen posted on Monday, January 12, 2015 - 11:19 am

All you need is the general statement in that FAQ:

For a dependent variable, it is best to create a missing value flag for that
variable in the group that does not have that variable using the DEFINE
command. You also need to fix the residual variance of the variable to a
very small value, or hold it equal to the other group (the estimate will not
be affected by the group that has missing data). Fixing it to zero creates
a non-invertible estimated covariance matrix.

Bengt O. Muthen posted on Monday, January 12, 2015 - 11:26 am

But trying an increase in the H1 iterations in the single-group approach should be a first step. I assume you don't have coverage=0 for any pairs of variables.

Nazli Baydar posted on Monday, January 12, 2015 - 11:42 am

When I do that, MPLUS never proceeds to estimating the model at all. It quits when it sees the variable for one group totally missing, but that is the whole point! The variable is missing by design! Here is the output that I am getting:
*** ERROR
One or more variables in the data set have no non-missing values.
Check your data and format statement.

Group AGE7SCH

Continuous Number of
Variable Observations Variance

A4ECBI 394 0.031
A5ECBI 393 0.029
A6ECBI 394 0.026
A3ECBI 394 0.026

Group AGE6SCH

Continuous Number of
Variable Observations Variance

A4ECBI 404 0.029
A5ECBI 400 0.027
**A6ECBI 0
A3ECBI 404 0.025

Bengt O. Muthen posted on Monday, January 12, 2015 - 5:05 pm

I was suggesting increasing H1iterations in the single-group approach (your original run), not the two-group approach.

If that doesn't work, you can send your output from this single-group run and the output from your two-group run, plus data and license number to support.

thanoon younis posted on Friday, April 10, 2015 - 8:14 am

I have a question regarding the distribution of parameters in SEMs. Can I use my own distributions for SEM parameters? For example, the distribution of the variance of the dependent variable, psi, is gamma, so can I change it to another distribution? Is that correct?

Many thanks in advance

Bengt O. Muthen posted on Friday, April 10, 2015 - 9:53 am

I think you are referring to Bayes estimation and priors. The prior choices are described on page 698 of the UG.

thanoon younis posted on Friday, April 10, 2015 - 8:06 pm

Thank you very much for your quick response. I want to ask about page 698: can I use the available distributions, for example gamma, uniform, or log normal, for psi, or just the default prior (inverse gamma)?

Many thanks again

Daniel Lee posted on Saturday, April 11, 2015 - 1:20 pm

Hi Dr. Muthen,

If you have a saturated model: 2 factor structure with 2 items loading on each factor, can you conduct a multiple group CFA analysis?

Bengt O. Muthen posted on Sunday, April 12, 2015 - 5:25 pm

Yes.

Daniel Lee posted on Monday, April 13, 2015 - 6:50 pm

Thank you!!

thanoon younis posted on Monday, April 13, 2015 - 9:09 pm

Dear Prof. Muthen,
I want to ask about page 698: can I use the available distributions, for example gamma, uniform, or log normal, for psi, or just the default prior (inverse gamma)?

Many thanks again

Bengt O. Muthen posted on Tuesday, April 14, 2015 - 8:19 am

You can only use what is mentioned for Psi on that page.

Bengt O. Muthen posted on Tuesday, April 14, 2015 - 8:40 am

I should add that

"The Mplus default variance prior is IG(-1,0) which implies a uniform prior ranging from minus infinity to plus infinity."

This is mentioned on page 22 of the paper on our website:

Muthén, B. (2010). Bayesian analysis in Mplus: A brief introduction. Technical Report. Version 3.
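
One way to see why IG(-1,0) behaves like a flat prior is to evaluate the unnormalized inverse-gamma density, x^(-alpha-1) * exp(-beta/x), at those settings. The Python sketch below does this for a few positive values; the function name is hypothetical and the sketch only covers positive x, where this formula is defined.

```python
import math

def inverse_gamma_kernel(x, alpha, beta):
    """Unnormalized inverse-gamma density x**(-alpha - 1) * exp(-beta / x)."""
    if x <= 0:
        raise ValueError("this sketch evaluates the kernel for x > 0 only")
    return x ** (-alpha - 1.0) * math.exp(-beta / x)

# With alpha = -1 and beta = 0 the kernel is x**0 * exp(0), a constant
flat = [inverse_gamma_kernel(x, alpha=-1.0, beta=0.0)
        for x in (0.01, 1.0, 250.0)]
print(flat)

# For contrast, an informative choice gives different heights at each x
informative = [inverse_gamma_kernel(x, alpha=2.0, beta=1.0)
               for x in (0.5, 1.0, 2.0)]
print(informative)
```

Because the kernel is the same at every value, no region of the parameter is favored, and the prior cannot be normalized to a proper density.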

Daniel Leopold posted on Friday, April 17, 2015 - 8:38 pm

Profs. Muthén,

I am conducting a longitudinal cross-lag model with 6 waves and three latent factors at each wave. I've also 1) used parcels to simplify the estimation process, 2) have a large sample size, and 3) have already demonstrated measurement invariance prior to estimation of the full cross-lagged models.

I'm wondering how you'd suggest that I test gender differences in the individual stability and cross-lag paths, given that the significance of these differences is of primary interest (I'm currently using the GROUPING command to separate the models by gender). Because the Model Constraint and Model Test commands perform constraints/tests simultaneously, rather than one-by-one, and I don't think the DO option would solve this issue either, I'm at a bit of a loss as to how to accomplish these tests of the regression paths. Would you recommend another command option, or perhaps using the path estimates and SEs to explore significant differences using the confidence intervals (e.g., seeing if "x1-x2-1.96*sqrt(se1^2+se2^2) > 0")? If the latter, should I use the unstandardized or the standardized estimates? Thank you for your assistance, and please let me know if you need more information prior to making a suggestion.

I appreciate the amazing help and resource you provide to all of us modelers! It's a pleasure to use your software and the supporting information.

Bengt O. Muthen posted on Saturday, April 18, 2015 - 8:03 am

You can simply do separate runs using Model Test for each coefficient you want to test the difference across groups for. Another approach is to hold all of them equal across groups and then check Modindices for which ones are not equal.
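
For a single coefficient, the hand calculation described in the question can be sketched in Python as below. This is only an approximation to what a one-parameter Model Test reports: it assumes the two group estimates are independent (zero sampling covariance), and the function name and numbers are hypothetical.

```python
import math

def coefficient_difference_test(b1, se1, b2, se2):
    """z-test and 95% CI for b1 - b2, assuming independent estimates."""
    diff = b1 - b2
    se_diff = math.sqrt(se1 ** 2 + se2 ** 2)
    z = diff / se_diff
    # Two-sided p-value from the standard normal distribution
    p = math.erfc(abs(z) / math.sqrt(2.0))
    ci = (diff - 1.96 * se_diff, diff + 1.96 * se_diff)
    return z, p, ci

# Hypothetical path estimates for two groups, for illustration only
z, p, ci = coefficient_difference_test(b1=0.50, se1=0.03, b2=0.40, se2=0.04)
print(z, p, ci)
```

Under the independence assumption, z squared is the one-degree-of-freedom Wald chi-square for the same difference.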

Daniel Leopold posted on Saturday, April 18, 2015 - 8:15 pm

Thank you, Bengt. In order to save time initially, I prefer the latter option of holding paths equal across groups and looking at Modindices. Using a theoretical example with 3 waves and 2 latent factors at each wave, would I do this by requesting all Modindices on something like the following:

Grouping is (0 = female 1 = male);

MODEL:
A3 ON A2 (1);
A3 ON B2 (2);
A2 ON A1 (3);
A2 ON B1 (4);

MODEL male:
A3 ON A2 (1);
A3 ON B2 (2);
A2 ON A1 (3);
A2 ON B1 (4);

Would the provided ON/BY Modindices be for the release of the equality constraints (i.e., 1, 2, 3, 4), or something else?

Daniel Leopold posted on Saturday, April 18, 2015 - 8:19 pm

P.S. I obviously meant the following:

GROUPING IS
gender (0 = female 1 = male);

And, for what it's worth, I'm using the strong invariant model as a baseline for these tests (with latent factor loadings and latent means held equal across waves).

Thank you!
Dan

Bengt O. Muthen posted on Sunday, April 19, 2015 - 9:04 am

That's right.

Alicia Lozano posted on Tuesday, April 21, 2015 - 1:52 pm

I am trying to run a path analysis with both continuous and categorical variables. My continuous variable (x2) is a dependent variable. I have a categorical variable (x1) (with three levels) predicting (x2). I have another categorical variable (y1) as a moderator of the relationship between (x1) and (x2) in my model, controlling for demographic variables (both continuous and categorical). My hypothesis predicts that (y1) interacts with (x1) in predicting (x2). I want to run this model as a multigroup analysis. Would there be anyone who can lead me step-by-step on how to test the moderating effect? I am new to this software, but eager to learn.

Bengt O. Muthen posted on Tuesday, April 21, 2015 - 6:38 pm

I'll give you pieces of what you need and then you can check the UG to learn more about it.

Use

Grouping = y1(0=zero 1=one 2=two .....);

and then for the x1 dummy variable (you have more than one):

Model:
x2 on x1 demog;
Model zero:
x2 on x1 (b0);
Model one:
x2 on x1 (b1);
Model two:
x2 on x1 (b2);

Model Test:
! equality test:
0 = b1-b0;
0 = b2-b0;
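
The two equalities in Model Test are tested jointly, giving a Wald chi-square with 2 degrees of freedom. The Python sketch below reproduces that kind of joint test by hand under the assumption that the three group slopes are independent estimates; the function name and the numbers are hypothetical, and Mplus itself uses the full estimated covariance matrix of the parameters.

```python
import math

def wald_equal_slopes(b0, b1, b2, v0, v1, v2):
    """Joint Wald test of b1 - b0 = 0 and b2 - b0 = 0 (2 df).

    v0, v1, v2 are sampling variances (squared standard errors).
    Assumption: slopes are independent across groups.
    """
    d1, d2 = b1 - b0, b2 - b0
    # Covariance matrix of (d1, d2); both differences share b0
    s11, s22, s12 = v1 + v0, v2 + v0, v0
    det = s11 * s22 - s12 ** 2
    # Quadratic form d' S^{-1} d using the 2x2 inverse
    wald = (d1 * d1 * s22 - 2.0 * d1 * d2 * s12 + d2 * d2 * s11) / det
    # Chi-square survival function with 2 df is exp(-x / 2)
    p = math.exp(-wald / 2.0)
    return wald, p

# Hypothetical slopes and variances, for illustration only
wald, p = wald_equal_slopes(b0=0.2, b1=0.5, b2=0.2,
                            v0=0.01, v1=0.01, v2=0.01)
print(wald, p)
```

A significant joint test indicates that at least one group slope differs, which is the moderation effect asked about; pairwise differences can then be examined one at a time.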

lopisok posted on Tuesday, April 28, 2015 - 2:03 am

Dear forum,

I'm trying to do a multigroup analysis for a SEM with a binary outcome. I followed the procedures as described by Byrne (2012, p.259-282) but her examples did not concern a binary outcome. As suggested by Byrne, I fixed factor means to zero and did not constrain factor loadings or intercepts to be equal. I use the WLSMV estimator to get fit indices.

My first question is whether I should also include the binary outcome in the statement that frees the intercepts across groups.

Secondly, I get the following error message but I have no clue why:
*** ERROR
The following MODEL statements are ignored:
* Statements in Group MALE:
[ HERB1 ]
[ HERB3 ]
[ HERB6 ]
[ HERB8 ]

My shortened syntax is this:

Analysis:
estimator =WLSMV;

Model:
structural model syntax...
[HERB PIW WH WTI FYS AW ZIW RL@0];

MODEL Female:

MODEL Male:
HERB by herb3 herb6 herb8; HERB BY AW5;
PIW by piw5 piw7;
WH by wh4 wh5;
WTI by wtin2 wtin3 wtin4;
FYS by fys2 fys4 fys5 fys6 fys7; FYS4 WITH FYS1; FYS5 WITH FYS1; FYS5 WITH FYS4;
AW by aw4 aw5;
ZIW by ziw5 ziw6;
RL by rl2 rl3 rl5;

[herb1-herb8 piw3-piw7 wh3-wh5 wtin1-wtin4
fys1-fys7 aw3-aw5 ziw3-ziw6 rl1-rl5];

Output:
MODINDICES TECH1;

Linda K. Muthen posted on Tuesday, April 28, 2015 - 6:24 am

I assume the herb variables are binary. Therefore thresholds not intercepts are the parameters you should refer to.

[ HERB1$1 ]
[ HERB3$1 ]
[ HERB6$1 ]
[ HERB8$1 ]

Linda K. Muthen posted on Tuesday, April 28, 2015 - 6:25 am

See multiple group analysis in the Topic 2 course handout on the website for the inputs to use to test for measurement invariance for binary items.

lopisok posted on Wednesday, April 29, 2015 - 12:38 am

Thank you very much. Both the herb and piw variables are binary indeed. I checked the topic 2 handout and applied all suggestions. Now I get the error: THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL. PROBLEM INVOLVING PARAMETER 197.

This involves the alpha matrix for the latent variable herb (based on binary variables). According to the manual this should have something to do with the means - or intercepts depending on the model - of this variable.
I specified that all the means of the latent factors are fixed to 0. With [HERB PIW WH WTI FYS AW ZIW RL@0];

Is there an exception for latent factors based on binary variables?
I'm sorry for these questions but I've never seen a good example of equivalence testing of a structural model with a binary outcome.

Kind regards,
Filip

lopisok posted on Wednesday, April 29, 2015 - 1:11 am

I do see that I did not include scale factors as mentioned in the handout.
Am I correct that I should include the following for the main model:
{herb1-herb8@1};

and this for each latent variable which is based on binary variables?

Even when the intercepts/thresholds are not constrained equal?

Linda K. Muthen posted on Wednesday, April 29, 2015 - 6:13 am

You should do exactly as shown in the handout. As shown, scale factors apply to the categorical variables, not the latent variables. For further information, listen to the video. Measurement invariance models are also described in detail in the Version 1 language addendum on the website with the user's guide.

Kelly McGinn posted on Friday, May 01, 2015 - 12:12 pm

Dear Drs. Muthen
I am running a path analysis with all observed variables on my experimental data. I have three dummy coded variables representing my four conditions with zero being the control. I have created three interaction terms with student prior knowledge in order to determine the interaction between prior knowledge (continuous) and condition (dichotomous) on post-test measures. I am attempting to use multiple group analysis to interpret the interaction. I want to not only compare the experimental conditions to the control but also to each other. Can you advise me on the input for appropriately constraining the paths to test the interactions for the varying conditions and allowing the other predictors to vary? Most examples I find in the manual and online are not for experimental data.

Bengt O. Muthen posted on Friday, May 01, 2015 - 1:54 pm

If your groups are the 4 conditions, the interactions simply translate to slope differences in the regression of post-test on prior knowledge. You can test any such difference by labeling those parameters and using Model Test.

JOEL WONG posted on Monday, May 04, 2015 - 8:45 am

I've 2 questions about the use of multigroup SEM. I used the WLSMV estimator and I've 2 groups (countries).

1. I tested a fully constrained model (factor loadings and structural paths constrained to be equal across groups) versus a fully unconstrained model. I thought that the chi square value for the constrained model would be higher or at least equal compared to the unconstrained model. Instead the chi square value went down from 647 (for the fully unconstrained model) to 613 (for the fully constrained model). Does this reflect an error in my models or is it because the two models are not directly comparable because I used the WLSMV estimator? The DIFFTEST revealed no significant change in the scaled chi square values between both models.

2. When I examined the output for the constrained model, the unstandardized coefficients are identical across both groups (as I would expect). However, why are the standardized coefficients different across both groups since I specified that the paths would be constrained to be equal? Would it be ok to report the standardized coefficients in my results?

Thank you!

Linda K. Muthen posted on Monday, May 04, 2015 - 10:48 am

1. The chi-square statistic for WLSMV does not behave in the expected way. This is why DIFFTEST is required to test for differences in chi-square for WLSMV. You cannot compare the chi-square values. Only p-values can be compared.

2. Each group is standardized using their standard deviations which are not necessarily the same in each group.
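
A small numeric illustration of point 2, as a Python sketch: with the same unstandardized slope in both groups, the usual standardization b * SD(x) / SD(y) gives different values whenever the group standard deviations differ. The function name and the numbers are hypothetical, and the formula is assumed here to correspond to a fully standardized slope.

```python
def standardized_slope(b, sd_x, sd_y):
    """Fully standardized slope: b * SD(x) / SD(y)."""
    return b * sd_x / sd_y

b = 0.50  # same unstandardized slope, held equal across groups

# Hypothetical group standard deviations, for illustration only
group_a = standardized_slope(b, sd_x=1.0, sd_y=2.0)
group_b = standardized_slope(b, sd_x=1.5, sd_y=2.0)
print(group_a, group_b)
```

So equal unstandardized coefficients and unequal standardized coefficients are not a contradiction; the equality constraint applies to the unstandardized parameters.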

chris mooney posted on Wednesday, May 13, 2015 - 1:32 pm

Hello Drs. Muthen --

I am performing a multigroup analysis. In examining measurement invariance, I have found that there is very good model fit for the configural, metric, and scalar models using the MODEL = CONFIGURAL METRIC SCALAR syntax. However, in one of my groups (female), I get an error message indicating that I have a negative residual variance for an indicator in my configural model. The sample size is quite large, and the residual variance is small and not significant. Of note, I do not get this message in the full group measurement model.

Is it appropriate to constrain the variance for this specific indicator to 0? For instance: y1@0?

Thanks,

Chris

Bengt O. Muthen posted on Wednesday, May 13, 2015 - 6:48 pm

You could - or, just report it.

Yeshim Iqbal posted on Thursday, May 28, 2015 - 11:36 am

Hello,

I am trying to run a multigroup analysis to compare a model (all observed variables) across different samples. As far as I understand, the way the syntax works is that the first MODEL command specifies the overall model (so, if I only included that, all paths would be constrained to be equal across groups), and the MODEL command followed by a label is where I specify the paths I want to set free, for the sample specified by the label.

Is this correct? I have been trying to do this - using the MODEL followed by the label command to set free particular paths - and I keep getting the same model fit results over and over again - the syntax appears to not be working.

Thanks so much for your help.

Best,

Yeshim

Bengt O. Muthen posted on Thursday, May 28, 2015 - 1:21 pm

The default with all observed variables is inequality across groups - reasoning that these are measurement-error-free "structural relationships" for which Mplus always allows variation across groups as the default. You can make them equal across groups by giving each parameter a label in the overall MODEL command (see UG).

If this doesn't help, we need to see the output.

C. Gantz posted on Thursday, June 18, 2015 - 12:43 pm

Hello Drs. Muthen,

I just ran a multiple group model. I compared the chi-square of model where I fully constrained the two models to be equal across groups to the model where all paths were freely estimated - this chi-square difference was significant. However, I am specifically interested in the group differences in three paths in the model, and when I compared models where I freed each of the three single paths to the full constrained model, the chi-square was not significantly different.

I am wondering how to interpret this - does this indicate that there was not moderation in these paths, even though the overall models were significantly different between groups?

Thank you!

Linda K. Muthen posted on Thursday, June 18, 2015 - 1:14 pm

The overall test says there is a difference somewhere. Apparently, the three paths you are interested in are not different. The differences are elsewhere.

Christy Allen posted on Wednesday, September 02, 2015 - 9:37 pm

I'm testing a mediation model (all observed; X, M and Y = continuous). I would be deeply grateful for advice.

1. I want to test if model fit is different by gender (predicting model fit will be better for women). If model fit is poor when examining gender files separately or in combined data, can I even test this hypothesis?
2. Do I need to establish measurement invariance? Is there more to that beyond comparing fit in output with fixed parameters and output with parameters constrained to be equal?
3. Would I have to do more to test whether the model fit is better in women, or could I look at the output (e.g., path coefficients) of the freely estimated model? (If freely estimated model is better fit.)
4. Given my hypothesis, should I include MODEL CONSTRAINT for the indirect effect?
5. Am I missing parameters below?

Syntax parameters free:
Variable:
Grouping is gender (0= male, 1= female);

Analysis:
Type= general;
Bootstrap= 10000;

MODEL:
M on X;
Y on X M Cov1 Cov2 Cov3;
Cov3 WITH Z;

Model indirect:
Y IND X;

Syntax parameters constrained equal:
Variable:
Grouping is gender (0= male, 1= female);

Analysis:
Type= general;
Bootstrap= 10000;

MODEL:
M on X (1);
Y on X M Cov1 Cov2 Cov3 (2);
Cov3 WITH Z (3);

Model indirect:
Y IND X;

Bengt O. Muthen posted on Thursday, September 03, 2015 - 7:42 am

In a simple mediation model there is no model fit to be tested, that is, there are no left-out arrows.

Also, there is no measurement invariance to be tested because there is no measurement part of the model (no multiple indicators of a factor).

Your run with equalities is ok, except

Y on X M Cov1 Cov2 Cov3 (2);

should be restated because your label gives equality of the slopes on that line but you want equality across groups - so you should give a label to each slope.

Christy Allen posted on Thursday, September 03, 2015 - 6:02 pm

Dr. Muthen, thank you very much for your response!

To confirm, you are saying with my model it is not possible to answer the question as to whether model fit is better for one gender or the other since we cannot draw any conclusions about model fit for a simple mediation model?

If I still wanted to examine differences between men and women, is there a way to do so? Could I look at the relative strength of certain pathways, such as X-->M? Would I do this by running the syntax with free parameters and syntax with equal parameters, and then examine the path coefficients? Or could I just run the model in men and women separately to compare?

Thank you for the syntax correction. Is this what you were suggesting?

MODEL:
M on X (1);
Y on X (2);
Y on M (3);
Y on Cov1 (4);
Y on Cov2 (5);
Y on Cov3 (6);
Cov3 WITH Z (7);

Bengt O. Muthen posted on Thursday, September 03, 2015 - 6:15 pm

First paragraph: Right.

Second paragraph: You can test equality across groups in a single, 2-group run using labels as you have done.

Christy Allen posted on Friday, September 04, 2015 - 3:34 pm

Thank you, Dr. Muthen.

Would this be done using the chi square difference test?

Bengt O. Muthen posted on Friday, September 04, 2015 - 6:19 pm

Right.
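
For an ML run, the chi-square difference test confirmed above is simple arithmetic on the two outputs: subtract the chi-square and the degrees of freedom of the free model from those of the constrained model, then refer the difference to a chi-square distribution. A minimal Python sketch, assuming made-up fit values (not results from this thread) and hypothetical helper names `chi2_sf` and `chi2_difference`:

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a chi-square variate with integer df."""
    y = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-y)              # df = 2 starts the even series
    else:
        a, q = 0.5, math.erfc(math.sqrt(y))   # df = 1 starts the odd series
    while a < df / 2.0:
        # Recurrence for the regularized upper incomplete gamma function
        q += y ** a * math.exp(-y) / math.gamma(a + 1.0)
        a += 1.0
    return q

def chi2_difference(chi2_constrained, df_constrained, chi2_free, df_free):
    """Plain (unscaled) chi-square difference test for nested ML models."""
    diff = chi2_constrained - chi2_free
    ddf = df_constrained - df_free
    if diff < 0 or ddf <= 0:
        raise ValueError("models are not nested in the expected direction")
    return diff, ddf, chi2_sf(diff, ddf)

# Hypothetical values: constrained run chi2 = 12.40 (df 7), free run chi2 = 3.10 (df 1)
diff, ddf, p = chi2_difference(12.40, 7, 3.10, 1)
print(round(diff, 2), ddf, round(p, 3))
```

With these illustrative numbers the difference is 9.30 on 6 df, p of about 0.16, so the equalities would not be rejected. The plain difference applies to ML; scaled estimators such as MLR need a corrected version.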

Christy Allen posted on Saturday, September 05, 2015 - 6:12 pm

Thank you for all of your help, Dr. Muthen!

zeyad almutawa posted on Thursday, October 08, 2015 - 12:50 am

Hello Drs. Muthen,
I have two different questionnaires directed to two different groups of respondents (employees and customers). The first questionnaire addresses HRM and the second assesses service quality. The two questionnaires have different items. Is it possible to handle such a project in Mplus, and if so, how?

Thanks
Zeyad

Bengt O. Muthen posted on Thursday, October 08, 2015 - 12:23 pm

Yes, that would be two separate analyses because you have neither people nor items in common for the two cases.

zeyad almutawa posted on Friday, October 09, 2015 - 2:10 am

Hello Drs. Muthen,

Thank you for your prompt reply. I have one more question, please. After conducting the two separate analyses, is it possible to join all the variables into one single model, given that the first questionnaire addresses five different variables whereas the second addresses only one?

Thanks
Zeyad

Bengt O. Muthen posted on Friday, October 09, 2015 - 8:47 am

No. Because there are no items or people in common the joint model would not have different estimates than the 2 separate ones.

Rasha Qudisat posted on Friday, October 16, 2015 - 8:31 pm

Dr. Muthen,
I am running an SEM for three latent variables, and I want to test structural invariance between groups (gender). The model has configural invariance, but no metric invariance.

With only configural invariance, will I still be able to find the direct and indirect effects of gender on the latent variables?

Bengt O. Muthen posted on Sunday, October 18, 2015 - 11:02 am

No, you need metric invariance to compare structural coefficients across groups, otherwise they are not on comparable scales.

Rasha Qudisat posted on Sunday, October 18, 2015 - 11:23 am

Thank you for your answer,
I also tried to find out whether there is an interaction between gender and the latent variables (gender with guilt, and gender with shame), to cross-validate that there is no metric invariance.
In this case, how do I interpret the non-significant interactions given the absence of metric invariance?

Bengt O. Muthen posted on Sunday, October 18, 2015 - 4:35 pm

The latent variable means different things for the genders - it is like two different variables - so I don't think that absence of interaction has a clear meaning.

M.G. Keijer posted on Monday, November 09, 2015 - 1:50 am

Dear Bengt O. Muthen,

I want to run two models and a test, x1 and x2 are means of the same variables i.e.:
define:
x1 is mean (d1-d7)
x2 is mean (d1-d3)

model 1:
y on x1 (a)
model 2:
y on x2 (b)

Model test:
a=b

The idea is to check if a mean based on 3 observations fits as well as a mean based on 7 observations.

My problem is that there is no real grouping variable to separate the models.
I was thinking of something like: if x2, then d4-d7 are missing...
Are there other options to tackle such a problem?
Thank you in advance.

Bengt O. Muthen posted on Monday, November 09, 2015 - 4:12 pm

I don't see how this would be done. The regressions fit perfectly. Perhaps you can run each and just see if the slope is larger with a mean of more items.

Christine Yu posted on Thursday, December 17, 2015 - 5:52 am

Dear Drs. Muthen,

I have run a SEM MGA with moderator (for three groups) where the first group's LV means were set to 0 and the first group's LV variances were set to 1.

I have a handful of questions:

1. The other two group's LV means are now in reference to the first group, correct?
2. Is there a way to get the "true" LV means? [I tried to use the syntax to get the LV means, but the stats person on campus and I could not decipher the output?]
3. Is it possible to PLOT the moderator model for each group?

I found syntax to plot a moderator in a SEM analysis, but I don't think it did a group analysis? The syntax had set its first factor loading to zero, but my first factor loading of my LVs were allowed to be estimated since I had set the reference group LV means to zero and the reference group LV variances to one.

I was thinking it was "too much" to have a factor loading set to zero AND the LV means of the reference group also set to zero?

Essentially I'm feeling doubtful that I can plot the moderator model of this SEM MGA?

Any advice would be greatly welcomed and appreciated.

Sincerely,
Christine Yu

Bengt O. Muthen posted on Sunday, December 20, 2015 - 5:54 pm

1. Yes.

2. There are no true means apart from the means where one group is the reference group with mean zero.

3. Yes. See UG ex 3.18 using only the m on x z xz part, so deleting b from the indirect line.

Xu, Man posted on Monday, December 21, 2015 - 6:39 am

I am trying to estimate a multiple group model. One of the factors has categorical indicators. But the problem is that the items have three categories in group 1 and two categories in group 2. The model does not seem to like it. So I was wondering should I do the analysis in two different group specific models?

Bengt O. Muthen posted on Monday, December 21, 2015 - 6:23 pm

With ML you can use the option

Categorical = u1-u3(*);

described in UG Chapter 15 (page 544 of the first V7 version).

Xu, Man posted on Wednesday, December 30, 2015 - 8:26 am

Thank you. The model seemed to invoke ALGORITHM=INTEGRATION and needs to use known-class facility, for which correlated residual seem problematic. Given my data is longitudinal, correlated residuals are important.

I should note that the different number of categories in the two groups are true differences and they should not be treated as missing data (if this was dealt with in this way).

Bengt O. Muthen posted on Thursday, December 31, 2015 - 5:36 pm

I think you have only 3 time points so you could use factors to capture the residual correlation across time, one for each adjacent pair of outcomes.

Christine Yu posted on Wednesday, January 13, 2016 - 8:30 am

Dear Dr. Bengt Muthen,

Thank you for your feedback on 12/20/15.

I have some clarification questions.

Do I use my SEM MGA syntax (with its inclusion of MI, that makes some groups different from each other) but add the 3.18 model constraint syntax? And add the plot syntax? And output syntax?

My model would have the reference group syntax (model), then the two groups (model 2 and model 3), then have "model constraint", "plot", and "output"?

When you said to "delete b from the indirect line", can you explain what the references when it is in parentheses,

MODEL: y on m (b);

Do I keep that (b)??

Thank you for your help,
Christine Yu

Bengt O. Muthen posted on Wednesday, January 13, 2016 - 12:17 pm

Since this is focused on syntax, please send to Support along with your license number.

Christine Yu posted on Wednesday, January 13, 2016 - 12:33 pm

Sadly, I don't have a license number since I am a graduate student using MPLUS via the license agreement with IU Bloomington.

I'll see if I can figure it out by fiddling around :-)

Christy Allen posted on Monday, February 08, 2016 - 2:56 pm

Hello,

I am using the ML estimator to run a simple mediation model, all observed variables (includes some covariates). I want to compare differences between men and women in the sample. Based on a previous answer I've received on this site, I know I can't compare model fit due to the simplicity of the model but I can compare equality across men and women. I used the chi square difference test to do this and it appeared to work, indicating that models were not equal across gender.

However, when I try to look at any individual parameters one by one instead of the whole model (constraining just X-->M for example), the Chi square difference is a negative value.

Does that mean I cannot test individual parameters using ML with my model?

Is the only way for me to do this to use the MLR estimator (and I assume use the Satorra-Bentler scaled method)? (My committee wants me to use bootstrapping, which I know is incompatible with MLR; they do not seem inclined to change their minds.)

Thank you!!!

Linda K. Muthen posted on Monday, February 08, 2016 - 3:23 pm

We allow bootstrapping with only ML because all maximum likelihood estimators give the same parameter estimates. It is the standard errors that are bootstrapped. So MLR would not give you different results.

You can try defining your difference parameters in MODEL CONSTRAINT. You will get a z-test of the difference parameters.

Cristian Zanon posted on Wednesday, March 16, 2016 - 9:49 am

Hello,

I am testing the measurement invariance of 2 groups through exploratory structural equation modeling (MLR) and am getting this error message:"Variable I18_F4K1_T4 has INFINITY for a value at observation 13840." What does it mean? How can I fix this, so that I can run the model? The values in the data set are on z scores.

Linda K. Muthen posted on Wednesday, March 16, 2016 - 2:54 pm

Please send the output, your data, and your license number to support@statmodel.com.

Rosario Ivano Scandurra posted on Thursday, March 24, 2016 - 9:49 am

Hello, I have both continuous and categorical factor indicators in an SEM model. I would like to test the equality of structural parameters across groups and the differences among the three groups. I cannot figure out how to do that. It seems MODEL CONSTRAINT is not the solution. I hope somebody can give a hint or, better, an example.

Here is part of my code; I cannot post all of it because of the size limit.

Analysis: ESTIMATOR IS WLSMV; PARAMETERIZATION=THETA;
Model: f2 by k fin; f3 by zzh_c ggh_c qh_c bbh_c;
f4 by zzw_c ggw_c qw_c rlw_c;
f5 by l1-l10; t on pp; t on y; f5 on t; f2 on t;
f2 on pp; f2 on y; f2 on x;
f4 on f2 (p1); f4 on x; f4 on t; f4 on y;
f5 on f4; f5 on f2; f5 on pp; f5 on x; f5 on y;
f3 on f2; f5 on f3; f3 on x; f3 with f4;
Model US:
f2 by fin; f3 by ggh_c qh_c bbh_c;
f4 by ggw_c qw_c rlw_c;
f5 by l2-l10; t on pp; t on y; f5 on t; f2 on t;
f2 on pp; f2 on y; f2 on x;
f4 on f2 (p2); f4 on x; f4 on t; f4 on y;
f5 on f4; f5 on f2; f5 on pp; f5 on x; f5 on y;
f3 on f2; f5 on f3; f3 on x; f3 with f4;
Model JP:
f2 by k; f3 by ggh_c qh_c bbh_c; f4 by ggw_c qw_c rlw_c;
f5 by l2-l10; t on pp; t on y; f5 on t; f2 on t;
f2 on pp; f2 on y; f2 on x;
f4 on f2 (p3); f4 on x; f4 on t; f4 on y;
f5 on f4; f5 on f2; f5 on pp; f5 on x; f5 on y;
f3 on f2; f5 on f3; f3 on x; f3 with f4;
Model constraint:
p1=p2; p1=p3;

Greg Egerton posted on Thursday, March 24, 2016 - 12:13 pm

Hello!

I'm currently working on a series of 3 cross-lagged SEM panel models. Two of the models have 4 variables (3 latent with continuous indicators, 1 observed ordered categorical) across 3 time points (12 total variables, estimating all autoregressive and cross-lagged paths). The third model replaces the observed categorical variable with an observed continuous variable but the rest of the model is the same as the previous 2 (again, the same 3 latent variables with continuous indicators).

The 2 models with categorical variables use theta parameterization and the WLSMV estimator, whereas the 3rd model uses delta parameterization and MLR.

I performed multiple group analysis testing differences in regression paths by sex using the DIFFTEST option for the WLSMV models, and calculated a chi-square difference test for the MLR model. I found no differences in the WLSMV models by sex, but did find a sex difference in the MLR model. Upon examining the source of these differences, it appears to be due to relationships among the variables shared across all 3 models (and not the unique variable in the model).

1) Could these differences in multiple group findings between these models be due to the estimator used in these models (WLSMV vs MLR)?
2) Is there an estimator I could use for all three models in order to be sure these differences are not due to the estimator?

Thanks!

Bengt O. Muthen posted on Thursday, March 24, 2016 - 3:45 pm

Answer for Rosario:

You can do 2 runs, one of which imposes the structural equalities, and do chi-square difference testing. Or you can do a Wald chi-square test using Model Constraint to test the equalities.
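
The Wald alternative can be checked by hand for a small case like the p1 = p2, p1 = p3 hypothesis above. A minimal Python sketch, assuming you have the three estimates and their 3 x 3 sampling covariance matrix; the helper name `wald_two_equalities` and all numbers are invented for illustration:

```python
import math

def wald_two_equalities(est, cov):
    """Wald chi-square (2 df) for H0: p1 = p2 and p1 = p3.

    est: [p1, p2, p3] estimates; cov: their 3x3 sampling covariance matrix.
    """
    R = [[1.0, -1.0, 0.0],
         [1.0, 0.0, -1.0]]                     # rows encode p1 - p2 and p1 - p3
    d = [sum(R[i][k] * est[k] for k in range(3)) for i in range(2)]
    # M = R * cov * R' is the covariance matrix of the two differences
    M = [[sum(R[i][k] * cov[k][l] * R[j][l]
              for k in range(3) for l in range(3))
          for j in range(2)] for i in range(2)]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    # Quadratic form d' * inverse(M) * d, using the closed-form 2x2 inverse
    w = (d[0] * (M[1][1] * d[0] - M[0][1] * d[1])
         + d[1] * (-M[1][0] * d[0] + M[0][0] * d[1])) / det
    p_value = math.exp(-w / 2.0)               # chi-square upper tail for 2 df
    return w, p_value

# Hypothetical estimates with uncorrelated sampling errors (variance 0.01 each)
est = [0.6, 0.3, 0.3]
cov = [[0.01, 0.0, 0.0],
       [0.0, 0.01, 0.0],
       [0.0, 0.0, 0.01]]
w, p_value = wald_two_equalities(est, cov)
print(round(w, 3), round(p_value, 4))
```

For these made-up inputs W = 6.0 on 2 df, p of about 0.05. The diagonal covariance matrix is an assumption made for the example; when other parameters are held equal across groups, the group-specific estimates are generally correlated and the off-diagonal entries matter.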

Bengt O. Muthen posted on Thursday, March 24, 2016 - 3:48 pm

Answer to Greg:

1)The third model, for which you use MLR, has a continuous variable instead of a categorical one and therefore is likely to have more power to find differences.

2) You can use the same estimator for all the models: WLSMV, MLR, or Bayes.

You said Delta param and MLR. Delta is only for WLSMV.

Cheng posted on Sunday, April 03, 2016 - 8:38 pm

Dear Linda,
I am trying to run invariance tests on a CFA model. I studied your ex5.27 to ex5.27e for multiple group ESEM. I wonder whether all the commands under "Model:" and "Model g2" can be applied to multiple group CFA. I have read an SEM book by another author, where the Mplus commands under "Model:" for testing factor covariance invariance and factor mean invariance for a CFA model are different from ex5.27d and ex5.27e respectively. I wonder whether that is because the author was testing invariance for a CFA model while your ex5.27 examples are for an ESEM model. Can I email my input file to you to check whether I did it right?

Bengt O. Muthen posted on Monday, April 04, 2016 - 6:47 pm

The UG ex 5.27 is for ESEM, not for CFA.

Michelle posted on Thursday, April 21, 2016 - 12:48 pm

Dear Drs. Muthen,
I am very new to SEM and MPlus. I am testing a mediated model twice - once for men and once for women with the groupings option. The model runs and looks good, but the estimate of the dv on the mediator is exactly the same for men and women, and I don't think it should be.
What is this caused by and how can I fix it?

Linda K. Muthen posted on Thursday, April 21, 2016 - 1:14 pm

Please send the two outputs and your license number to support@statmodel.com.

Jennifer Hepditch posted on Sunday, May 01, 2016 - 8:21 pm

Hello
I am a little confused about how to follow up on (or whether I need to follow up on) an interaction using multi-group analysis. I ran a 1 df chi-square difference test comparing a path constrained to be equal for girls and boys vs. free, and found that it was significantly different between boys and girls. So, I essentially have a 2-way interaction between that IV and sex. All the resources I find on probing interactions ask for a regression coefficient for the moderator (sex) and for the interaction (IV x sex), which I do not get because I ran a multi-group model rather than putting sex in the regression (using Mplus instead of SPSS because of non-normality and controlling for clustering). Do you probe such an interaction using simple slopes? I do not seem to have the PLOT option in Mplus (student basic version).

Bengt O. Muthen posted on Monday, May 02, 2016 - 9:39 am

You can translate your estimates from the multi-group analysis to those of a regression with an interaction using MODEL CONSTRAINT. Then use PLOT and LOOP to get a plot with conf intervals that probes the interaction like in UG ex 3.18 (see our Mediation web page).
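
The translation itself is plain algebra. Writing the single regression as y = b0 + b1*x + b2*sex + b3*x*sex with sex coded 0/1, the group coded 0 has intercept b0 and slope b1, and the group coded 1 has intercept b0 + b2 and slope b1 + b3. A minimal Python sketch of that mapping, assuming hypothetical group estimates and a 0 = boys, 1 = girls coding (doing the same arithmetic on labelled parameters in MODEL CONSTRAINT is what gives you standard errors):

```python
def interaction_from_groups(int0, slope0, int1, slope1):
    """Map two-group intercepts/slopes to y = b0 + b1*x + b2*g + b3*x*g, g in {0, 1}."""
    return {
        "b0": int0,               # intercept in the group coded 0
        "b1": slope0,             # slope of x in the group coded 0
        "b2": int1 - int0,        # main effect of the grouping variable
        "b3": slope1 - slope0,    # interaction: difference between the slopes
    }

def simple_slope(b, g):
    """Slope of y on x at a given value of the 0/1 moderator."""
    return b["b1"] + b["b3"] * g

# Hypothetical multi-group estimates: boys (g = 0) and girls (g = 1)
b = interaction_from_groups(int0=1.0, slope0=0.2, int1=1.5, slope1=0.5)
print(b, simple_slope(b, 0), simple_slope(b, 1))
```

The simple slopes are just the two group-specific slopes again, which is why the multi-group output already contains the probing information; the interaction coefficient b3 is the difference parameter being tested.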

Jennifer Hepditch posted on Monday, May 02, 2016 - 10:08 am

Hello Dr Muthen,
Can you shed a bit more light on HOW I would: translate my estimates from the multi-group analysis to those of a regression with an interaction using MODEL CONSTRAINT? What would I be constraining?I am brand new to MPlus and have only used MODEL CONSTRAINT for getting Wald after constraining a path equal across sex (using GROUPING).

Also, can you clarify exactly where to get these values in MPlus (for probing another continuous x continuous observed interaction using simple slopes):
Variance of coefficient of IV:
Variance of coefficient of interaction:
Covariance of coefficients of IV and interaction:

I understand that Tech 1 assigns a number to a parameter and Tech 3 is where to find variances and covariances between coefficients, but I am not sure what those values mean (e.g. how do I write this as a number? 0.340621D+01? What does the D+01 mean etc?)
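
On the notation: `D+01` is a Fortran-style exponent, so 0.340621D+01 means 0.340621 x 10^1 = 3.40621. A small Python sketch for converting such values when copying them out of the output; the function name is hypothetical:

```python
def parse_d_notation(text):
    """Convert a Fortran-style number such as '0.340621D+01' to a float."""
    return float(text.strip().replace("D", "E").replace("d", "e"))

print(parse_d_notation("0.340621D+01"))   # 0.340621 * 10**1
print(parse_d_notation("0.125000D-02"))   # 0.125 * 10**-2
```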

Jennifer Hepditch posted on Tuesday, May 03, 2016 - 11:48 am

Hello
Can you clarify which is correct since I got two different results for testing if a path is significantly different across boys and girls:

1) Chi square diff test with 1 DF (comparing constrained and not constrained models). I actually used log-likelihood and the correction factor because I am using MLR).
RESULT: Significant

2) Wald statistic using Model Test (again constraining the path equal). RESULT: NS

My sample size is 197 and I am using Type = complex and Cluster = teacherID to account for clustering.
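
For reference, the log-likelihood version of the scaled difference test in (1) is usually computed as cd = (p0*c0 - p1*c1)/(p0 - p1) and TRd = -2*(L0 - L1)/cd, where L, c and p are the log-likelihood, scaling correction factor and number of free parameters of the nested (0) and comparison (1) models. A minimal Python sketch for the 1 df case, assuming invented input values and a hypothetical function name:

```python
import math

def scaled_ll_difference(ll0, c0, p0, ll1, c1, p1):
    """Scaled chi-square difference from MLR log-likelihoods.

    Index 0 is the nested (constrained) model, index 1 the comparison model.
    """
    cd = (p0 * c0 - p1 * c1) / (p0 - p1)   # difference-test scaling correction
    trd = -2.0 * (ll0 - ll1) / cd          # scaled difference statistic
    ddf = p1 - p0                          # number of constraints tested
    return trd, ddf, cd

# Hypothetical values: one path held equal across groups (20 vs. 21 parameters)
trd, ddf, cd = scaled_ll_difference(ll0=-1500.0, c0=1.20, p0=20,
                                    ll1=-1496.0, c1=1.25, p1=21)
p_value = math.erfc(math.sqrt(trd / 2.0))  # chi-square upper tail, valid for 1 df only
print(round(cd, 3), round(trd, 3), ddf, round(p_value, 3))
```

With these made-up inputs cd = 2.25 and TRd is about 3.56 on 1 df, p of about 0.06. Note that cd can come out negative in small samples, in which case the statistic is not usable as computed here.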

Bengt O. Muthen posted on Wednesday, May 04, 2016 - 9:21 am

They can disagree for smaller samples. Perhaps you have few clusters? You can use a third method as an arbiter here - use bootstrapping and Model Constraint where you create a NEW parameter as the difference between the boys and girls parameters.

Bengt O. Muthen posted on Wednesday, May 04, 2016 - 9:23 am

Actually, bootstrapping is not available for Type=Complex.

Colleen Ray posted on Thursday, May 12, 2016 - 7:55 am

Hello. I am running a multiple group path analysis, and running into some issues with data being "missing" on the grouping variable. I am grouping the models by location, as the data was collected at two sites. Checking the data in other programs, there are no missing. I convert the data to be used in Mplus (through Stat Transfer or by using a .csv file), but still have issues. Parts of my syntax are below.

Variable: NAMES ARE
ID_Num
female
Greek
phyabusescaleT
selfctrlSC
entitleSC
sexriskSC
riskdrugbeh
drinkSC
DVperpD
location;

USEVARIABLES ARE
female
Greek
phyabusescaleT
selfctrlSC
entitleSC
sexriskSC
riskdrugbeh
DVperpD
location;

MISSING ARE all (9999);

GROUPING IS location (0=GA 1=UNL);

Analysis: type= mgroup;

----It produces the following:
*** WARNING
Data set contains unknown or missing values for GROUPING,
PATTERN, COHORT, CLUSTER and/or STRATIFICATION variables.
Number of cases with unknown or missing values: 336

Thanks in advance for any help that you can provide!

Linda K. Muthen posted on Thursday, May 12, 2016 - 9:03 am

Perhaps you have blanks in the data set causing the data to be misread. If not, send the files and your license number to support@statmodel.com.

Gabriel van Beusekom posted on Tuesday, June 21, 2016 - 6:13 am

Dear Dr. Muthen

I have assessed a unigroup model (cross-lagged model of longitudinal data with 3 measurement occasions). I arrived at a final, most parsimonious model by testing constraints.

Now I want to test for sex differences in my model with multigroup analyses.

Can I use the parsimonious model and test differences in that model between boys and girls? That is, assume that the constrained paths in the unigroup model also exist for boys and girls, and then assess whether these constrained paths are equal for boys and girls or should be freely estimated.

Thank you for your time

Bengt O. Muthen posted on Tuesday, June 21, 2016 - 9:26 am

Optimally, you should start with each of the two groups separately and see if the constraints fit.

Stephanie S posted on Sunday, August 07, 2016 - 2:33 pm

Hi Drs. Muthen,

I am running a multiple group (male, female) simple mediation model with latent variables (all continuous observed variables). I am using MLR estimation. My research question requires that I look at gender differences across models. I have already established model fit. In order to look at gender differences in the model, I constrained all paths to be equal (across models) and compared the fit of the constrained vs. unconstrained models utilizing Satorra-Bentler scaled chi-square difference testing. I then looked at modification indices to determine which constraints should be released to determine model fit. This, however, does not tell me if the indirect effect is significantly different across models so I then used the Model Constraint option to examine differences in the indirect effect.

Would you recommend these analyses or is there a more efficient way to look at significant differences in both the direct and indirect effects across groups? I am happy to provide my syntax. Thank you for your help. This message board has been extremely helpful to me.

Bengt O. Muthen posted on Monday, August 08, 2016 - 3:20 pm

Your Model Constraint approach is what I would recommend.

Manni posted on Thursday, November 10, 2016 - 6:19 am

Dear MPlus Team,

I used multiple imputation (100 data sets; missings: T1: 0%; T2: 25% T3: 50%, T4: 50%; sample size 450 each; MLR) and latent change score models to compare two groups.

I used the Wald test to see if a model assuming equal means of corresponding change scores across groups fits the data worse. This test is not significant. However, if I add the tested equality constraints to the model, the fit gets worse (RMSEA .000, CFI 1.000, df = 1, chi² = .597 vs. RMSEA .044, CFI .993, df = 4, chi² = 8.997, df = 3).

If I test the equality constraints separately using the Wald test, one becomes significant. If I estimate this difference freely and constrain only the two other differences, the fit is much better (RMSEA .005, CFI 1.000, df = 3, chi² = 2.259).

I wonder why the Wald test using all constraints vs. only one constraint comes to different conclusions (I thought testing all simultaneously should be more powerful). Is this an issue due to lower power of the Wald test when multiple parameters are compared using MI with a lot of missing data? What would be a good way to handle this? Many thanks in advance!

Tihomir Asparouhov posted on Thursday, November 10, 2016 - 11:55 am

Testing one constraint at a time is fine and correct. Testing multiple constraints at the same time runs into a glitch and the results are not correct (which explains what you have noticed). The corrected version will be in the next release. Again, this is in regard to using MODEL TEST with multiple imputations and multiple constraint equations.

Manni posted on Monday, November 14, 2016 - 3:12 am

Many thanks for your quick and helpful response! Could you tell me when the next release will be?

Bengt O. Muthen posted on Monday, November 14, 2016 - 7:50 am

It is not possible to predict because the development involves research. It is several months away.

Konstantin Tskhay posted on Sunday, November 20, 2016 - 7:48 am

Hi,

I am looking to run the following model:

VARIABLE:
NAMES = id att chr exp int pod lead sex race arg eye glasses accent complete time;
USEVARIABLES = att chr pod lead sex race arg eye glasses;
GROUPING = time (5=5seconds 15=15seconds 30=30seconds);

ANALYSIS:
TYPE = MGROUP;
ESTIMATOR = MLR;

MODEL:
chr ON arg eye glasses race sex att;
lead ON chr;
pod ON chr;

lead WITH chr pod;
chr WITH pod;

There are three groups with all constraints added in additional statements. The model runs.

However, I would like to estimate cluster-robust standard errors based on the "id" variable. However, TYPE = MGROUPS does not allow me to estimate the model .

I have tried to estimate the model in complex, however, the model results (SEs) do not converge with the results obtained in STATA. Any suggestions would be appreciated greatly!

Many thanks,

Konstantin

Bengt O. Muthen posted on Monday, November 21, 2016 - 4:49 pm

I don't know why you have both

lead on chr;

as well as

lead with chr;

Type=Mgroups is not a currently existing option.

I would use Type=Complex. Check that your Mplus model has the same number of parameters and the same loglikelihood value as the model you analyzed in Stata.

Grace Quib posted on Wednesday, November 30, 2016 - 1:10 am

I'm very new to the SEM approach and Mplus and have a question about a two-group model I am running.

I have established measurement invariance for the 3 latent variables in my model. Would it be okay to run my model separately with each group instead of running multiple group analysis? My goal is not to directly compare path coefficients but to see whether the pathways are the same/different for each group.

The reason I ask is because running multiple group does not give me fit statistics for both groups. Would it be justifiable for me to run the models separately?

Thanks!

Linda K. Muthen posted on Wednesday, November 30, 2016 - 6:04 pm

I would run them separately as a first step to determine if the same model fits well in each group. I would only go on to multiple group analysis if the same model fits well in each group.

Grace Quib posted on Wednesday, November 30, 2016 - 7:30 pm

Thank you very much for the reply.

Just a follow-up question. I ran the models separately and the fit was satisfactory for both groups. Could I stop there and use those results if my goal is just to compare differences in pathways and not directly comparing the coefficients? Or must I run multiple group analysis?

Thanks!

Bengt O. Muthen posted on Thursday, December 01, 2016 - 10:33 am

You would run a multiple group analysis if you have some parameters that you want to hold equal across groups - such as with measurement invariance across groups.

Kathy Xiao posted on Tuesday, December 13, 2016 - 11:06 am

Dear Dr. Muthens,

I am doing a SEM model with mediations like:
X->M1->Y
X->M2->Y
X->M2->M1->Y
All the X, Y, M1, M2 are latent variables; actually X is constructed from 3 other latent variables.
I also have covariates (C1, C2, C3) on Y.

The initial test of SEM with the whole sample is significant with good model fit.

But then I need to do multiple group analysis to test whether this SEM model is different in terms of race (3 Groups: Black, White, Others) and SES (2 Groups: low, high SES).

So I proposed the following analytic strategies:
(1) Test the Measurement Invariance of the constructs (X, Y, M1, M2) step by step (configural, factor loading, intercept, residual, factor mean, covariates)
(2) Test the Structural Invariance through Regression Coefficient.

Is it a reasonable approach?

Kathy Xiao posted on Tuesday, December 13, 2016 - 11:07 am

Also I am confused with this process when i do the analysis. I would like to ask:

(1) Do I really need to do the Measurement Invariance Test before testing Structural Invariance?
(2) Do I need to go through all the steps to test the MI for each of the constructs I have in the model separately? Or can I test only factor loading? Or can I do that simultaneously? (since the key question I want to ask is not the CFA, but the SEM)
(3) How can I do MI and SI for three groups? I read through the Mplus Guide and many online resources, they are basically for 2 groups. Is there any syntax code for 3 group analysis?
(4) To test the structural invariance, I saw many papers just test the regression coefficients. Is it acceptable?
Actually, what are the syntax for comparing it? I know some of the parameters are from TECH1 TECH4, but would you help me point them out where are these results in output?

I can provide the syntax and more background information if necessary.

If you would recommend any resource on this related question, please feel free to post and I can go to study.

Thanks and look forward to your reply!

Bengt O. Muthen posted on Tuesday, December 13, 2016 - 6:17 pm

We request that postings are limited to one window. And general analysis strategy questions (not Mplus-specific) are better suited to SEMNET.

First posting: Yes, this is a reasonable approach.

Second posting:

(1) Yes, at least invariance of loadings.

(2)Simultaneously

(3)No extra feature is introduced going from 2 to 3 groups.

(4)Run with and without the equality restrictions and compute the likelihood-ratio chi-square (see our short courses on the web).

Kathy Xiao posted on Tuesday, December 13, 2016 - 7:00 pm

Thank you Dr. Muthen for your reply! I will post these questions in one window under general analysis topic in the future.

Just a follow up question for (3). If no extra feature for 3 groups, how can I write the Mplus syntax step by step to compare for 3 groups? Say I want to compare whether Black youth differs from White and Other racial/ethnical groups in my proposed model?

(4) Would you mind specifying which topic in the short courses on the web covers the multiple group analysis?

Thanks so much!

Linda K. Muthen posted on Wednesday, December 14, 2016 - 6:44 am

Topic 1 covers multiple group for continuous outcomes. Topic 2 covers multiple group analysis for categorical outcomes. These should answer your other question.

Kathy Xiao posted on Wednesday, December 14, 2016 - 1:25 pm

Thanks for your reply Dr. Muthen.
I read through Topic 1 as my outcome is ordinal (5-point, 1=Mostly D's and F's to 5 Mostly A's and B's, for students' grade)
But do you mind specifying that if I want to compare 3 groups (White, Black, Others) rather than 2 groups. What group-specific model syntax shall I put under the main model? Shall I put the group that I am interested in (e.g. Model Black) and freed the factor loading and intercepts?
MODEL:
FU by F21a-F21l;
NSA by revN6a-revN6h;
NSU by revN2a-revN2g;
NY by revN3a-revN3h ;
NQU by NSA NSU NY;
PD by H1-H6;
[FU NSA NSU NY NQU PD@0];
PD on NQU(1);
S1 on NQU(2);
S1 on PD(3);
S1 on FU(4);
S1 on A1 (5);
S1 on A2 (6);
FU on NQU(7);
PD on FU(8);
MODEL INDIRECT:
S1 IND PD FU NQU;
S1 IND FU NQU;
S1 IND PD NQU;
MODEL Black:
PD on NQU (b1);
S1 on NQU (b2);
S1 on PD (b3);
S1 on FU (b4);
S1 on A1 (b5);
S1 on A2 (b6);
FU on NQU (b7);
PD on FU (b8);

Linda K. Muthen posted on Wednesday, December 14, 2016 - 2:23 pm

If you want to compare three groups, you have three group-specific MODEL commands instead of 2. You can label parameters in the three groups and use MODEL CONSTRAINT to compare differences:

MODEL:
y ON x;
MODEL g1:
y ON x (p1);
MODEL g2:
y ON x (p2);
MODEL g3:
y ON x (p3);

MODEL CONSTRAINT:
NEW (diff1 diff2);
diff1 = p1 - p2;
diff2 = p1 - p3;

Kathy Xiao posted on Wednesday, December 14, 2016 - 3:27 pm

Thanks for your reply Dr. Muthen!

So these comparisons are done after I have tested the model for separate group->factor loading invariance-> factor loading and intercepts invariance->add covariates, is that right?

Or for structural invariance, we only need to test the structural model using the syntax above?

Do we need to add constraints for each group-specific model? (like what is written in the handout p. 219: "MODEL male: [rsci malg];")

Thanks!

Yeshim Iqbal posted on Thursday, December 15, 2016 - 6:05 am

Hello,

I am running a path analysis across 4 groups in which some paths are constrained and some paths are free according to theory. When I run the model across all 4 groups, I find differences across the groups and would like to test specifically which groups those differences are on. I did this by comparing two groups at a time (by comparing the models when a particular path is constrained or set free and running the multigroup model across only two groups - so I did this multiple times, to compare all the possibilities and figure out where the significant differences were) However, I find that some of the path coefficients change when I run the model comparing only the two groups vs. all four. How do I address this problem?

Is there a different way to test the differences between coefficients across the groups?

Thanks a lot for your help.

Linda K. Muthen posted on Thursday, December 15, 2016 - 8:15 am

You can use MODEL CONSTRAINT. Label the parameters with different labels. Create difference parameters.

MODEL:
y ON x;
MODEL g1:
y on x (p1);
MODEL g2:
y ON x (p2);

MODEL CONSTRAINT:
NEW (diff);
diff = p1 - p2;

You can create many of these. They are independent tests.

Kathy Xiao posted on Thursday, December 15, 2016 - 8:26 am

Dear Dr. Muthen,

Shall we use MODEL CONSTRAINT only after we have checked the model for separate groups -> factor loading invariance -> factor loading and intercept invariance -> add covariates?

Also, may I know the meaning of this "diff" test? Does it subtract the regression coefficient of each path?
(e.g. if p1 and p2 are both negative, what does the negative sign of "diff" mean?)

Many thanks!

Linda K. Muthen posted on Thursday, December 15, 2016 - 3:50 pm

You need to establish measurement invariance for any comparisons of structural parameters to make sense.

The diff parameter is the difference between the two coefficients. The test is whether that difference is significantly different from zero. If it is not, the two coefficients are not significantly different from each other.
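
As a rough numeric illustration of what the diff test does (a sketch in Python, outside Mplus; the estimates and standard error below are made-up values, and `wald_z_test` is a hypothetical helper name), the estimate of diff is divided by its standard error and the ratio is referred to a standard normal distribution. The sign of diff only shows which coefficient is larger: with p1 = -0.30 and p2 = -0.10, a diff of -0.20 means p1 is the more negative of the two.

```python
import math

def wald_z_test(estimate, se):
    """Return (z, two-sided p-value) for H0: parameter = 0."""
    z = estimate / se
    # Two-sided normal tail probability: P(|Z| > |z|) = erfc(|z| / sqrt(2))
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p

# Hypothetical values: p1 = -0.30, p2 = -0.10, SE of the difference = 0.08
diff = -0.30 - (-0.10)
z, p = wald_z_test(diff, 0.08)
print(f"diff = {diff:.2f}, z = {z:.2f}, p = {p:.4f}")
```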

Kathy Xiao posted on Thursday, December 15, 2016 - 8:32 pm

Thanks very much Dr. Muthen!

Katy Roche posted on Friday, January 13, 2017 - 9:36 am

I am running a multi-group model using the type=imputation command. The model is running fine (no parameter constraints across groups). However, I am testing for mediation and am aware that I cannot use MODEL indirect commands to do this when using TYPE=IMPUTATION. Therefore I am using the MODEL CONSTRAINT command to test indirect pathways. The model runs but I am only getting one set of estimated parameters for indirect paths.

My syntax includes grouping is statement. I want to freely estimate parameters so I am not specifying each group in a separate model statement. My constraint syntax is as follows

Model Constraint:

new (pind1 Ptot1);
pind1 = p1*p2;
ptot1 = pind1+p3;

Why am I only getting a single set of parameter estimates for pind1 and ptot1 when for all other parameters I get a separate set of estimates for each group?

Linda K. Muthen posted on Friday, January 13, 2017 - 4:18 pm

You need to give different labels in each group to define the indirect effects in each group.

Jamie Summers posted on Sunday, February 05, 2017 - 1:01 am

Hello! I'm seeking to compare the fit of my path model across racial groups (4 groups). The SB scaled chi-square difference test between my general and constrained models was significant, suggesting that the constrained model does not fit well across groups.

Could you explain what post-hoc analyses I would do at this point? I would like to explore how the relationships between variables are different across groups. I believe I am to free individual parameters to try to increase fit (to identify which paths are not equal across groups)?

Any help you can provide would be great! Thank you!

Linda K. Muthen posted on Sunday, February 05, 2017 - 6:54 am

Modification indices can help you find where the differences may be.

Jamie Summers posted on Monday, February 06, 2017 - 8:28 pm

Thanks for your response, Dr. Muthen.

I'm having some difficulty getting my model to converge. I'm seeking to compare 4 groups. When I add the grouping variable, it doesn't converge; however, when I run any combination of 3 groups, it appears fine. Any ideas why this is happening?

Thank you!

Linda K. Muthen posted on Tuesday, February 07, 2017 - 6:12 am

The first step is to run each group separately. If the same model does not fit well in each group, multiple group analysis is not called for.

Jingjing Li posted on Monday, June 12, 2017 - 9:20 am

Dear Dr. Muthen,

I have two questions about a 2-group path analysis (AKA, 2-group multivariate multiple regression). Plain regression. No mediation.

My model includes 4 dichotomized dependent variables and 10 dummy independent variables; the grouping variable is gender. I want to compare the coefficients of all dummy variables between groups.

I assigned each coefficient a label and ran MODEL TEST for one comparison at a time.
So in total, I ran MODEL TEST 40 times to cover all comparisons.

My first question is: would running MODEL TEST 40 times increase my Type I error? If so, is there an alternative way to compare the coefficients?

Second question -- if the coefficients for bisexual in male sample (a1) and in female sample (b1) are significantly different, what's the interpretation? Gender does moderate the relationship between bisexual and the outcome (any cigarette use)?

MODEL male:
Anycigs on bisexual (a1)
Ethnic parented black others homo hetero public Tech HBCU;

MODEL female:
Anycigs on bisexual (b1)
Ethnic parented black others homo hetero public Tech HBCU;

ANALYSIS:
ESTIMATOR IS WLS;

Model test:
a1=b1;
output: stdyx;

Thank you so much!

Bengt O. Muthen posted on Monday, June 12, 2017 - 5:51 pm

Q1: You can test all 40 at the same time.

Q2: Yes.

Jingjing Li posted on Monday, June 12, 2017 - 6:05 pm

Thank you so much Dr Muthen!!

For Q1, however, I have tried to test 40 coefficients at the same time with MODEL TEST using the following code:
MODEL TEST
a1=b1
a2=b2
....
a40=b40.

However, the output only gives me one Wald test (i.e., one p-value) in the model fit section. It does not give me the 40 p-values that I need.

What might be the correct code for testing all 40 together? Repeating MODEL TEST 40 times in the same input, as below?

Model test
a1=b1

Model test
a2=b2

Model test
a3=b3

....

Thank you so much for your kind guidance!!

Bengt O. Muthen posted on Tuesday, June 13, 2017 - 6:20 pm

Yes, repeat the Model Test run 40 times.
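
On the Type I error question raised above: a single Wald test of all 40 equalities addresses the joint hypothesis, whereas 40 separate runs each carry their own error rate. One common and conservative adjustment, sketched here in Python, is Bonferroni: compare each p-value with alpha divided by the number of tests. This is an assumption about how one might post-process the 40 p-values, not something Mplus applies automatically; `bonferroni_reject` is a hypothetical helper and the p-values are invented.

```python
def bonferroni_reject(p_values, alpha=0.05):
    """Flag which tests stay significant after a Bonferroni adjustment."""
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]

# Hypothetical p-values from four separate a_k = b_k tests
p_values = [0.001, 0.020, 0.049, 0.300]
flags = bonferroni_reject(p_values)
print(flags)
```

With 40 tests and alpha = 0.05, the per-test threshold under this rule would be 0.05 / 40 = 0.00125.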

Carillon J Skrzynski posted on Tuesday, August 22, 2017 - 6:56 am

Hello,

I'm trying to do structural invariance modeling with a model that contains an XWITH command (I would like to see whether my interaction involving two latent continuous variables on an observed count outcome is invariant across sex). It appears that I need to use the KNOWNCLASS / TYPE=MIXTURE commands to do this, but from my understanding, that requires a latent categorical variable which I do not have in my model. Any assistance in working through this would be heavily appreciated.

Thank you,
Cari

Linda K. Muthen posted on Tuesday, August 22, 2017 - 10:56 am

When all classes are known, the KNOWNCLASS option is like the GROUPING option. This is just how it is implemented in Mplus. You do not need to have a categorical latent variable. Specify the CLASSES and KNOWNCLASS options as shown in Example 7.21 in the user's guide. Use TYPE=MIXTURE and set up the MODEL command as shown in the example.

Carillon J Skrzynski posted on Friday, August 25, 2017 - 7:21 am

Thank you, Linda. Unfortunately, I'm trying to follow along with the example but also tailor it for my specific needs with the interaction, and I'm not certain that I'm doing this correctly as I'm running into errors. I've added the separate models (overall, c, and cg) with the means of all my useable variables allowed to vary across the classes of c, and their variances allowed to vary across the classes of cg and then I've specified my model (the latent variables, covariates, and interaction) as well as constraining factor means to zero and not allowing the factor loadings or intercepts to be constrained equal as I've seen done for other configural models. Any further assistance would continue to be greatly appreciated. I can also send output if that would be easier / you're willing to look at it.

Linda K. Muthen posted on Friday, August 25, 2017 - 7:25 am

Please send the output and your license number to support@statmodel.com.

Nagwan zahry posted on Tuesday, August 29, 2017 - 7:30 am

Hi Linda,

Would you please look at my models and let me know whether they are nested or non-nested? I need to compare which message leads to a better model.

(Personally, I think the models are non nested)

I test the effect of four messages (i.e., four models) on consumers' emotions, risk perceptions, benefit perceptions, and willingness to buy.

Each model has the same variables and the same relationships between variables.

My variables are emotions, risk perceptions, benefit perceptions, company reputation, and buying behavior

Each model depicts the following relationships:

Message-->positive emotions which lead to high benefit, lower risk, high reputation.

Benefit decreases risk which leads to higher willingness to buy

reputation will increase benefit which leads to higher willingness to buy

reputation will decrease risk will lead to higher willingness to buy

Thank you in advance for your help

Noga
Risk willnot ----->

Bengt O. Muthen posted on Tuesday, August 29, 2017 - 5:06 pm

These general modeling questions are better suited for SEMNET.

Lieke ten Brummelhuis posted on Monday, September 18, 2017 - 4:44 pm

Dear Dr. Muthen,

I have a multilevel model with men and women nested in couples and measurements for 5 consecutive days.

I structured the data as a two-level dyad model. The husband and wife each have their own daily predictor variables and one common daily outcome variable.

I have a mediation model and I'd like to test if the indirect effect is significantly different between men and women. I have used constraints to test if the first part of the indirect effect (x -> m) differs significantly between men and women (see syntax below), and this is the case. If I use the constraint approach in the indirect model, however, this does not work.

Is there another way to test if the indirect effects differ significantly between groups?

Thank you in advance for your help.

Lieke

USEVARIABLES ARE
MJDE wJDE mJRC wJRC
mnesgiv wnesgiv cnFQ;
WITHIN are Mjde mjrc wjde wjrc;
CLUSTER IS resp;

ANALYSIS: TYPE IS TWOLEVEL;
MODEL:
%WITHIN%
cnFQ on mnesgiv wnesgiv;
mnesgiv ON MJDE (1);
mnesgiv ON mJRC (2);
wnesgiv ON wJDE (1);
wnesgiv ON wJRC (2);
mnesgiv with wnesgiv;
%BETWEEN%
cnFQ; mnesgiv; wnesgiv;
OUTPUT: sampstat STAND modindices;

model indirect:
cnFQ IND mnesgiv mjde;
cnFQ IND mnesgiv mjrc;
cnFQ IND wnesgiv wjde;
cnFQ IND wnesgiv wjrc;

Bengt O. Muthen posted on Tuesday, September 19, 2017 - 3:33 pm

You can use Model Constraint to do any test you want. For instance, instead of saying (1)
(2)
(1)
(2)
you can say
(p1)
(p2)
(p3)
(p4)

And then in Model Constraint say

New (diff1 diff2);
diff1 = p1-p3;
diff2 = p2-p4;

The diffs will tell if the differences are significant or not. You can also express the indirect effects in Model Constraint.

If this doesn't help, send the problematic output to Support along with your license number.

Lieke ten Brummelhuis posted on Thursday, September 21, 2017 - 11:26 am

Hi Bengt,

Thanks for this suggestion!
The model Constraint (and differences) works for the first part of the indirect effect, but not for the indirect effect.

The error I get is:

*** ERROR in MODEL INDIRECT command
IND statements with X and X* values or the mediator value are not allowed with
TYPE=TWOLEVEL.

I will send the model and error message in the output file(below) to support as well.
Thanks,
Lieke

ANALYSIS: TYPE IS TWOLEVEL;
MODEL:
%WITHIN%
cnFQ on mnesgiv wnesgiv;
mnesgiv ON MJDE;
mnesgiv ON mJRC;
wnesgiv ON wJDE;
wnesgiv ON wJRC;
mnesgiv with wnesgiv;
%BETWEEN%
cnFQ; mnesgiv; wnesgiv;
OUTPUT: sampstat STAND modindices;

MODEL INDIRECT:
cnFQ IND mnesgiv MJDE (p1);
cnFQ IND mnesgiv MJRC (p2);
cnFQ IND wnesgiv WJDE (p3);
cnFQ IND wnesgiv WJRC (p4);

MODEL CONSTRAINT:
New (diff1 diff2);
diff1 = p1-p3;
diff2 = p2-p4;

*** ERROR in MODEL INDIRECT command
IND statements with X and X* values or the mediator value are not allowed with
TYPE=TWOLEVEL.

Bengt O. Muthen posted on Thursday, September 21, 2017 - 4:21 pm

The parameter labels don't go in the Model Indirect command but in the Model command. In Model Constraint you can create anything you like such as "a*b" for an indirect effect - or two of them for which you can then express a difference.

Orpha de Lenne posted on Tuesday, September 26, 2017 - 5:20 am

Dear Dr. Muthen,

I want to test for invariance of common regression paths across four countries.

When I conduct the SEM model in general, I get a good model fit. However, when I conduct the model but specify the groups by using the "GROUPING =" command, I suddenly get a poor model fit.

What can cause this poor model fit and how do I solve this?

Thank you in advance for your help.

Bengt O. Muthen posted on Tuesday, September 26, 2017 - 6:00 pm

When you say "model in general" I wonder if you mean a single-group analysis of all four countries together. What you want to do as a first step is an analysis of each country by itself and see if you have good fit in all of those analyses.

Julie Lewis posted on Monday, October 30, 2017 - 9:32 am

I would like to duplicate these analyses using Mplus. The analysis consists of 2 IVs, 1 mediator (intervening variable), and 1 DV. I would like to run analyses on four models:

Model 1: IV1 + IV2-->Mediator-->DV
Model 2: IV1 + IV2-->Mediator-->DV with a direct path from IV1-->DV
Model 3: IV1 + IV2-->Mediator-->DV with a direct path from IV2-->DV
Model 4: IV1 + IV2-->Mediator-->DV with direct paths from IV1-->DV and IV2-->DV (saturated model)

Question: In example 5.12 for SEM, will the "model indirect" command capture the "direct paths" in my model? I don't see any examples with direct effects in SEM.

Bengt O. Muthen posted on Monday, October 30, 2017 - 4:22 pm

Drop the mentioning of the mediator, so in ex 5.12 terms, say

f4 IND f1;

Sara De Bruyn posted on Friday, December 01, 2017 - 4:19 am

Dear Dr. Muthen,

I am doing multiple group SEM in which I compare 3 groups. I have a pathway which is significant for 2 groups and not significant for the other group. I did a difftest to test whether the pathways of the 3 groups significantly differ. No differences were found.

I have two questions:
a. Does it make sense to test the difference between the significant pathway of 1 group and the non-significant pathway of the other group? Or should both pathways be significant?
b. If a. makes sense, how should I interpret the results? In my case there wasn't a significant difference between the two groups (where one path was significant and the other not), but I find it hard to understand why a non-significant pathway and a significant pathway don't differ.

Thank you in advance for your help.

Bengt O. Muthen posted on Friday, December 01, 2017 - 2:51 pm

a.
Q1: Yes
Q2: No

b.
It makes sense; take the following example with 2 groups. Grp 1 has estimate 0.5 (say) and Grp 2 has estimate 1.0. Grp 2's estimate is significant but Grp 1's estimate is not. The difference is not significant. Assuming the same SEs, this happens because 1.0 is larger than the difference of 0.5.

Nina Sommerland posted on Monday, December 04, 2017 - 4:36 am

Dear Dr. Muthen,

I am, as Sara De Bruyn above, interested in comparing pathways between groups. I am trying to understand your answer to question 1.b, esp. "Assuming the same SEs, this happens because 1.0 is larger than the difference of 0.5".

Wouldn't this then be the case for any greater number since the largest values will always be larger than the difference from a smaller value? Is there perhaps another way you could explain it?

Bengt O. Muthen posted on Monday, December 04, 2017 - 2:57 pm

It depends on the SEs as well. So for two parameter estimates p1 and p2 you have:

SE(p1-p2) = sqrt[V(p2)+V(p1)-2Cov(p2,p1)]

where the Vs and Cov are found in Tech3.
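
A small numeric sketch of this formula follows (Python rather than Mplus; `se_diff` is a hypothetical helper, and the common SE of 0.4 and the zero covariance are assumed values chosen to match the 0.5 versus 1.0 example earlier in the thread).

```python
import math

def se_diff(var1, var2, cov12=0.0):
    """SE(p1 - p2) = sqrt(V(p1) + V(p2) - 2*Cov(p1, p2))."""
    return math.sqrt(var1 + var2 - 2.0 * cov12)

p1, p2 = 0.5, 1.0               # group estimates from the example
se = 0.4                        # assumed common SE, so V = 0.16 in each group
z1, z2 = p1 / se, p2 / se       # z for each estimate on its own
se_d = se_diff(se**2, se**2)    # covariance assumed to be 0
z_d = (p2 - p1) / se_d          # z for the difference
print(round(z1, 2), round(z2, 2), round(z_d, 2))
```

Under these assumed values only Grp 2's own z exceeds 1.96, while the z for the difference does not, because the SE of the difference is larger than either individual SE. A positive covariance between the two estimates would shrink SE(p1-p2).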

Nina Sommerland posted on Wednesday, December 13, 2017 - 5:26 am

Thank you for your reply. So, I have a model with three groups where some coefficients are significant in certain groups but not in others. However, when I compare whether the coefficients differ significantly between groups through releasing individual paths, no differences are significant. Does this mean that there is no point in dividing the sample into the groups, even if I get significant effects in some groups but not in others?

Thank you for your help.

Bengt O. Muthen posted on Wednesday, December 13, 2017 - 2:06 pm

This question is suitable for SEMNET.

You may also consult my 1989 Psychometrika paper on our website.

Pete Parkers posted on Wednesday, December 13, 2017 - 3:33 pm

Dear MPlus-Team

I want to conduct a multi-group ESEM (testing for invariance). Unfortunately for one group I only have the covariance matrix and not the raw data.
I thought about generating data with the Kaiser-Dickman algorithm (Kaiser, H. F. and Dickman, K. (1962). Sample and population score matrices and sample correlation matrices from an arbitrary population correlation matrix. Psychometrika, 27(2), 179-182. doi:10.1007/BF02289635).
My question is whether I would get a useful result out of this, at least for configural and weak (loadings) measurement invariance.
Any help would be much appreciated.

Best Pete

Pete Parkers posted on Wednesday, December 13, 2017 - 3:51 pm

A quick follow-up question:
From your point of view - is there a problem with different group sizes say - 12.000 to 1.000

Once again thank you very much in advance!

Bengt O. Muthen posted on Wednesday, December 13, 2017 - 4:20 pm

I would not recommend trying something like this. But you may want to ask on SEMNET.

Chenqq posted on Tuesday, February 06, 2018 - 5:02 am

I have a question about a MG Bifactor-ESEM I am working on. How should I write the syntax to get measurement invariance tests (configural, metric, scalar, strict) for a MG Bifactor-ESEM?
For example: three groups (g1, g2, g3), factor F1 including y1-y5, factor F2 including y6-y14, factor F3 including y15-y19.
Thanks a billion!

Bengt O. Muthen posted on Tuesday, February 06, 2018 - 3:35 pm

Answered elsewhere.

Chenqq posted on Tuesday, February 06, 2018 - 11:30 pm

Thank you for your guidance, Bengt O. Muthen. I will try to find them.

aprile benner posted on Thursday, March 08, 2018 - 10:12 am

If I am interested in comparing a just-identified model to a nested model in which a set of coefficients within the model is constrained to be equal, is it acceptable to use the traditional chi-square difference test?

Bengt O. Muthen posted on Thursday, March 08, 2018 - 11:29 am

Yes, but that should be the same as the regular H0 chi-square that gets printed.

Ali O. Ilhan posted on Wednesday, April 04, 2018 - 4:29 am

Dear all,

We are trying to run a multi-group SEM model. Our grouping variable has two levels (1 for low and 2 for high), and although we do not have any missing values, we got the following warning message:

*** WARNING
Data set contains unknown or missing values for GROUPING,
PATTERN, COHORT, CLUSTER and/or STRATIFICATION variables.
Number of cases with unknown or missing values: 61
1 WARNING(S) FOUND IN THE INPUT INSTRUCTIONS

We double-checked the data in other software, and indeed there is no problem with the grouping variable (also, when we ask Mplus for the descriptive statistics it does not find any missing data and displays the correct descriptives).

Any help is much appreciated,

Ali O Ilhan

Bengt O. Muthen posted on Wednesday, April 04, 2018 - 3:45 pm

We need to see your full output - send to Support along with your license number.

Megan Ames posted on Monday, April 30, 2018 - 12:46 am

I am doing structural equation modelling with latent variables. I have a multilevel model with data obtained from supervisors and subordinates.
The model has supervisor attitude -> supervisor satisfaction -> 3 dimensions of employee level attitudes -> LMX -> employee satisfaction -> 2 outcome variables

I have read as much as possible but I don't understand how I can operationalize this model using MPlus.
Please help!

Bengt O. Muthen posted on Monday, April 30, 2018 - 6:10 pm

The keywords BY and ON will take care of this:

For your latent variables you use

f BY indicators

and for the -> you use ON.

Then combine that with a multilevel UG example such as 9.6.

Megan Ames posted on Tuesday, May 01, 2018 - 12:47 am

Thank you for your guidance. I will look into this and hopefully operationalize the model.

Ting Dai posted on Thursday, May 17, 2018 - 8:48 pm

Dear Drs. Muthen:

I was trying to set up the mplus syntax so that my 7-group CFA model (1 factor with 7 items) has the following constraints:
1. the loadings of the first 4 items are equal across groups (the last 3 freely estimated across groups).
2. the intercepts of the first 4 items are equal across groups (the last 3 freely estimated across groups).
3. the error variances of the first 4 items are equal across groups (the last 3 freely estimated across groups).
4. I do not want to estimate the latent means.

I was able to achieve #1 and #4, but not #2 or #3.
It seems that the intercepts are set to be equal across groups by default, and the error variances are set to be freely estimated also by default. I am not quite sure why.

Is there a way to at least accomplish my goal #3, and if possible also #2?

Thanks in advance!

Bengt O. Muthen posted on Friday, May 18, 2018 - 1:35 pm

All of this can be easily done. See UG ex 5.15 for an example. The default multiple-group model is scalar invariance. To deviate from this, parameters not to be held equal across groups are simply mentioned in group-specific Model statements.

Samantha-Kaye Johnston posted on Monday, May 28, 2018 - 12:36 pm

Hello,

I am running a multi-group SEM. In one group, the data is non-normal, so this means I use the MLR as the estimator. The data for the other group is normal so this means the ML estimator would be the correct selection as the estimation method (based on my reading).

Against this background, how would I approach estimation in a multi-group context? I reason that I would need to specify the type of estimation for each group. If so, how would I go about doing this?

Thank you

Bengt O. Muthen posted on Monday, May 28, 2018 - 5:42 pm

Q1-2: You can't and don't need to do that. Just use MLR - it is fine also with normal data.

Samantha-Kaye Johnston posted on Wednesday, May 30, 2018 - 8:27 am

Thank you so much for clarifying this Bengt.

Samantha-Kaye Johnston posted on Wednesday, May 30, 2018 - 8:33 am

1. As a follow-up, since MLR is used, would I then use the Satorra-Bentler scaled method to test for invariance?

2. I am using a multi-group approach that includes a mixture of continuous observed (X) and latent variables (M and Y). Most multi-group analyses focus only on CFA and do not include mediation. In an earlier forum post (in 2015), another user suggested (a) establishing measurement invariance, including configural/baseline model (Model 1), metric invariance (Model 2), scalar invariance (Model 3) and error invariance (Model 4) first for the latent variables (M and Y), (b) fitting the mediation model for both groups (Model 5), then (c) establishing structural invariance, including factor variance-covariance invariance (Model 6) and factor mean invariance (Model 7). This approach was said to be acceptable by Bengt. Is this approach still acceptable? Or, to your knowledge, have there been any developments in the field since this initial post (I have yet to find any relevant resources to guide this decision)?

Bengt O. Muthen posted on Wednesday, May 30, 2018 - 2:35 pm

1. Right.

2. Still a good approach.

Samantha-Kaye Johnston posted on Wednesday, May 30, 2018 - 3:28 pm

Hi Bengt

Thank you very much for your quick replies. I have one final question to complete my analysis.

Following from confirmation that my above-mentioned approach is acceptable, and assuming that Models 1 to 4 have confirmed measurement, scalar and error invariance, would I compare chi-squares and dfs between Model 4 (error invariance) and Model 6 (factor variance-covariance invariance), given that Model 5 is just a check to determine if there is a good fit to the data when the observed variables are added? Or would I compare chi-square and dfs between Model 4 (error invariance) and Model 5 (fit model adding observed variables with no constraints)?

Bengt O. Muthen posted on Friday, June 01, 2018 - 3:07 pm

There are many ways to approach this - SEMNET will have opinions.

Nicole Tuitt posted on Thursday, July 12, 2018 - 4:55 pm

Hello,

I am new to Bayesian analyses. I would like to conduct a Bayesian SEM multi-group analysis, but I'm not really sure how to do this. I found an article on measurement invariance, but how do I test for structural invariance? Are there any articles or resources that you could please share?

Thanks

Nicole

Bengt O. Muthen posted on Friday, July 13, 2018 - 2:00 pm

Model Constraint can do this. Use parameter labels in the Model command and then define a difference between a measurement parameter in two groups. E.g. for the first loading in groups 1 and 2:

diff1 = lambda11 - lambda12;

This will give you a posterior distribution for diff1, that is, the estimate and its CI.

Jennifer Dealy posted on Sunday, July 15, 2018 - 5:00 pm

Dear Dr. Muthen,
I have two questions: 1) Is the following syntax correct to look at differences in model fit and parameter significance for men and women 2) Would I need to run subsequent analyses specifying m=w and m1 and w1? Thank you.

grouping are PGen(1=M 2=W);

model:
WEMWBSTot ON SingleNM SingleP SingleD CGenD;
BCSSMTot ON WEMWBSTot SingleNM SingleP SingleD CGenD;
DTS_Total COPEBeDis COPEAct DERSTot ON WEMWBSTot SingleNM SingleP SingleD CGenD;
COPEAct on BCSSMTot;
COPEBeDis on BCSSMTot;

Model indirect:
COPEAct ind WEMWBSTot;
COPEBeDis ind WEMWBSTot;
model m:
WEMWBSTot ON SingleNM SingleP CGenD(a1);
DTS_Total COPEBeDis COPEAct DERSTot ON WEMWBSTot SingleNM SingleP CGenD (b1);
BCSSMTot ON SingleNM SingleP CGenD (c1);
BCSSMTot ON WEMWBSTot (d1);
COPEAct ON BCSSMTot (e1);
COPEBeDis ON BCSSMTot (f1);
model w:
WEMWBSTot ON SingleNM SingleP CGenD(a2);
DTS_Total COPEBeDis COPEAct DERSTot ON WEMWBSTot SingleNM SingleP CGenD (b2);
BCSSMTot ON SingleNM SingleP CGenD (c2);
BCSSMTot ON WEMWBSTot (d2);
COPEAct ON BCSSMTot (e2);
COPEBeDis ON BCSSMTot (f2);

MODEL constraint:
new(m w m1 w1);
m=d1*e1;
w=d2*e2;
m1=d1*f1;
w1= d2* f2;

output:sampstat cinterval standardized TECH1 tech4;

Bengt O. Muthen posted on Monday, July 16, 2018 - 2:35 pm

There are at least 3 approaches of interest:

(1) You can use Model Constraint to test equality of each product, saying

new(m w m1 w1 diff diff1);
m=d1*e1;
w=d2*e2;
diff = w-m;
m1=d1*f1;
w1= d2* f2;
diff1 = w1-m1;

The regular model fit section will not be affected by this.

(2) You can use Model Test to test both of them being equal:

0 = d2*e2-d1*e1;
0 = d2*f2-d1*f1;

The regular model fit section will not be affected by this.

(3) You can use Model Constraint to constrain them to be equal:

d2*e2=d1*e1;
d2*f2=d1*f1;

The regular model fit section WILL be affected by this.
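
For approach (1), standard errors of products such as d1*e1 are commonly obtained with the first-order delta method. The sketch below assumes that approach (Python, outside Mplus; `delta_se_product` is a hypothetical helper and all estimates and SEs are invented for illustration).

```python
import math

def delta_se_product(a, b, var_a, var_b, cov_ab=0.0):
    """First-order delta-method SE of the product a*b."""
    var_ab = b**2 * var_a + a**2 * var_b + 2.0 * a * b * cov_ab
    return math.sqrt(var_ab)

# Hypothetical estimates for one group: d1 = 0.40 (SE 0.10), e1 = 0.50 (SE 0.20)
d1, e1 = 0.40, 0.50
ind = d1 * e1
se_ind = delta_se_product(d1, e1, 0.10**2, 0.20**2)
print(f"indirect = {ind:.3f}, SE = {se_ind:.4f}")
```

The same logic extends to a difference such as w - m: the SE of the difference combines the variances of the two products and their covariance.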

Jennifer Dealy posted on Monday, July 16, 2018 - 4:08 pm

Thank you for your quick reply. Is my above syntax equivalent to the following syntax for evaluating gender differences in model fit without constraints? My colleague said that this is the typical MPLUS syntax that would still allow me to run bootstrap analyses:

Usevariables are
BCSSMTot WEMWBSTot DTS_Total
DERSTot COPEBeDis COPEAct
DMar CGenD MGen;
Grouping is MGen(1=F 2=M);
MISSING ARE all (-99);

ANALYSIS:
Bootstrap = 10000;

model:
WEMWBSTot ON CGenD DMar;
DTS_Total COPEBeDis COPEAct DERSTot ON WEMWBSTot;
BCSSMTot ON WEMWBSTot;
COPEAct ON BCSSMTot;
COPEBeDis ON BCSSMTot;

Model indirect:
COPEAct ind WEMWBSTot;
COPEBeDis ind WEMWBSTot;

output:
sampstat cinterval standardized TECH1 tech4 CINTERVAL(BCBOOTSTRAP) ;

Bengt O. Muthen posted on Wednesday, July 18, 2018 - 6:44 am

Q1: Yes

Q2: You can do bootstrapping with any of these setups.

Samantha-Kaye Johnston posted on Friday, July 20, 2018 - 5:36 am

Dear Dr Muthen,

I am running a simple mediation model (one mediator) for three groups. My predictor variables are observed variables, but my mediator and dependent variables are latent constructs. So, in the first instance, I run a measurement model analysis on the latent constructs. The issue however is that the two constructs are not significantly correlated for two of the groups, but significant for one group.

Moreover, when I do test the influence of the IV (i.e., observed variables) in a structural model, separately for each of the three groups, again the indirect, direct and total effects are non-significant for two groups, but there are some significant pathways for the remaining group.

Given the absence of significant effects in two of the three groups for the relationships I am looking at, is there much point in running a multiple group analysis? Would it be acceptable to just report single group analysis for each of the three groups?

My understanding of multiple group analysis is that you do this type of analysis if you want to make group comparisons across significant pathways.

Thank you.

Bengt O. Muthen posted on Friday, July 20, 2018 - 10:48 am

An analysis with multiple groups benefits from a specification of measurement invariance if that fits well because then the groups borrow information from each other - this then reduces SEs. You need measurement invariance to support the notion that the constructs are (measured) the same. If there is no measurement invariance, you should analyze the groups separately.

Samantha-Kaye Johnston posted on Tuesday, August 28, 2018 - 10:54 pm

Hi Bengt,

I am having trouble with figuring out if multi-group analysis can be achieved. I want to compare reading scores across two groups, but for one of the groups (group 2), the reading construct does not contain the same number of indicators as group 1.

The reason for this is that the indicator loads well for group 1, but not for group 2. So, I decided to remove that indicator for group 2, but retain it for group 1 (since that seems to be a better representation of the construct).

Given this difference, is there any way that I can still use a multi-group analysis? Or is this enough grounds to say that there is potentially no measurement invariance and as such, the analysis has to be done separately for each group?

Thank you.

Bengt O. Muthen posted on Wednesday, August 29, 2018 - 1:18 pm

You can keep the item and let its loading and intercept be different across the groups.

Jinxin ZHU posted on Wednesday, September 05, 2018 - 8:51 pm

I am now doing a multigroup random intercept two-level path analysis, with 18 groups.
I would like to summarize the results (regression coefficients and total effects).

I planned to take the mean for each coefficient (and total effect) across 18 groups to present the final result.

Does taking the mean for each coefficient across groups make sense, and can Mplus help to do this? I am not quite sure how to calculate the joint SE by myself.

Thank you very much for your time and consideration.

Bengt O. Muthen posted on Thursday, September 06, 2018 - 3:06 pm

If the 18 groups are what constitute your clusters, the mean of each random effect is given in the output for the between level. But maybe I am misunderstanding your question.

Jinxin ZHU posted on Thursday, September 06, 2018 - 5:44 pm

Dear Dr. Muthen,

Thank you very much for your prompt reply, and sorry for not expressing myself clearly in the previous message. The clusters were schools, and counties constituted the grouping variable.

Bengt O. Muthen posted on Friday, September 07, 2018 - 2:19 pm

I see, so groups of clusters. In your multiple-group (18-group) run you give parameter labels in the Model command for the between-level means of the random effects and then you use Model Constraint to express the average over those 18 means in terms of the labels - this gives you the SE of the average as well.

Jinxin ZHU posted on Sunday, September 09, 2018 - 6:35 pm

Thank you very much!

Is it possible that I calculate the joint SE myself using Excel, based on the parameters and SE reported by Mplus?

If yes, may I know where I can find the formula to compute the joint SE myself?

Jinxin ZHU posted on Sunday, September 09, 2018 - 7:21 pm

Sorry. I planned to use the method you suggested (labeling the parameter for each group) but found that I need to label 18 times for each parameter. Is there any way that I can do the labeling very fast, such as using "#_"?

Indeed, I am running a multiple-group path analysis:

Achi on PTRaschR CURSUPP EMOSUPP ESCS ST004D01T;
EMOSUPP on PTRaschR ESCS ST004D01T;
CURSUPP on PTRaschR ESCS ST004D01T;

I would also like to compute the averaged total effect across groups. Any suggestions for doing this quickly?

Thank you so much!

Bengt O. Muthen posted on Monday, September 10, 2018 - 2:41 pm

It is awkward to do the joint SE yourself because you need all the covariances in TECH3, not only the variances (which are the SEs squared).
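
For reference, the standard result for the variance of an average of K estimates shows why the covariances are needed. The notation is introduced here for illustration only: $\hat\theta_k$ is the estimate in group $k$ and $\bar\theta = \frac{1}{K}\sum_{k}\hat\theta_k$ is their average.

```latex
\operatorname{Var}(\bar\theta)
  = \frac{1}{K^{2}}\left[\,\sum_{k=1}^{K}\operatorname{Var}(\hat\theta_k)
  + 2\sum_{j<k}\operatorname{Cov}(\hat\theta_j,\hat\theta_k)\right],
\qquad
\operatorname{SE}(\bar\theta)=\sqrt{\operatorname{Var}(\bar\theta)}
```

Using only the squared SEs amounts to setting every covariance term to zero, which is justified only when the estimates are uncorrelated.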

Labeling can be done like this in the different groups:

y on x1-x10 (p1-p10);

Also, see labeling ideas using DO in the UG and also in our Topic 10 Short course.
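
A minimal, untested sketch of how an averaged total effect could be built from such labels, assuming two groups instead of 18, a single-level run (clustering ignored for brevity), and a trimmed covariate list. The file name, the grouping variable name, the group labels g1 and g2, and all parameter labels are hypothetical.

```mplus
TITLE:    Sketch: total effect of ptraschr on achi, averaged over groups
DATA:     FILE = pisa.dat;                    ! hypothetical file name
VARIABLE: NAMES = county achi ptraschr cursupp emosupp escs;
          USEVARIABLES = achi ptraschr cursupp emosupp escs;
          GROUPING = county (1 = g1 2 = g2);  ! hypothetical codes
MODEL:
  achi ON ptraschr cursupp emosupp escs;
  emosupp ON ptraschr escs;
  cursupp ON ptraschr escs;
MODEL g1:
  achi ON ptraschr (c1)      ! direct effect
          cursupp  (bc1)
          emosupp  (be1);
  cursupp ON ptraschr (ac1);
  emosupp ON ptraschr (ae1);
MODEL g2:
  achi ON ptraschr (c2)
          cursupp  (bc2)
          emosupp  (be2);
  cursupp ON ptraschr (ac2);
  emosupp ON ptraschr (ae2);
MODEL CONSTRAINT:
  NEW(tot1 tot2 totavg);
  tot1 = c1 + ac1*bc1 + ae1*be1;   ! direct + two indirect paths
  tot2 = c2 + ac2*bc2 + ae2*be2;
  totavg = (tot1 + tot2)/2;        ! reported with its own SE
```

Each label sits on its own line because, as noted earlier in this thread, a label in parentheses applies to all parameters on its line.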

Wim Beyers posted on Wednesday, November 07, 2018 - 6:17 am

After reading all above, I tried to run a multigroup CFA with 1 factor and 5 indicators across four groups; free version of model, with intercepts free from second group onwards, and latent means fixed at zero:

MODEL:
int BY i1-i5;
MODEL GROUP 2:
int BY i1-i5;
[i1-i5];
[int@0];
MODEL GROUP 3:
int BY i1-i5;
[i1-i5];
[int@0];
MODEL GROUP 4:
int BY i1-i5;
[i1-i5];
[int@0];

But always same identification error: THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL. PROBLEM INVOLVING THE FOLLOWING PARAMETER: Parameter 31, Group 2: INT

So it is the variance of the latent variable in the second group...

What am I doing wrong?

Bengt O. Muthen posted on Wednesday, November 07, 2018 - 5:32 pm

Your first statement in the overall part of the model:

MODEL:
int BY i1-i5;

fixes the first loading to set the metric of the factor. But when you say

MODEL GROUP 2:
int BY i1-i5;

you free all these loadings so there is no metric setting. The idea is that such a Model-specific statement is meant to alter what's specified in the overall part. So instead say

MODEL GROUP 2:
int BY i2-i5;

which maintains the fixing of the i1 loading at 1.
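
Applying the same change to every group-specific part, the whole input might look like the untested sketch below. The file name, the grouping variable g, and the group labels g1-g4 are hypothetical stand-ins for whatever the GROUPING option defines in the actual run.

```mplus
TITLE:    Sketch: free loadings and intercepts in groups 2-4
DATA:     FILE = items.dat;                   ! hypothetical file name
VARIABLE: NAMES = g i1-i5;
          USEVARIABLES = i1-i5;
          GROUPING = g (1 = g1 2 = g2 3 = g3 4 = g4);   ! hypothetical codes
MODEL:
  int BY i1-i5;      ! i1 loading fixed at 1 to set the metric
MODEL g2:
  int BY i2-i5;      ! frees i2-i5 only; i1 stays fixed at 1
  [i1-i5];           ! intercepts free in this group
  [int@0];           ! factor mean fixed at zero
MODEL g3:
  int BY i2-i5;
  [i1-i5];
  [int@0];
MODEL g4:
  int BY i2-i5;
  [i1-i5];
  [int@0];
```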

Mengting Li posted on Thursday, February 07, 2019 - 12:18 pm

Hi, I am doing multi-group analyses and testing the difference between an unconstrained model, in which parameters are free to vary across groups, and a constrained model, in which the parameters are held equal across groups. The results indicated a worse chi-square and CFI but a better RMSEA for the constrained model. Is that possible? Is the constrained model worse or better in that case?
Many thanks!

Bengt O. Muthen posted on Thursday, February 07, 2019 - 5:13 pm

Answered elsewhere.

asma kashif posted on Monday, March 25, 2019 - 10:33 pm

Dear Dr. Bengt

I am trying to estimate the impact of participation in a cash transfer program on women's empowerment, a latent variable that I am measuring with two factors, F1 and F2, based on decision making in various domains. I have two groups, treatment and control, and they are not randomly assigned. Most of the observable characteristics are not significantly different between the groups.
I am using a MIMIC model as provided in Example 5.16 to see the impact of the program, but I am unable to understand how to report the average treatment effect. My input is given below. Is the model I am using valid for this kind of analysis? Your help is much appreciated.

USEVARIABLES ARE jd5 jd6 jd8 jd10 m2 gatt2 gatt3 gatt4 gatt5;
GROUPING IS treat (0 = Control 1 = Treat);
WEIGHT IS weight;

CATEGORICAL ARE jd5 jd6 jd8 jd10 m2 ;

analysis:
estimator = wlsmv;

MODEL:
f1 BY jd5 jd6 jd8 jd10;
f2 BY m2 ;
f1 f2 on gatt2 gatt3 gatt4 gatt5;
MODEL Treat:
f1 BY jd5 jd6 jd8 jd10;
[jd5$1];
{jd5@1};
[jd6$1];
{jd6@1};
[jd8$1];
{jd8@1};
[jd10$1];
{jd10@1};

f2 BY m2 ;
[m2$1];
{m2@1};

Bengt O. Muthen posted on Tuesday, March 26, 2019 - 5:31 pm

Instead of your multiple-group analysis, I would recommend handling the treatment variable as a dummy x variable in a single-group analysis.

Also, don't put a factor behind a single, categorical indicator - this is better handled by Mplus.

asma kashif posted on Thursday, April 11, 2019 - 8:21 pm

Thanks a lot for your feedback, sir.

But I am a bit confused about how to report my results and justify that the difference in coefficients is due to program participation. I can show that the groups are balanced overall on the exogenous variables. Also, how do I introduce my control variables in this model?

Regards

Bengt O. Muthen posted on Friday, April 12, 2019 - 2:00 pm

You may want to ask this sort of question on SEMNET.

Michelle Jongenelis posted on Tuesday, May 21, 2019 - 6:50 pm

Hello,

I am attempting to test gender invariance in a two-factor single order model. The overall model (without separating by groups) terminates normally. However I am now receiving error messages for the sub groups when assessing the configural model. I have checked the residual variance for the offending observed variable and it is not negative. I have checked the THETA matrix in TECH1 and cannot spot correlations greater than 1. Any advice would be much appreciated!

Thanks!

Bengt O. Muthen posted on Wednesday, May 22, 2019 - 3:52 pm

We need to see your full output - send to Support along with your license number.

Ryan Veal posted on Tuesday, May 28, 2019 - 9:48 pm

Hello,

Is it possible to run a multisample SEM across different samples using only correlation matrices (no raw data) as input data?

That is, basically testing a model across different groups to see if there are any significant differences in the factor loadings or correlations between factors.

I have the summary data in correlation matrices from several different samples only (no raw data), and i was hoping to use these matrices as groups to compare using the same model.

Thank you.

Bengt O. Muthen posted on Wednesday, May 29, 2019 - 4:18 pm

This question is suitable for SEMNET.

Zhang Rui posted on Tuesday, September 03, 2019 - 5:46 am

Hi, professor,
I am trying multi-group analysis to test a moderated mediation model. The moderator is a dichotomous variable (0 versus 1). First, we fit an unconstrained model, in which all paths were allowed to vary freely between groups. Second, we tested a constrained model, in which all the paths in the original mediation model were constrained to be equal between groups. However, both the unconstrained and the constrained model were saturated, i.e., chi-square = 0, df = 0. So how can I compare the unconstrained and constrained models? Thanks.

Bengt O. Muthen posted on Tuesday, September 03, 2019 - 6:52 am

The 2-group run with equality across groups is not saturated; if you want help with it, send its output to Support along with your license number.
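
As an illustration only, here is an untested sketch of a two-group mediation run in which the three paths are held equal across groups by putting labels in the overall MODEL command. The variable names x, m, y, the grouping variable modr, the group labels, and the file name are hypothetical and are not taken from the poster's analysis.

```mplus
TITLE:    Sketch: mediation paths held equal across two groups
DATA:     FILE = med.dat;                     ! hypothetical file name
VARIABLE: NAMES = modr x m y;
          USEVARIABLES = x m y;
          GROUPING = modr (0 = low 1 = high); ! hypothetical codes
MODEL:
  m ON x (a);        ! a label in the overall part holds the path
  y ON m (b);        ! equal across groups
  y ON x (c);
```

With the three slopes held equal and the intercepts and residual variances left free in each group, this model should have 3 degrees of freedom rather than 0. Dropping the labels gives the unconstrained counterpart for comparison.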

Joel Fishbein posted on Wednesday, September 11, 2019 - 5:23 am

I am analyzing bifactor structural equation models (B-ESEM) to analyze data from a novel self-report measure. I have arrived at a final model and want to conduct measurement invariance testing with the 3 groups I administered the measure to.

The data indicate that configural invariance is supported for my B-ESEM but full metric invariance is not. I would like to run partial metric invariance analyses. How would I do this in Mplus?

Tihomir Asparouhov posted on Wednesday, September 11, 2019 - 1:52 pm

You can use the EWC (ESEM-within-CFA) method described here:

http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.310.3475&rep=rep1&type=pdf

Zhang Rui posted on Sunday, September 15, 2019 - 7:43 pm

Thank you for your reply, professor. I solved the problem, but another problem has arisen. When I compared the fully constrained model with the unconstrained model using the chi-square difference test, there was no significant difference. Next, I allowed one path at a time to be freely estimated while constraining all other paths to be equal. This showed a highly significant difference for one path according to the Wald statistic. So my question is whether this result is possible: the former comparison was not significant, while the latter was significant. Thank you.

Bengt O. Muthen posted on Monday, September 16, 2019 - 4:12 pm

Send the 2 outputs to Support along with your license number.

matthew finster posted on Friday, September 27, 2019 - 5:04 pm

Using the convenience features from the Version 7.1 Mplus Language Addendum, I am testing a configural and a scalar model with binary variables using weighted least squares estimation and the Delta parameterization. If I test the chi-square using the two-step process with the DIFFTEST command and obtain a non-significant chi-square difference test (p = .49), indicating that the more restrictive model (scalar [H0] versus configural [H1]) does not significantly worsen the model fit, can I conclude that there is metric and scalar invariance?

Bengt O. Muthen posted on Saturday, September 28, 2019 - 12:36 pm

Yes.
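
For readers unfamiliar with the two-step procedure mentioned in the question, an untested sketch follows. It assumes six binary indicators u1-u6, one factor, and two groups; the file names, variable names, and group labels are hypothetical. The two runs are separate input files, shown together here only for comparison.

```mplus
! ---- Run 1: less restrictive (configural, H1) model ----
DATA:     FILE = binary.dat;                  ! hypothetical file name
VARIABLE: NAMES = g u1-u6;
          USEVARIABLES = u1-u6;
          CATEGORICAL = u1-u6;
          GROUPING = g (1 = g1 2 = g2);       ! hypothetical codes
ANALYSIS: ESTIMATOR = WLSMV;
          PARAMETERIZATION = DELTA;
          MODEL = CONFIGURAL;
MODEL:    f BY u1-u6;
SAVEDATA: DIFFTEST = deriv.dat;               ! saves derivatives for step 2

! ---- Run 2: more restrictive (scalar, H0) model ----
DATA:     FILE = binary.dat;
VARIABLE: NAMES = g u1-u6;
          USEVARIABLES = u1-u6;
          CATEGORICAL = u1-u6;
          GROUPING = g (1 = g1 2 = g2);
ANALYSIS: ESTIMATOR = WLSMV;
          PARAMETERIZATION = DELTA;
          MODEL = SCALAR;
          DIFFTEST = deriv.dat;               ! reads the file saved in run 1
MODEL:    f BY u1-u6;
```

The chi-square difference test is printed in the output of the second run.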

matthew finster posted on Monday, September 30, 2019 - 10:06 am

Thank you. Given that the invariance testing did not proceed in the typical fashion (e.g., including configural, metric, and scalar), some of my colleagues raised concerns about whether we could also conclude that there is metric invariance.

Kelly Edyburn posted on Monday, November 18, 2019 - 10:05 am

Hello. In the project that I'm working on, we're examining measurement invariance and using the MLMV estimator. I know that the resulting chi-square value in the output cannot be used to compare nested models and DIFFTEST is indicated. However, the DIFFTEST command does not yield the actual chi-square value, only the difference between nested models and significance level. I'm hoping to obtain the correct chi-square value so I can use it to hand calculate the CFI using an alternative null model. Can the same scaling correction be used with MLMV as can be used with MLM/MLR/WLSM? Or is there another way to obtain the correct chi-square value from the MLMV DIFFTEST output? Thank you in advance!

Tihomir Asparouhov posted on Monday, November 18, 2019 - 6:22 pm

The DIFFTEST command does yield the actual chi-square value, and not the difference between nested models, and the significance level that is reported corresponds to that chi-square value.

Kelly Edyburn posted on Tuesday, November 19, 2019 - 1:51 pm

Ah, okay! Sorry, I misunderstood that. Thank you for clarifying!

Michelle Keck posted on Thursday, February 20, 2020 - 12:48 pm

Hello,

I am trying to run a Monte Carlo simulation for a multigroup path analysis (grouping is gender). I have two questions:

1. How would I enter the grouping variable in the Monte Carlo syntax?

2. For the actual analysis, would I be able to get the graphs that visualize the slope comparison across the grouping variable? If yes, how?

Bengt O. Muthen posted on Thursday, February 20, 2020 - 4:18 pm

1. See multi-group examples in the UG. Each UG example has a monte carlo counterpart on our website.

2. Only if you do generate one data set and analyze it as if it were real.

Michelle Keck posted on Thursday, February 20, 2020 - 4:43 pm

Dr. Muthen,

Thank you for your answer! I hope you can provide more insight.

1. I was able to find the syntax; thank you!

2. I ran a MG path analysis, but cannot find the syntax to ask for a graph (slope comparison across the grouping variable). I can only find commands that generate scatterplots. Is this possible in Mplus or would I have to use another program to generate this graph?

Bengt O. Muthen posted on Thursday, February 20, 2020 - 4:53 pm

Look up

LOOP
PLOT

in the UG. You can plot anything that you define in Model Constraint. For instance, in our RMA book examples that we show on our website, you can look at the Table 1.8, part 3 example where you have

model constraint:
loop(x,-1,1,0.1);
plot(tx0 tx1);
tx0 = b0 + b2*x;
tx1 = b0 +b1 + (b2+b3)*x;
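
Adapted to a two-group path model, the same idea might look like the untested sketch below, where each group's intercept and slope are labeled and the two fitted lines are plotted over a range of x values. The variable names, group labels, parameter labels, and file name are hypothetical, and the sketch assumes that LOOP and PLOT in MODEL CONSTRAINT need the PLOT command with TYPE = PLOT2.

```mplus
TITLE:    Sketch: plotting group-specific regression lines
DATA:     FILE = paths.dat;                   ! hypothetical file name
VARIABLE: NAMES = gender x y;
          USEVARIABLES = x y;
          GROUPING = gender (0 = male 1 = female);   ! hypothetical codes
MODEL:
  y ON x;
MODEL male:
  [y] (a0);          ! intercept in the first group
  y ON x (b0);       ! slope in the first group
MODEL female:
  [y] (a1);
  y ON x (b1);
MODEL CONSTRAINT:
  LOOP(xval, -1, 1, 0.1);        ! x values from -1 to 1 in steps of 0.1
  PLOT(ymale yfemale);
  ymale = a0 + b0*xval;          ! fitted line for the first group
  yfemale = a1 + b1*xval;        ! fitted line for the second group
PLOT:     TYPE = PLOT2;
```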

Georg Henning posted on Tuesday, April 07, 2020 - 6:45 am

Dear Prof. Muthén,
I set up a multigroup model with four groups. In the first step, I just wanted to look at the freely estimated means and variances of my outcome variable.

So I just did:

grouping is group (0=g1 1=g2 2=g3 3=g4);

model:

[outcome];

Although all other fit indices are of course perfect, for some reason the CFI is 0.000. Did I do something wrong? Thank you very much!

Tihomir Asparouhov posted on Tuesday, April 07, 2020 - 5:14 pm

It is sort of an artifact when the model is the unrestricted model. See formula (127)
http://statmodel.com/download/techappen.pdf

CFI (as well as the other fit indices) is meant to be used in the context of comparing a structural model with the unrestricted model, i.e., it is not something that applies in your situation.

Georg Henning posted on Tuesday, April 07, 2020 - 11:08 pm

Thank you very much! If I restrict the model so that every group has the same intercept, the CFI is still 0.000, while the other indices change. Is this still the same phenomenon?

Bengt O. Muthen posted on Wednesday, April 08, 2020 - 4:19 pm

The CFI becomes relevant only when you model more than one variable because it compares to a model with uncorrelated variables.

Ahmad posted on Tuesday, May 05, 2020 - 9:15 am

Dear Prof. Muthen

Please confirm that the following modification of the Table 1.11 syntax from your RMA book is correct for testing the difference in the regression coefficients (Bs) with latent variables:

MODEL:
agg5 BY y1-y5;
agg1 BY x1-x8;
agg5 ON agg1;
agg5 (1);

MODEL control:
agg5 ON agg1 (tx0);

MODEL intervention:
agg5 ON agg1 (tx1);

MODEL CONSTRAINT:
NEW(Diff);
Diff = tx1 - tx0;

Bengt O. Muthen posted on Wednesday, May 06, 2020 - 2:15 pm

This is correct.

Ahmad posted on Thursday, May 07, 2020 - 2:15 pm

Thank you, Prof. Muthen, for the response to the query above regarding the modified Table 1.11 syntax. Please further clarify how to extend the multigroup analysis to include moderation by an additional variable measured by four items. Using the XWITH command gave the error message: "ALGORITHM=INTEGRATION is not available for multiple group analysis. Try using the KNOWNCLASS option for TYPE=MIXTURE." Following that suggestion and developing syntax along the lines of UG Ex. 7.28 resulted in: "To declare interaction variables, TYPE = RANDOM must be specified in the ANALYSIS command."

Ahmad posted on Thursday, May 07, 2020 - 2:22 pm

Modifying the syntax to use TYPE=MIXTURE RANDOM solved the problem. Sorry for the unnecessary post.
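
For readers who hit the same pair of error messages, an untested sketch of what such a KNOWNCLASS setup could look like is given below. It is not the poster's actual input: the file name, the group variable tx, the moderator items m1-m4, the factor name modf, and the interaction name inter are all hypothetical.

```mplus
TITLE:    Sketch: latent interaction with known groups
DATA:     FILE = agg.dat;                     ! hypothetical file name
VARIABLE: NAMES = tx y1-y5 x1-x8 m1-m4;
          USEVARIABLES = y1-y5 x1-x8 m1-m4;
          CLASSES = cg (2);
          KNOWNCLASS = cg (tx = 0 tx = 1);    ! observed groups as classes
ANALYSIS: TYPE = MIXTURE RANDOM;
          ALGORITHM = INTEGRATION;
MODEL:
  %OVERALL%
  agg5 BY y1-y5;
  agg1 BY x1-x8;
  modf BY m1-m4;               ! moderator measured by four items
  inter | agg1 XWITH modf;     ! latent interaction term
  agg5 ON agg1 modf inter;
  %cg#1%
  agg5 ON agg1 (tx0);          ! slope in the control group
  %cg#2%
  agg5 ON agg1 (tx1);          ! slope in the intervention group
MODEL CONSTRAINT:
  NEW(diff);
  diff = tx1 - tx0;
```

In this setup the group-specific statements move from MODEL control and MODEL intervention to the class-specific parts %cg#1% and %cg#2%.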