Extremely high path coefficient scores PreviousNext
Mplus Discussion > Structural Equation Modeling >
 yao lu posted on Tuesday, March 13, 2012 - 1:57 pm
Hi Dr. Muthen,

I have a structural regression model reads as:

fcom on ccom;
ftrust on ctrust;
ftrust on fcom;
fatt on catt;
fatt on fcom;
fatt on ftrust;
ctrust on ccom;
catt on ccom ctrust;
rec pur on fatt;

All the variables above are latent. ANALYSIS: Estimator = MLR;

The SEM solutions indicated a very good model fit. The results show that the coefficients of ftrust on fcom, rec on fatt, and pur on fatt, were all very high, all exceeding .85.

Although significant coefficient is desired, I hardly seen such high coefficients in other literature. I wonder should I be cautions that those high coefficients might indicate some technique errors I made in SEM? Model specification is theoretically sound.

Thank you very much for your help in advance.
 Linda K. Muthen posted on Wednesday, March 14, 2012 - 2:36 pm
You may have multicollinearity among you latent covariates. Ask for TECH4 in the OUTPUT command to see if this is the case.
 yao lu posted on Thursday, March 15, 2012 - 12:25 pm
Dr. Muthen,

Thank you. Yes, multicollinearity was the case. There are no theoretical reasons for me to combine or delete any of highly correlated variables.

Are there anyways to deal with this problem in Mplus? I tried centering all the observed indicators for all the latent variables in my model, however, the results were the same? My analysis is MLR.

Thank you very much in advance.
 Bengt O. Muthen posted on Thursday, March 15, 2012 - 4:12 pm
There is no really good way around multicollinearity.

If the collinearity is among factors, you may want to investigate why they are so highly correlated, perhaps the fixed zero cross-loadings aren't all exactly zero, in which case inflated factor correlations appear.
 Maarten Pinxten posted on Tuesday, July 17, 2012 - 8:47 am
Dear Professor Muthen,

I'm running a longitudinal model with 4 measurement points and 4 latent constructs

I tested my final model on the total sample (N=5000) with the results for the following paths (Standardized XY solution)

Meff_G4 -> Maff_G5 .09 (est.) .05 (S.E.) .079 (p-value)

Meff_G4 -> Mcom_G5 .02 (est.) .04 (S.E.)
.67 (p-value)

When I try to run a multigroup analysis for boys and girls (50%-50%) I get the following (again XY standardized) results:


Meff_G4 -> Maff_G5 .00 (est.) .06 (S.E.) .997 (p-value)

Meff_G4 -> Mcom_G5 -.06 (est.) .05 (S.E.)
.24 (p-value)


Meff_G4 -> Maff_G5 .74 (est.) .04 (S.E.) .000 (p-value)

Meff_G4 -> Mcom_G5 .50 (est.) .03 (S.E.)
.000 (p-value)

How is it possible that the standardized path coefficients for boys are so extremely high (even higher than path coefficients relating the same construct over time). The model terminated normally but I guess something 'computational' is going on but I can't figure out what it is. I do not expect the paths to be the same for boys and girls but this difference is just too large.

Thank you!
 Linda K. Muthen posted on Tuesday, July 17, 2012 - 10:59 am
Please send the output and your license number to support@statmodel.com.
 Elizabeth Barrett-Cheetham posted on Sunday, April 07, 2013 - 3:37 am
Dear Linda and Bengt,

I am currently trying to assess whether multicollinearity is a problem in my model.

In a nutshell, I have 3 emotions that affect 2 types of motivation. This motivation, in turn, influences 2 types of well-being. The emotions and the 2 types of motivation are both observed variables; 1 of the well-being variables is observed and the other is a latent variable.

The estimator is MLM.

The model is:
OtherFocusedWell-Being by mastery growth self-acceptance purpose;

CommunionMotivation on gratitude compassion pride;

AgencyMotivation on gratitude compassion pride;

OtherFocusedWell-Being on AgencyMotivation CommunionMotivation;

SelfFocusedWell-Being on AgencyMotivation CommunionMotivation;

I have searched the mplus user guide and mplus discussions, but canít seem to find a solution. What would be the best way to see if multicollinearity is a problem?

Thank you,
 Linda K. Muthen posted on Monday, April 08, 2013 - 11:51 am
Multicollinearity is caused by high correlations so I would look at the correlations among the variables. I don't know of any cutoff for how high constitutes a problem. You might want to ask this on a general discussion forum like SEMNET.
 Aurelie Lange posted on Monday, June 01, 2020 - 1:46 am
Dear dr Muthen,

I am analysing fairly simple multilevel models, each having the following structure but with varying outcomes:
UO2 on predictor1 UO1;
UO2 on predictor2 UO1;

Some of my results have very high s.e.. For example an estimate of -2.5 and s.e. of 306 on the between level. If I inspect tech4 I do encounter a high correlation between between UO2 and predictor2 (r > .80).

1. Would this indicate problematic multicollinearity?
2. If so, would it be appropriate to exclude UO1 from the between level?

For one of the other outcomes using this model, I find a correlation of > .80 for predictor2 and UO2. Als the predictor is the variable of interest, I cannot simply delete this variable from the model.

3. Is there some other solution to take care of the high s.e.?

Thank you so much!

Kind regards,
 Bengt O. Muthen posted on Tuesday, June 02, 2020 - 3:14 pm
How about following Table 5.11 of the Raudenbush & Bryk (2002) book and group-mean center UO1 on within and use its group mean version on between. Both can be obtained using Define.
Back to top
Add Your Message Here
Username: Posting Information:
This is a private posting area. Only registered users and moderators may post messages here.
Options: Enable HTML code in message
Automatically activate URLs in message