The current version of Mplus does not provide these residuals. You can use the DEFINE command to create them.
Adam Blake posted on Tuesday, March 30, 2010 - 10:31 am
I had a followup question involving calculating predicted values for latent variables, so as to calculate the predicted values and residuals of one my endogenous variables. You can see a simplified copy of my model below.
MODEL: pvig by sd ph; cswf on n p k s cswm on pvig cswf
I have been able to calculate predicted values for endogenous variables (such as cswf) with only non-latent variable predictors from the unstandardized coefficients and the intercepts provided by the m-plus output. Am am unsure how to calculate the predicted values for an endogenous variable (such as cswm) that is dependent upon latent variables (such as pvig). To calculate these predicted values I would first need to calculate the predicted values of the latent variables, but I am unsure how to derive these values from coefficients and intercepts used to calculate the predicted values for indicators (such as sd or ph). Any help or advice would be appreciated.
You can get the mean and variance of the pvig factor from TECH4. You can use the mean, the mean plus one standard deviation, and the mean minus one standard deviation to get predicted values.
Adam Blake posted on Wednesday, March 31, 2010 - 11:18 am
I not sure I follow you here. I am unsure how I would calculate predicted values from simply the mean and the variance. Perhaps I was unclear in my question. I was interested in calculating predicted and residual values for individual observations not for the variables themselves.
If you want individual values, you would need to use the factor scores as the covariate. With only two indicators, I doubt that the factor scores would have acceptable factor score determinacy because that part of the model is not identified.
Adam Blake posted on Thursday, April 01, 2010 - 10:02 am
So my best bet if I want individual predicted/residual values would be to eliminate the latent variables from the model allowing me to calculate these values based on the unstandardized coefficients and intercept values given in the output? This should not impact my model to a great degree as I only have one latent variable and my indicators had fairly high reliability.
That sounds correct if you don't want to use factor scores.
Adam Blake posted on Thursday, April 01, 2010 - 1:52 pm
I had another look through the manual after I knew that I was looking to output factor scores. I was then able to output both the factor scores and the factor score determinacy. The determinacy for the latent variable pvig (from the above model) was 0.942, which based on my limited understanding of factor score determinacy seems acceptable. Given its acceptability, I should then be able to use these factor scores to generate individual values for cswm without a problem?
It is surprising that would have such a high factor score determinacy with only 2 indicators, but it is probably because the factor score estimation also draws on information from the other observed variables in the model, including covariates and the DV. The fact that the DV contributes to the factor score may make the prediction questionable. It would perhaps be better to use the factor score based on the 2 factor indicators and covariates. Unfortunately you cannot base the factor score on the factor indicators alone since that part of the model is not identified (you need at least 3 indicators). You could for instance fix the parameters at the values of the full model (including the DV) and estimate the factor scores pretending that the DV has missing values for everyone.
Adam Blake posted on Friday, April 02, 2010 - 10:42 pm
I was constraining the standardized parameters of the two indicators to be equal to ensure convergence so the measurement model is identified with the model constraints below. The factor score determinacy for the measurement model was 0.948, similar to that for the full model. Given this situation the factor scores from the limited model should then be acceptable?
I would think this is a mistake in the book. The definition of a residual is pretty clear.
chris pp posted on Sunday, April 08, 2012 - 4:51 pm
A question about standardized residuals for SEM: Some of the standardized residual mplus outputs are reported as 999.00, and looks like it is for an exogenous variable with one or two outcome variables (that are also set as predictors in the model). I imagine the 999 indicates that the residual was not calculated. Is this correct and if so why is it not relevant to calculate in this instance?
USEVARIABLES ARE edu cfit social selfr pa pop age sex;