Mplus Discussion >> Residuals in WLSMV

Topics
Last Day
Last 3 Days
Last Week
Tree View

Edit Profile


Residuals in WLSMV

Mplus Discussion > Categorical Data Modeling >

Message/Author

Thomas A. Schmitt posted on Monday, March 21, 2011 - 1:11 pm

Hello,

I'm doing an ESEM model using WLSMV and was wondering if the residual correlations from the "Residuals for Covariances/Correlations/Residual Correlations" matrix in Mplus can be interpreted in a meaningful way. Can these be interpreted as correlations and used to diagnose areas of misfit? Also, is there any way to get standardized residuals (z-scores) with WLSMV? Thank you,

Tom

Bengt O. Muthen posted on Monday, March 21, 2011 - 5:08 pm

They are not correlations. In principle one could get z scores for them, but it may not be easy to do by hand.

Perhaps a better approach is to use Modification indices which point more directly to the part of the model that needs improvement. MIs are available with ESEM.

Doug posted on Monday, November 14, 2011 - 10:52 am

I found invariance of the loadings/thresholds but non-invariance of the residual variances. I identified the residuals of two items that were non-invariant (partial invariance). In researching on how to interpret these findings I found several articles, for continuous data, which have found non-invariance of the uniqueness component (not residuals variances). In these articles they suggested non-invariance of uniqueness would mean that even though the instrument measured on the same scale units across groups i.e. equality of factor loadings, they did so with a different degree of reliability. They define reliability as:

loadings(squared)* factor variance/
loadings(squared)* factor variance + uniqueness.

How would this interpretation relate to non-invariance of residual variances in Mplus? I know from other postings that residual variance is viewed slightly different than uniqueness in Mplus, but would the same interpretation as above hold for residual variance and categorical data, i.e. that the items are measured with a different degree of reliability? Would the same formula hold, but just substitute residual variance for uniqueness?

Linda K. Muthen posted on Tuesday, November 15, 2011 - 9:37 am

This sounds correct.

Doug posted on Tuesday, November 15, 2011 - 4:12 pm

Thanks Linda. Do you think that the definition of reliability is correct based on how Mplus handles MGCFA with categorical data? In Mplus loadings and thresholds are constrained equal at the same time, and I am wondering how you would incorporate thresholds in the above formula.

Heike B. posted on Wednesday, November 16, 2011 - 4:01 am

I am struggling with the interpretation of the matrices from the RESIDUAL Output and the correlation matrix from the SAMPSTAT output.

My manifest path model contains two continous and two categorical mediator variables and two categorical final outcome variables, i.e. I have a model with categorical variables containing covariates. And I am using WLSMV.

From the user guide and previous postings here I supposed that

1. the non-diagonals of the SAMPSTAT correlation matrix contain sample probit residual correlations.

2. the diagonal of the SAMPSTAT correlation matrix contains variances for the continous variables.

3. the Model Estimated Covariances/Correlations/Residuals Correlation Matrix from the RESIDUAL output follows the same logic just that this matrix contains estimated values not sample values.

4. the Residuals for Covariances/Correlations/Residuals Correlation Matrix from the RESIDUAL output should contain the difference between the correlation matrix from the SAMPSTAT output and the Model Estimated Covariances/Correlations/Residuals Correlation Matrix from the RESIDUAL output. However in my output the differences do not match completely. They match if the element refers to categorical variables only or if they are the diagonal elements otherwise they don't.

What would be the right interpretation of all of these matrice's contents in my context?

Many thanks in advance.
Heike

Linda K. Muthen posted on Wednesday, November 16, 2011 - 2:13 pm

Heike:

1. These are residual tetrachoric or polychoric correlations in a model with covariates.

2. Yes.
3. Yes.
4. Please send the output and point to where you see differences.

Linda K. Muthen posted on Thursday, November 17, 2011 - 8:37 am

Doug:

Thresholds are not involved in reliability. With categorical outcomes, this reliability is for the latent response variable underlying the categorical outcome.

Doug posted on Friday, November 18, 2011 - 8:34 am

Thanks again. One last question. I found non-invariance of the loadings/thresholds and I identified the items that were contributing to the non-invariance. For some items, the thresholds are larger in group A than group B, and the corresponding factor loadings are smaller in group A than in group B. For two items, the opposite is true, the thresholds are larger in group B than group A, and the corresponding factor loadings are smaller in group B than in group A. Regardless of the group findings, the pattern seems to be that larger thresholds are associated with smaller factor loadings. Going through the Mplus articles I am trying to figure out what is the relationship between the loadings and thresholds in Mplus, in order to explain my findings. Can you provide some insight into the relationship between the loadings and thresholds in MPlus?

Bengt O. Muthen posted on Friday, November 18, 2011 - 6:21 pm

I don't think there necessarily is a relationship in general. A high threshold means that the item is difficult or, with attitude or behavioral items, extreme. Sometimes such items have large loadings, meaning that when your ability is high enough, the probability of getting it right goes up very quickly. On the other hand a difficult item may have a weak relationship with ability because the students weren't exposed to the content of the item, so high threshold and low loading.

Doug posted on Wednesday, November 23, 2011 - 11:31 am

Thanks Bengt. What did you mean by "students weren't exposed to the content of the item" in the above message? I am having a little difficulty following the example you have used in your answer. In my situation, I am using a social desirability responding measure. For 5 of the 40 items, the thresholds were higher and loadings were smaller in group A than in group B.

Bengt O. Muthen posted on Wednesday, November 23, 2011 - 2:03 pm

My example of a reason for a low loading concerned an item that is not strongly related to its factor. If the factor is ability and you have a low-ability student and a high-ability student, you won't see much of a difference in their performance if they haven't been exposed to the topic - this gives a low slope. That prototype can be translated to other situations.

H. R. posted on Tuesday, March 06, 2012 - 11:55 pm

I have a question regarding the Residual Correlation Matrices in the SAMPSTAT and RESIDUAL Outputs.

To my understanding the residual is the difference between the observed value and the estimated value. Hence there should be one residual per variable that is derrived from sample data and estimated data.

But why are there two different residual correlations? And how is the residual correlation calculated from sample data only in the SAMPSTAT section?

Thanks a lot.

Linda K. Muthen posted on Wednesday, March 07, 2012 - 9:10 am

The RESIDUAL output gives the difference between the observed and model estimated correlations. The residual correlations given with SAMPSTAT are part of the sample statistics used for model estimation of a conditional model using WLSMV. They are not the same thing.

H. R. posted on Wednesday, March 07, 2012 - 10:11 am

Thank you, Linda.

Just for my understanding: how is the residual correlation in the SAMSPSTAT section defined?

Linda K. Muthen posted on Wednesday, March 07, 2012 - 10:22 am

In a conditional model, each variable is regressed on the covariates to obtain the probit regression coefficients. Each pair of variables is regressed on the covariates to obtain the residual correlations.

H. R. posted on Wednesday, March 07, 2012 - 8:40 pm

Thank you, Linda.

Cindy Masaro posted on Wednesday, March 06, 2013 - 8:10 am

Hi Linda,

My question is about the Residual output for a CFA I am conducting using categorical data (WLSMV estimator). In the Mplus 7 user's guide the example given on pg 724-725 shows a heading for Standardized Residuals for Covariances/Correlations/Residual Correlations but I do not get this in my output. My questions are as follows:

1)I'm assuming the output under the heading Residuals for Covariance/Correlations/Residual Correlations is unstandardized. Is this correct?
2) Am I not getting this output because I am using WLSMV as an estimator and is it possible to get the standardized residuals?

3) What are the suggested unstandardized residual values (e.g., values >.10) to look for in determining areas of strain or to indicate that the model does not explain the corresponding sample correlation very well?

Linda K. Muthen posted on Wednesday, March 06, 2013 - 9:54 am

Please send the output and your license number to support@statmodel.com so I can see the situation you refer to.

Max posted on Tuesday, August 04, 2015 - 4:22 am

Dear Dr.s Muth�n and Muth�n,

I have a question very similar to question no.2 to the person above this post (Cindy Masaro). I am conducting a CFA with categorical data using the WLSMV estimator.

From what I have read on the discussion board, the residuals obtained from "OUTPUT: RESIDUALS" are not standardized and data requested from "OUTPUT: TECH10" is not available for WLSMV estimation.

Is there a way to obtain standardized residuals from Mplus in my case? Or is the only option to do it by hand?

Thank you in advance

Bengt O. Muthen posted on Tuesday, August 04, 2015 - 2:27 pm

Q1. No

Q2. That would be hard.

TECH10 for WLSMV is coming in Version 7.4.

I would recommend using Modindices to capture misfit.

Nicole Brocato posted on Sunday, May 01, 2016 - 3:26 pm

Drs. Muth�n:

I am having a problem similar to Heike�s (from above). I am using ESEM with WLSMV in Mplus version 7.4 to evaluate a measurement model.

The analyses are being conducted on a set of imputed files. The variables are in a mixture of ordinal and continuous formats.

Because TECH10 and MODINDICES are not available, I am trying to obtain correlation residuals to identify possible item-level misfit

Like Heike's, my correlation residuals in the Residuals for Covariances/Correlations/Residuals Correlation Matrix portion of the RESIDUAL output exactly match hand-calculated differences between the observed and model expected correlation values, but only when both items in a pair are categorical. When one item is categorical and the other is continuous, the residual values in the output do not match hand-calculated differences.

Do you know how I can interpret the residuals for the mixed categorical/continuous item pairs? Or how I might find the correlation residuals for these item pairs?

Many thanks.

Tihomir Asparouhov posted on Wednesday, May 04, 2016 - 10:44 am

The section "ESTIMATED SAMPLE STATISTICS" uses correlation scale while the section "ESTIMATED MODEL AND RESIDUALS (OBSERVED - ESTIMATED)" uses covariance scale.

[model estimated covariance] + [the residual for the covariance] = [sample correlation]*sqrt([sample variance for the continous])

Nicole Brocato posted on Thursday, May 05, 2016 - 11:08 am

Ah! That makes more sense now. Thanks for the clarification and the formula.

Based on the formula you provided above, could I calculate the model-estimated correlations for the item pairs with mixed continuous and categorical indicators as:

[model-estimated correlation] = [model-estimated covariance] / (sqrt[model-estimated variance of the continuous item])

Many thanks again.

Tihomir Asparouhov posted on Monday, May 09, 2016 - 2:35 pm

Yes

Marion Freyer posted on Thursday, June 02, 2016 - 3:43 am

Hello,

I made a CFA with two Factors an seven Items. Five Items are categorical and I used the default WLSMV.
The Diagram and Output shows me no Residual Variances for the categorical variables. Can you tell me why?

Thanks.

Linda K. Muthen posted on Thursday, June 02, 2016 - 8:35 am

Residual variances are not model parameters for categorical items. If you ask for STANDARDIZED in the OUTPUT command, you will obtain residual variances computed as remainders after model estimation. They are printed with R-square.