Mplus Discussion >> Threshold identification constraints & invariance

Topics
Last Day
Last 3 Days
Last Week
Tree View

Edit Profile


Threshold identification constraints ...

Mplus Discussion > Categorical Data Modeling >

Message/Author

Thomas Klausch posted on Wednesday, April 18, 2012 - 7:04 am

Dear discussion group,

I am following advice on threshold identification in Millsap's 2011 book wabout measurement invariance testing. In Mplus I am using ordinal CFA models with WLSMV.

Millsap suggested that one should first free all thresholds except those necessary for identification and test for loading equivalence. For identification, it is recommended to constrain the thresholds of a reference threshold of each indicator equal across groups (plus a second threshold on one of the indicators; this applies to the cluster loading structure with more than two cateogries).

However, I noticed that depending on the choice of the reference threshold, the model can change substantially: for example estimates of loadings are affected by choice of the reference threshold, alongside the result of the invariance test on loadings. That is, under one threshold invariance constraint I find invariant loadings, but under a second configuration invariance is rejected. Moreover, estimates of the free thresholds (e.g. their order) are also affected.

Now I wonder how I should best identify thresholds to set equal for identification or whether there are alternative identification parameterizations I could use. It seems very disturbing to me that models are so easily influencable by these paramter constraints.

--Thanks.

Thomas Klausch posted on Thursday, April 19, 2012 - 9:36 am

I should add that my main interest is understanding the differential use of categories across groups, whilst assuring equivalence of loadings. There is subtantial reason to believe that there is a lot of invariance, so a backward elimination procedure is not the ideal choice, I believe.

As an alternative parameterization I considered constraining the residual variances to 1 in all groups (Theta parameterization). Additionally the factor means are constrained to zero. Then all thresholds can be freed and are identified.

The interpretation of a plot of all thresholds is meaningful, after assuring loading invariance. It also avoids constraining a threshold equal on which there is bias. But I am wondering if this is a useful approach or if constraining error variances to 1 can have effects that I am not aware of. My understanding is that group differences in the residuals would then be absorbed by the loadings, so could be tested in the 'first' invariance testing step.

Bengt O. Muthen posted on Thursday, April 19, 2012 - 6:37 pm

Which pages of Millsap (2011) are you looking at?

Thomas Klausch posted on Friday, April 20, 2012 - 1:19 am

For the identification constraints: pages 127-131 (in particular the sections about the cluster and general structure with C>1)

For the forward testing procedure of invariance: pages 136-141

Bengt O. Muthen posted on Monday, April 23, 2012 - 4:48 pm

Just to let you know that I am thinking about an answer to your item invariance questions.

Let ask you how many items and how many groups you have.

Also, you said earlier that "there is a lot of invariance, so a backward elimination procedure is not the ideal choice". I thought the term "backward" meant that you start with the fully invariant model, which seems like the starting point closest to the model you believe in given your phrase "a lot of invariance".

Thomas Klausch posted on Tuesday, April 24, 2012 - 2:45 am

Thanks -- I mixed up the words invariance and bias/DIF here. So the sentence should read "there is a lot of item bias, so a backward elimination procedure is not the ideal choice".

Then starting with a backward procedure (fully invariant model) will mean having to modify a lot of parameters, if MI at all indicate the correct spots of misfit. In my tests with backward the options for modification quickly ran out or were implausible.

I think forward procedures might point rather to the sources of measurement bias. Perhaps the correct procedure is ongoing debate. But here I get the "choice of minimal identification constraints" problem.

In my analysis I am interested in various scales (one with 2 factors and six items, two with one factor and four items; always five categories). I always have four groups.

Bengt O. Muthen posted on Wednesday, April 25, 2012 - 8:55 am

Because you expect so much non-invariance in the thresholds, I wonder if it would be of any help to use a model with the "nu" intercepts in addition to the thresholds. That's an item-specific intercept. For instance, you could start with a model with invariant thresholds, but non-invariant intercepts. This means a rigid shift of all thresholds across groups for an item. That's different than the shift you get by different factor means across groups because it is an item-specific shift. Perhaps MIs are easier to work with in such a more flexible model as a starting point. Identification has to handled carefully, where a reference group has both the factor mean and all intercepts fixed at zero. Haven't tried this, so take the suggestion with a grain of salt. Because the nu intercepts don't exist in Mplus with categorical, the way you add them is to create perfectly measured factors behind the items.

Thomas Klausch posted on Friday, June 22, 2012 - 6:16 am

Hi Bengt,

In your post above you say that one can create intercepts with categorical by ceating "perfectly measured factors behind the items". Not sure how to do this...Can you check my example or give an own one please?

Say x1-x3 are categorical items:

f1 by item1-item3;
item1 by x1;
x1@0;
item2 by x2;
x2@0;
item3 by x3;
x3@0;

Additionally one should set all thresholds equal in some way, I suppose.

Thanks, Thomas

Bengt O. Muthen posted on Friday, June 22, 2012 - 9:05 am

With say 3 categorical items u1-u3, you say:

f by u1-u3; ! to define the regular factor
f1 by u1; f1@1;
f2 by u2; f2@1;
f3 by u3; f3@1;

This gives you a "nu" intercept for each of the 3 indicators. It will appear under the "alpha" (factor mean) heading.

Thomas Klausch posted on Friday, June 22, 2012 - 9:48 am

I have additionally constrained the covariances between all f to 0.

Should I impose any additional identification constraints? At the moment my model is not identified. I am looking at a multiple group model.

Linda K. Muthen posted on Friday, June 22, 2012 - 9:54 am

Please send the output and your license number to support@statmodel.com.

Tait Medina posted on Wednesday, November 20, 2013 - 10:14 am

Hi Thomas. I am wondering if you were able to successfully introduce a perfectly measured factor behind each y*? If yes, would you mind sharing your syntax here? Also, did you find such an approach useful?

Thank you,
Tait

Tait Medina posted on Friday, November 22, 2013 - 10:36 am

I am wondering if you can point me in the right direction. I am having difficulty understanding why residual variances can be freely estimated (in non-reference groups) only when thresholds are constrained to be equivalent across groups (in addition to setting the metric by either fixing the variance to one in all groups or fixing the first loading to one in all groups). I've read through Web Note 4, but am still having a hard time getting my head around this. Do you have another reading that might help me understand this.

Thank you

Bengt O. Muthen posted on Sunday, November 24, 2013 - 3:30 pm

I have written about this as follows, also giving references to a paper and a 2011 book by Millsap:

"It is of interest to understand if factor loading invariance can be tested
separately from threshold invariance. In the case of continuous outcomes,
invariance of factor loadings make it possible to identify and estimate factor
variance-covariances in the different groups, while intercept invariance is
necessary only for identifying and estimating factor means in the different
groups. This holds even if there is residual variance noninvariance because
the residual variances do not influence the conditional expectation function
of the outcome given the factors. In contrast, with binary outcomes, the
residual variances do influence the conditional expectation function, that
is, the item characteristic curve. In the binary case, a model with non-
invariant thresholds is not identified when allowing group-varying residual
variances. To see the indeterminacies, consider again multiplying all scale
factors by the same constant in a certain group. This change can be
absorbed in the factor variance as before and in the thresholds as seen in
(1.49) and (1.50). This implies that threshold invariance and factor loading
invariance cannot be separately tested in the binary case without further
restrictions, one case being residual variance invariance (see also Millsap
& Tien, 2004 and Millsap, 2011). Muthen and Asparouhov (2002) discuss
further identification and testing matters for multiple-group analysis and
show the equivalent issues for invariance across time in growth models.

In the polytomous case, each item has more than one threshold and the
identification status is different from the binary case. Millsap and Tien
(2004) and Millsap (2011) give identification rules for invariance restrictions
on model parameters. As these authors show, it is possible to identify non-
invariant factor loadings in conjunction with a minimal set of restrictions
on the thresholds, while at the same time allowing group-varying factor
means, factor variances, and residual variances."

Tait Medina posted on Monday, December 02, 2013 - 7:45 am

Dr. Muthen, this is very helpful and thank you for taking the time to address my question.

Kind Regards,
Tait

John D Peipert posted on Friday, December 22, 2017 - 8:32 pm

I am conducting measurement invariance tests on categorical data. For one model, I need to freely estimate (release constraints) on the thresholds and loadings in the code below.

I am unsure how to modify the code so that the thresholds and loadings are not constrained to be invariant. Thanks in advance for the help!

MODEL:
f1 BY y1* y2-y4(L1-L4);
f1@1;
[f1@0];
{x1-x4@1};
[y1-y4@0];
y1-y4@0;
[x1$1-x1$4*](T1-T4);
[x2$1-x2$4*](T5-T8);
[x3$1-x3$4*](T9-T12);
[x4$1-x4$4*](T13-T16);

MODEL black:

f1 BY y1* y2-y4 (L1-L4);
f1@1;
[f1@0];
{x1-x4*};
[y1-y4@0];
y1-y4@0;
[x1$1-x1$4*](T1-T4);
[x2$1-x2$4*](T5-T8);
[x3$1-x3$4*](T9-T12);
[x4$1-x4$4*](T13-T16);

Bengt O. Muthen posted on Saturday, December 23, 2017 - 2:42 pm

When you don't want invariance you drop the parameter labels.

It is not clear to me what the role of the x's is in your input.

John D Peipert posted on Saturday, December 23, 2017 - 6:44 pm

Thanks for your quick response!

I omitted definitions and code to reduce the size of the original message.

x's are observed indicators. y's are latent variables created so that intercepts could be manipulated. These are defined as:

y1 BY x1@1;

John D Peipert posted on Thursday, December 28, 2017 - 11:24 am

Thanks again for your quick response, Dr. Muthen. I have an additional follow-up question.

I attempted to remove invariance constraints on the thresholds by removing the labels on those parameters for both groups (black and white), but that model was not identified. I then removed the labels for just one group (black) and the thresholds were estimated, as I had hoped.

Even though it is the result I am seeking, I'm unsure why removing them from only one group leads both groups' thresholds being estimated. Can you explain why the labeling works that way?

Thanks in advance for your help!

Bengt O. Muthen posted on Thursday, December 28, 2017 - 12:55 pm

The UG describes how to handle invariance testing. In the V8 version on our website, the relevant pages are 544-545.