Missing by design / Categorical Data ... PreviousNext
Mplus Discussion > Categorical Data Modeling >
 Martin Brunner posted on Friday, November 07, 2003 - 9:04 am
I am analyzing data of a mathematics test consisting of 200 categorical items.
The 200 items were distributed across 10 booklets. About 200 pupils worked on
each booklet. The booklets were randomly assigned to the pupils (thus data
for several items are missing by design).
The booklets are linked pairwise by 5 items, e.g. 5 items appear
in booklet A and B, 5 different items appear in booklet B and C, etc.

As I learned from the Mplus users´ guide and previous discussions on this list
neither the pattern option of Mplus2 nor the "classical" multigroup approach
to missing data (Muthen, Kaplan, & Hollis, 1987) will work.

I also learned that Mplus3 may deal with missing by design and categorical data.
But unfortunately I cannot wait until the release of Mplus3.

Is there any possibility to establish measurement invariance across the booklets?

I am thinking of constraining thresholds and factor loadings of the linking items (e.g. for
booklet B) to the values for those linking items that I optained when analyzing the
item set of booklet A.

Any help is highly appreciated.


Muthen, B., Kaplan, D., & Hollis, M. (1987). On structural equation modeling with data that are not missing completely at random. Psychometrika, 52(3), 431-462.
 Linda K. Muthen posted on Sunday, November 09, 2003 - 4:19 pm
To deal with the designed missingness, you could treat this as a multiple group analysis where each test booklet is a group. If you have missing data within test booklet, you would have listwise deletion for each test booklet. You would then place appropriate equalities on the anchor items.
 Daniel posted on Monday, May 10, 2004 - 2:10 pm
Hi, is the modeling feature for LGM with missing data a pairwise deletion method? I am writing a methods section and am not sure how I should describe the missing data method for categorical dependent variables.
 Linda K. Muthen posted on Monday, May 10, 2004 - 2:59 pm
Are you using the weighted least squares estimator or the maximum likelihood estimator?
 Daniel posted on Monday, May 10, 2004 - 4:06 pm
Weighted least squares
 Linda K. Muthen posted on Monday, May 10, 2004 - 4:32 pm
WLS missing uses a three stage (just like WLS without missing) estimation that is pairwise deletion based. The method guarantees more than MCAR consistency but less than the full MAR consistency. The exact condition is called MAR-covariates, the estimates are consistent even if covariates influence the missing data patterns. For the full MAR consistency use the ML estimator.
 Aleksandra Holod posted on Friday, June 10, 2011 - 12:13 am
I am analyzing a model with latent constructs composed of some categorical items (which are indicator variables). Therefore, I am using TYPE=COMPLEX and the estimator is WLSMV. Is Mplus 6 conducting FIML to account for missing data when I run this type of model? If not, how is the model dealing with missing data?
 Linda K. Muthen posted on Friday, June 10, 2011 - 1:13 am
I'm not sure why you are using TYPE=COMPLEX. Categorical indicators do not require this.

Pairwise present is used.
Back to top
Add Your Message Here
Username: Posting Information:
This is a private posting area. Only registered users and moderators may post messages here.
Options: Enable HTML code in message
Automatically activate URLs in message