Error Message - Categorical variable ... PreviousNext
Mplus Discussion > Confirmatory Factor Analysis >
Message/Author
 Michael posted on Thursday, April 21, 2005 - 2:43 pm
I’ve been trying to run a CFA with binary outcomes. I keep getting an error message which reads, “Categorical variable S1 contains less than 2 categories.” I’ve double checked the data file and there are definitely observations that reflect both possible responses (true and false) along with some missing data. The proportion of responses is approximately 90% true and 10% false. Do you know what could account for this error message?

TITLE:
CFA multi item and scale test
DATA:
FILE IS G:\RCS Items (WOMEN)(04.15.05).dat;
FORMAT IS free;
TYPE IS individual;
VARIABLE:
NAMES ARE D1-D2 S1-S10 L1-L10 C1-C10 P1-P10
DN1-DN10 A1-A10 H1-H10;
USEVARIABLES ARE S1-S10 L1-L10 C1-C10 P1-P10 DN1-DN10 A1-A10 H1-H10;
CATEGORICAL ARE S1-S10 L1-L10 C1-C10 P1-P10 DN1-DN10 A1-A10 H1-H10;
MISSING ARE all (99);
ANALYSIS:
TYPE IS general;
ESTIMATOR = WLSMV;
MATRIX = covariance;
MODEL:
S BY S1@1 S2-S10;
L BY L1@1 L2-L10;
C BY C1@1 C2-C10;
P BY P1@1 P2-P10;
DN BY DN1@1 DN2-DN10;
A BY A1@1 A2-A10;
H BY H1@1 H2-H10;
OUTPUT:
standardized sampstat modindices (0) residual;
 Linda K. Muthen posted on Thursday, April 21, 2005 - 3:41 pm
If you are reading your data correctly, then it is most likely the case that after listwise deletion, some variables have only one category. Any observation with a missing value on one or more analysis variables is eliminated by the default of listwise deletion. If you cannot resolve this, please send the input/output, your data, and your license number to support@statmodel.com.

Note also that with categorical factor indicators, MATRIX = COVARIANCE is not allowed.
 Tamika Gilreath posted on Tuesday, August 26, 2008 - 1:25 pm
While trying to run a multi-group LCA model, I am receiving the following error.

*** ERROR
Categorical variable SMOKE contains less than 2 categories.

I have double checked the data and my smoke category contains more than two categories. I have also tried deleting the smoke variable and the error repeats with the next variable in the model. I would like to know if I am specifying the model correctly.

My input is as follows:

VARIABLE:
NAMES ARE

WEIGHT PSU STRATUM BMIPCT
PSU2 blckwht smoke alcohol1 alcohol2 pot
coke inhale hard;

USEVARIABLES ARE smoke alcohol1 alcohol2 pot
coke inhale hard;

Categorical ARE smoke alcohol1 alcohol2 pot coke inhale hard;

CLASSES = cg (2) c(4);
KNOWNCLASS = cg (blckwht = 0 blckwht=1);

WEIGHT IS WEIGHT;
CLUSTER IS PSU2;

Missing ARE all (999);

ANALYSIS:
type = mixture complex;
starts = 500 10;
iterations = 1000;

Model:
%overall%
c#1-c#3 ON cg#1;

OUTPUT: SAMP stand cint tech11;

Plot:
TYPE IS PLOT3;
SERIES IS smoke alcohol1 alcohol2 pot coke inhale hard(*);
 Linda K. Muthen posted on Tuesday, August 26, 2008 - 1:28 pm
Please send your input, data, output, and license number to support@statmodel.com.
 Martin H. posted on Thursday, September 02, 2010 - 8:31 am
My problem is quite similar.
I've been trying to run a CFA with binary outcome (Rasch model) with multi-matrix data.

The data looks like this
(example data) :
1330010415 1 1 1 1 1 . .
1330030125 1 1 1 0 1 . .
1330050102 1 1 0 1 0 . .
1330060304 1 0 0 0 1 . .
1340020211 1 0 0 0 1 . .
1330010417 3 . . 0 1 1 1
1330030127 3 . . 1 0 1 0
1330050104 3 . . 0 1 1 1
1330060306 3 . . 0 0 1 1
1340020213 3 . . 0 1 0 0


TITLE:
muma_rasch_test;

DATA:
file = "muma_rasch_test1.dat";
format = free;

VARIABLE:
names = idperson booklet Item1 Item2 Item3 Item4 Item5 Item6;
usevar = booklet Item1 Item2 Item3 Item4 Item5 Item6;
categorical = Item1 Item2 Item3 Item4 Item5 Item6;
missing = .;
grouping = booklet(1=booklet1, 3=booklet3);

MODEL:
Latent by Item1 Item2 Item3 Item4 Item5 Item6 (1);
Latent@1;

MODEL booklet1:
Latent by Item1 Item2 Item3 Item4 (1);
[Item1] (2); [Item2] (3); [Item3] (4); [Item4] (5);

MODEL booklet3:
Latent by
Item3 Item4 Item5 Item6 (1);
[Item3] (4); [Item4] (5); [Item5] (6); [Item6] (7);


*** ERROR
Categorical variable ITEM5 contains less than 2 categories.
 Linda K. Muthen posted on Thursday, September 02, 2010 - 9:21 am
Please send your input, data, output, and license number to support@statmodel.com. You may be reading the data incorrectly or subsetting is a way that one group has only one value for item5.
 Idrissi Othman posted on Tuesday, March 22, 2011 - 12:57 pm
Hi;

While trying to run a CFA model, I am receiving the following error.

*** ERROR
Categorical variable V2 contains less than 2 categories.
Thank you for your help
 Linda K. Muthen posted on Tuesday, March 22, 2011 - 5:21 pm
You may be subsetting the data such that v2 has the same value for everyone. If you can't figure it out, please send your input, data, output, and license number to support@statmodel.com.
 IYH Boon posted on Friday, June 01, 2012 - 1:54 pm
I'm trying to estimate a LCGA using some data that I simulated. The outcome is binary, but in some time points there is no variation across observations (i.e., everyone has a 0 or everyone has a 1). As a result, I'm getting the "Categorical variable contains less than 2 categories error." Is there an easy workaround for this problem?
 Linda K. Muthen posted on Friday, June 01, 2012 - 4:23 pm
Try adding VARIANCES=NOCHECK to the DATA command.
 IYH Boon posted on Monday, June 04, 2012 - 7:55 am
Thanks, Linda, but unfortunately that didn't do the trick. After adding VARIANCES=NOCHECK to the DATA command, I still get the same error: "Categorical variable V2 contains less than 2 categories."
 Linda K. Muthen posted on Monday, June 04, 2012 - 10:17 am
You can try the CATEGORICAL * option with ML but I don't think that will help. You may have to use only time points where you have variability.
 Sarah Moens posted on Thursday, December 06, 2012 - 2:17 am
I also had the same error message stating that the categorical variable contained less than 2 values.

However, I figured out that in my case this had to do with the way Mac saves txt-csv-dat... files. Mac uses other line breaks than e.g. Windows, \r vs. \r\n
Although this is not visible in the file itself, this is how it apparently is stored.

When you use a general text editor like TextWrangler, you can usually specify what type of line break to use: Mac (CR), Unix (LF), or Windows (CRLF). Choosing Windows (CRLF) fixed the error for me and allowed the file to be read properly.
 Susu Zhang posted on Friday, February 14, 2014 - 10:50 am
This happened to me, too, when I tried running LCA. And here is the output with the error message:
INPUT INSTRUCTIONS

TITLE: LCA for Puerto Rico
DATA: FILE IS PuertoRicoLCA.dat;
FORMAT IS (1F4,120F1);
VARIABLE:
NAMES ARE id cb1-cb55 cb56a cb56b cb56c cb56d cb56e cb56f cb56g
cb57-cb112;
USEVARIABLES ARE cb14 cb29 cb30 cb31 cb32 cb33 cb35
cb45 cb50 cb52 cb71 cb91 cb112 cb1 cb8 cb10
cb13 cb17 cb41 cb61 cb80 cb3 cb16 cb19 cb20
cb21 cb22 cb23 cb37 cb57 cb68 cb86 cb87
cb88 cb89 cb94 cb95 cb97 cb104;
CATEGORICAL ARE cb14 cb29 cb30 cb31 cb32 cb33 cb35
cb45 cb50 cb52 cb71 cb91 cb112 cb1 cb8 cb10
cb13 cb17 cb41 cb61 cb80 cb3 cb16 cb19 cb20
cb21 cb22 cb23 cb37 cb57 cb68 cb86 cb87
cb88 cb89 cb94 cb95 cb97 cb104;
CLASSES = c (3);
...
... (a few more lines)

*** ERROR
Categorical variable CB29 contains less than 2 categories.

I went back to the data set and checked variable CB29, and it does contain 2 categories.
Is there anything I can do to fix this?
 Linda K. Muthen posted on Friday, February 14, 2014 - 11:19 am
You should look at the data set that Mplus reads not the original data set. You may have blanks in the data set Mplus reads causing the data to be read incorrectly. If you can't see the problem, send the input, data, output, and your license number to support@statmodel.com.
Back to top
Add Your Message Here
Post:
Username: Posting Information:
This is a private posting area. Only registered users and moderators may post messages here.
Password:
Options: Enable HTML code in message
Automatically activate URLs in message
Action: