Mplus Discussion >> Ordered Logistic Regression

Topics
Last Day
Last 3 Days
Last Week
Tree View

Edit Profile


Ordered Logistic Regression

Mplus Discussion > Categorical Data Modeling >

Message/Author

Peter Mulhall posted on Saturday, November 09, 2002 - 4:52 am

I have conducted an ordered logistic regression with the following results:
RESULTS FOR LOGISTIC REGRESSION

Odds .95 C.I.
Estimates S.E. Est./S.E. Ratio Lower Upper
Thresholds
NEWUCAT2$1 -1.648 0.463
NEWUCAT2$2 0.320 0.449

Mplus VERSION 2.01 PAGE 2
PARTNER ORDERED REGRESSION

Slopes
TREATM -0.385 0.276 -1.392 0.681 0.396 1.170
GENDER -0.149 0.305 -0.488 0.862 0.474 1.567
UNITS1 -0.024 0.010 -2.334 0.977 0.958 0.996
PARTNER -0.372 0.118 -3.164 0.689 0.547 0.868
PARTINT 0.265 0.144 1.834 1.303 0.982 1.730

I have two questions. Firstly how can I use these results to plot the probability of category membership given a certain value of 'PARTNER'. Secondly is it possible to calculate a fit statistic for my model or, in terms of the outcome measure, the percentage of accurate classification.

Many thanks
Peter Mulhall

bmuthen posted on Saturday, November 09, 2002 - 6:23 am

For the first question, please see the Mplus User's Guide, Technical Appendix 1, bottom of page 342. You may also check the Agresti pages mentioned here. You want to compute probabilities as a function of Partner values, but because you have several other covariates in addition to Partner, you have to decide at which values of the other covariates you want to do this, e.g. for males, for a specific treatment, for average Units, etc.

For the second question, yes, this should be possible - please refer to the Hosmer-Lemeshow logistic regression book in the Reference section of the Mplus web site.

Peter Mulhall posted on Sunday, November 10, 2002 - 3:08 pm

Thank you for your prompt reply. I did look at the manual but for the benefit of someone with a phobia of formulae would you mind spelling out as linear equations how to calculate the probabilities for the three categories based on one variable (say PARTNER) and the values above.

Many,many thanks
Peter Mulhall

bmuthen posted on Monday, November 11, 2002 - 10:46 am

Although this goes a little beyond what we typically do, I think this is a phobia we can easily help with. Look at equation (27), page 342. Beta2 corresponds to your highest threshold which looks like it is 0.320 from your message above - except that you have to switch the sign, so Beta2 = -0.320 (Similarly, Beta1 is the negative of your lowest threshold -1.648). If you had only Partner among your predictors and got the slope coefficient -0.372, this would be the value to use for Beta in (27), multiplying by the Partner value (x). Using a hand calculator, (27) then gives you the probability for the highest category as a function of different Partner (x) values.

Note also that there are good introductory books on this. If Hosmer-Lemeshow is not suitable, Agresti has 2 books - one that is called An Introduction to Categorical Data Analysis.

Anonymous posted on Tuesday, November 12, 2002 - 6:49 am

Following this thread would one be right in saying then that the probability of of being in the middle category was ((p=1/x or 2/x)-(p=2/x)) and that the probability of being in the lowest category was 1-(p=1/x or 2/x)?

bmuthen posted on Tuesday, November 12, 2002 - 8:57 am

Yes.

Jennifer Power posted on Sunday, May 07, 2006 - 9:18 am

With an ordinal logistic regression, does MPLUS assume proportional odds for indepent variables? Is it possible to test this assumption in MPLUS as can be done in SAS?

Bengt O. Muthen posted on Sunday, May 07, 2006 - 9:23 am

Yes, a proportional odds model is used. There is no automatic test of the assumption. I think it should be possible to do such testing via the Mplus multinomial logistic regression for unordered categorical response, using Model test (Wald test) and Model constraint features (see the Version 4 User's Guide on the Mplus web site).

socrates posted on Tuesday, November 14, 2006 - 4:39 am

Dear Dr. Muth�n

In order to compare two sets of different predictors, I ran two ordinal logistic regression models with the same criterion (positive, neutral and negative) but different predictors. How can I now decide which set of predictors is the better one? To my knowledge, a likelihood ratio test is not appropriate because the models are not nested (they contain different predictors). However, can I rely on the BIC and if yes, is there a significance test to compare the two BIC values (like, e.g., the Lo-Mendell-Rubin test in GMM)?

Linda K. Muthen posted on Tuesday, November 14, 2006 - 7:18 am

Because your models are just-identified, I would look at significance of predictors and R-square. The regression literature may have other suggestions.

Rick Sawatzky posted on Tuesday, May 08, 2007 - 10:01 am

Hello Linda and Bengt,

I estimated a single factor model for ordinal observed variables (six ordinal categories) using ML:
CATEGORICAL ARE y1-y4;
MODEL: f by y1-y4;
[f@0];
My understanding is that the thresholds in this model are the cumulative log odds for y <= category j. I thought that the predicted probability for category 1 for y1 should then be exp(y1$1)/(1+exp(y1$1)), and for category 2 it should be exp(y1$2)/(1+exp(y1$2)) minus the predicted probability for category 1, and so on. However, this does not correspond to the estimated proportions in the residual output for the univariate distributions. Would you mind indicating what is wrong with my calculations and how to calculate the predicted probabilities from the estimated thresholds in a latent variable model?

Linda K. Muthen posted on Tuesday, May 08, 2007 - 10:20 am

Are you doing probit regression with weighted least squares or logistic regression with maximum likelihood?

Rick Sawatzky posted on Tuesday, May 08, 2007 - 10:25 am

I'm using logistic regression (estimator = ML).

Linda K. Muthen posted on Tuesday, May 08, 2007 - 3:31 pm

You won't be able to compute the probabilities by hand in this case because numerical integration over the factor is required.

Rick Sawatzky posted on Tuesday, May 08, 2007 - 5:29 pm

Oh, I see. Thanks for clarifying this. Am I still correct in stating that the thresholds are in principle representative of the cumulative log odds for each of the ordinal categories of Y1-Y4?

Linda K. Muthen posted on Tuesday, May 08, 2007 - 5:36 pm

That sounds plausible.

Selahadin Ibrahim posted on Friday, May 11, 2007 - 2:00 pm

I recently ran a path model which contained both a categorical dependent variable and two categorical mediator variables. I have now received some comments back from a journal asking me to run a model with logit coefficients instead of the default probit coefficients. My analysis includes a sampling weight and I have found that specifying method = MLR will run the model and give me a logit coefficients in the outcome (although I am not completely sure how this differs from method = ML). However, I am wanting to know how I should interpret these coefficients, in particular for the mediating categorical variables in the model. Can I assume the coefficient is the logit coefficient and the thresholds in the output are the intercepts for each of the different categories of the outcome? and that the odds are proportional? I hope that question makes sense.

Thanks

Linda K. Muthen posted on Friday, May 11, 2007 - 4:53 pm

The estimates from ML and MLR are the same. Only the standard errors and fit statistics differ. MLR has robust standard errors. All regression coefficients, for both final and mediating variables, are ogistic regression coefficients. A threshold is the negative value of an intercept. Yes, the odds are proportional.

Selahadin Ibrahim posted on Monday, May 14, 2007 - 6:38 am

Thanks very much for your reply to my previous question. I have one follow-up question. Why does MLR allow the weight option to be used and ML does not?

Thanks

Linda K. Muthen posted on Monday, May 14, 2007 - 8:02 am

Maximum likelihood estimation in general is not defined for weights for either parameter estimates or standard errors. The MLR standard error computations can include weights and the parameter estimates are with weights pseudo maximum likelihood.

Karen Offermans posted on Wednesday, July 14, 2010 - 6:54 am

I am a new user of Mplus and have some questions on how to run ordinal logistic regression models when also testing mediation or moderation. We have 4 categorical independent variables on the pubertal and psycho-social timing of adolescents and made 8 dummies (late or early timing)to include them in the analysis. Furthermore, there is one continuous independent variable in the analysis (alcohol specific rules set by parents).The outcome variable is Alcohol use defined by: non-drinkers, drinkers, bingers.
We want to test in 2 separate models if the effect of the timing measures (dummies)on Alcohol use is mediated by alcohol specific rules set by parents OR if there are any interaction effects between the timing variables and alcohol specific rules set by parents regarding alcohol use.
In the mediation model we used a bootstrap, estimator WLSMV
In the moderation model we used MLR.

- The ordinal outcome variable is skewed to the right. Is there something I should do to overcome the problem of skewness?
- Which estimators are most appropriate for these 2 analyses and which output would you recommend to ask for? sample size = 1893.
- In de mediation model, I do get significant indirect results for the dummy variables, however, the direct model results of some dummies on alcohol use are not significant. Does this mean that there can't be mediation for these variables, because there is no direct effect?

Linda K. Muthen posted on Wednesday, July 14, 2010 - 4:16 pm

Categorical variable methodology takes care of floor and ceiling effects. You do not need to do anything.

You cannot get indirect effects unless you use WLSMV.

A direct effect is not required for mediation. It is the indirect effect that is important.

Karen Offermans posted on Thursday, July 15, 2010 - 2:57 am

Thanks so much for your asistance!

Karen Offermans posted on Friday, July 16, 2010 - 6:01 am

Hi Linda,

I still have one more question regarding the mediation and moderation analyses I described in my previous question. I used the MLR estimator for moderation analysis. Is this the best estimator to use in this analysis or are there better onces I could use?
Furthermore, which fit indices should I at least report in these mediation and moderation analyses?

Linda K. Muthen posted on Friday, July 16, 2010 - 9:41 am

Your estimator choice will be determined in most cases by the analysis. MLR is a good choice. However, if you want to estimate indirect effects you need to use WLSMV. All fit indices that are available will be given as the default.

Marina Epstein posted on Tuesday, June 12, 2012 - 4:59 pm

I am estimating a logistic regression with a number of predictors. Because Mplus deletes cases with missing X variables, I am calling in variances of some of the predictors. I also have a quadratic term. However, when I try to call in the variance of the quadratic term, the model no longer converges. Is there some reason this cannot be done?

Linda K. Muthen posted on Tuesday, June 12, 2012 - 5:29 pm

If you bring the variances of predictors into the model, it must all of the predictors. I'm unclear what you mean by quadratic term. Is this an observed exogenous variable?

Marina Epstein posted on Wednesday, June 13, 2012 - 9:40 am

Linda, thank you for a quick answer. I am estimating STI infection risk from number of partners. I have both the number of partners and number squared to see if the relationship is linear or quadratic. When I call in variance of all but the quadratic term, the quadratic term is significant, but I am losing N because it has missing values. Calling for variances of all of the predictors (including the quadratic term) results in non-convergence.

Linda K. Muthen posted on Wednesday, June 13, 2012 - 11:13 am

Then it is an observed exogenous variable and its variance should be mentioned. Please send the two outputs and your license number to support@statmodel.com.

Marina Epstein posted on Wednesday, June 13, 2012 - 1:58 pm

Thank you. I have sent my particular question to the support staff. I also have a broader question regarding calling in variances. You say that, if I bring in variances, it should be for all of the predictors. What about dichotomous predictors like gender (especially when they do not have any missing values)?

Linda K. Muthen posted on Wednesday, June 13, 2012 - 3:15 pm

You must bring them all in.

Simone Croft posted on Thursday, October 17, 2013 - 7:09 am

I am having difficulty running a logistic regression on my ordinal data (25 items loading onto 5 factors). My outcome variables are dichotomous (7) and continuous (1). The data is weighted so I am using the MLR estimator. This is what I am trying to model:

bcon BY BCON1 BCON2 BCON3 BCON4 BCON5;
bhyp BY BHYP1 BHYP2 BHYP3 BHYP4 BHYP5;
bemo BY BEMO1 BEMO2 BEMO3 BEMO4 BEMO5;
bpeer BY BPEER1 BPEER2 BPEER3 BPEER4 BPEER5;
bpro BY BPRO1 BPRO2 BPRO3 BPRO4 BPRO5;

CADHD CASD CTPSE DADHD DASD DTSEN2 DTBHYP DTMIDEP ON bcon bhyp bemo bpeer bpro;

But I get this fatal error message:
THERE IS NOT ENOUGH MEMORY SPACE � THE ANALYSIS REQUIRES 5 DIMENSIONS OF INTEGRATION RESULTING IN A TOTAL OF 0.75938E+06 INTEGRATION POINTS � YOU CAN TRY TO REDUCE THE NUMBER OF DIMENSIONS OF INTEGRATION OR THE NUMBER OF INTEGRATION POINTS OR USE INTEGRATION=MONTECARLO WITH FEWER NUMBER OF INTEGRATION POINTS�
So I tried it with the MONTECARLO simulation and got this error:
THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY DUE TO A NON-ZERO DERIVATIVE OF THE OBSERVED-DATA LOGLIKELIHOOD.
THE MCONVERGENCE CRITERION OF THE EM ALGORITHM IS NOT FULFILLED. CHECK YOUR STARTING VALUES OR INCREASE THE NUMBER OF MITERATIONS. ESTIMATES CANNOT BE TRUSTED. THE LOGLIKELIHOOD DERIVATIVE FOR PARAMETER 70 IS 0.14864617D+02.
Can you shed any light on this? Thanks in advance.

Linda K. Muthen posted on Thursday, October 17, 2013 - 8:25 am

With categorical factor indicators and maximum likelihood estimation, numerical integration is required, with each factor requiring one dimension of integration. You have five which is computationally demanding. You can try INTEGRATION=MONTECARLO (5000). For categorical factor indicators and many factors, WLSMV may be a better choice because numerical integration is not required.

Jacqueline Sims posted on Wednesday, October 19, 2016 - 7:09 am

Similarly to spelling out the equations, this may also be beyond what you all typically do. However, I am interested in plotting the predicted probability of category membership in an ordered logistic regression (similarly to Peter Mulhall's question back in 2002). However, I am interested in plotting them with an interaction-- at different levels of two continuous predictors. I haven't yet had luck in the User's Guide finding an example of such an equation. Is there perhaps one that you can point me towards? Thanks very much.

Bengt O. Muthen posted on Wednesday, October 19, 2016 - 5:50 pm

See our new book, Chapter 5, for an example. I would plot such probability curves for key values of the continuous predictor.

Salmi Md Zahid posted on Tuesday, February 13, 2018 - 1:03 am

Hi, Im a beginner user of MPLUS. I would like to know if we can run Partial Proportional Odds regression using MPLUS? Which chapter from User's Guide i should refer to? If it can be done using MPLUS, i will purchase this software as soon as possible.