| Message/Author |
|
|
| Sanjoy posted on Thursday, April 28, 2005 - 3:41 pm
|
|
|
Dear professor/s, can we compute polychoric correlations in Mplus? What should my ANALYSIS and OUTPUT commands be? I couldn't find an example on the Mplus CD. I have six five-category categorical variables. Thanks and regards |
|
|
| Sanjoy posted on Thursday, April 28, 2005 - 3:45 pm
|
|
|
Oh! In connection with my earlier post, I forgot to mention one thing: none of them are covariates; all six are indicator outcome variables, and they are categorical in nature. I suppose that to check the association among these six we need to find the polychoric correlations. Thanks |
|
|
|
|
| If you put your outcomes on the CATEGORICAL list in the VARIABLE command and ask for TYPE = BASIC; in the ANALYSIS command without the MODEL command, you will get polychoric correlations. |
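A minimal sketch of such an input, assuming six ordinal items; the file name `mydata.dat` and the variable names `u1-u6` are hypothetical placeholders, not taken from the poster's data:

```
TITLE: Polychoric correlations for six ordinal items
DATA: FILE IS mydata.dat;      ! hypothetical file name
VARIABLE: NAMES ARE u1-u6;     ! hypothetical variable names
  CATEGORICAL ARE u1-u6;       ! treat all six items as ordered categorical
ANALYSIS: TYPE = BASIC;        ! sample statistics only, no MODEL command
```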
|
|
| Sanjoy posted on Friday, April 29, 2005 - 5:06 pm
|
|
|
Thank you madam, it worked nicely. Two very quick questions before the weekend starts.

Q1. Kindly tell me whether my code is correct. Below is what I want to do:
I have latent factor "R" loading onto "R7 - R9".
I have latent factor "B" loading onto "B6 - B8".
I WANT to check whether another indicator named "R1" is related to the latent factor "R", and similarly for "B1" with "B".

Below is my code:

   DATA: FILE IS d:\mpluspaper1.txt;
   VARIABLE: NAMES ARE X1-X19 Y1-Y4 XB1-XB6 XP1-XP9 R1-R9 B1-B11 T1-T4;
   USEVARIABLES ARE R1 R7-R9 B2 B6-B8;
   CATEGORICAL ARE R1 R7-R9 B2 B6-B8;
   ANALYSIS: PARAMETERIZATION=THETA;
   ESTIMATOR=WLSMV;
   MODEL:
   R BY R7-R9;
   B BY B6-B8;
   R1 WITH R;
   B2 WITH B;

The output is OK, in fact in line with my expectation. I just want to make sure I have done the correct thing.
* Although I have NOT asked for the correlation between "R" and "B", Mplus reports that in the output as well. WHY?

Q2. I understand the maths behind factor analysis, but how does Mplus measure the correlation between our latent "R" and the other categorical variable "R1"? Does it use the "estimated value of the factor", or something else? |
|
|
|
|
It seems right. Some parameters are free as the default. You can read about defaults in the Mplus User's Guide. If you want this parameter to be fixed to zero, say r WITH b@0; The correlation between r and r1 is a biserial correlation. It is estimated from the sample statistics of the observed variables. You can think of the correlation between r and r1 as the correlation between the factor scores for r and the scores for r1 but factor scores are not actually computed in order to estimate the correlation between r and r1. |
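As a sketch, the poster's MODEL command with the factor covariance fixed at zero would look like this (variable names are those from the post above; only the last line is new):

```
MODEL:
  R BY R7-R9;
  B BY B6-B8;
  R1 WITH R;
  B2 WITH B;
  R WITH B@0;    ! fix the default-free factor covariance at zero
```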
|
|
| Sanjoy posted on Friday, April 29, 2005 - 8:22 pm
|
|
|
Thanks. Well madam, a mild confusion remains.

1. Are we then calculating everything simultaneously in Mplus? I mean, the factor analysis part is OK (in regular textbook jargon, y = delta*eta + epsilon, along with a threshold adjustment since the Ri's and Bi's are all categorical):

   R BY R7-R9;
   B BY B6-B8;
   R1 WITH R;
   B2 WITH B;

Now the vector "eta" is 2*1; one element is "R" and the other is "B", right? R and B are our two continuous latent variables. The next two lines (the WITH statements) require two "polyserial" correlations, one for R1 (5-scale categorical) and R, and the other for B1 (5-scale categorical) and B.

So how is Mplus computing this (I am asking about the program logic behind it)? Is it a two-step or some kind of full-information technique?

Thanks and regards |
|
|
| Sanjoy posted on Friday, April 29, 2005 - 9:38 pm
|
|
|
Madam, in connection with my previous post, kindly check my output and tell me please why we get two DIFFERENT correlation matrices.

Model 1: we are running everything simultaneously

   TITLE: polychoric test
   DATA: FILE IS d:\mpluspaper1.txt;
   VARIABLE: NAMES ARE X1-X19 Y1-Y4 XB1-XB6 XP1-XP9 R1-R9 B1-B11 T1-T4;
   USEVARIABLES ARE R1 R7-R9 B2 B6-B8;
   CATEGORICAL ARE R1 R7-R9 B2 B6-B8;
   ANALYSIS: PARAMETERIZATION=THETA;
   ESTIMATOR=WLSMV;
   MODEL:
   R BY R7-R9;
   B BY B6-B8;
   R1 WITH R;
   B2 WITH B;
   SAVEDATA: SAVE=FSCORES;
   file is d:\polychoric.txt;

   MODEL RESULTS
                 Estimates    S.E.   Est./S.E.
   R BY
     R7            1.000     0.000     0.000
     R8            0.924     0.220     4.205
     R9            0.855     0.195     4.386
   B BY
     B6            1.000     0.000     0.000
     B7            0.890     0.221     4.037
     B8            0.960     0.227     4.233
   R1 WITH
     R             0.369     0.074     4.972
   B2 WITH
     B             0.421     0.084     5.035
   B WITH
     R             0.023     0.055     0.416
   Variances
     R             0.605     0.179     3.386
     B             0.579     0.195     2.965

Model 2: using the data set "d:\polychoric.txt", which has the factor scores saved from Model 1. Here we are calculating only the polyserial correlations between R1 and R and between B2 and B (using TYPE=BASIC). LOOK AT THE OUTPUT: each value is different, the correlations as well as the variances of R and B.

   TITLE: Polyserial test between factor scores and R1 and B2
   DATA: FILE IS d:\polychoric.txt;
   VARIABLE: NAMES ARE R1 R7-R9 B2 B6-B8 R B;
   USEVARIABLE ARE R1 B2 R B;
   CATEGORICAL ARE R1 B2;
   ANALYSIS: PARAMETERIZATION=THETA;
   TYPE=BASIC;

   MODEL RESULTS
   CORRELATION MATRIX (WITH VARIANCES ON THE DIAGONAL)
              R1        B2        R         B
   R1
   B2        0.191
   R         0.353    -0.006     0.332
   B         0.224     0.405     0.040     0.316

Thanks and regards |
|
|
| bmuthen posted on Saturday, April 30, 2005 - 8:52 am
|
|
|
The WLSMV estimator first computes a sample correlation matrix (tetrachoric, polychoric) and then fits the model to that, thereby estimating the model parameters. So the fitting of the model is similar to what is done if the outcomes had been continuous. No factor score estimation is involved in this, but the parameters are estimated directly. If you instead estimate factor scores and then fit a model to a covariance matrix involving those estimated scores, you will get biased results. These biases are well-known in psychometrics and are due to the fact that estimated factor scores do not have the same variances or covariances with other variables as the true factors. See literature on factor score estimation in Psychometrika. |
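One way to see the direction of this bias in the variances, sketched under the assumption that the estimated score is the conditional mean of the factor given the observed items, $\hat\eta = E[\eta \mid y]$ (an idealization; the scores a program saves may be computed differently), is the law of total variance:

```latex
\operatorname{Var}(\eta)
  = \operatorname{Var}\big(E[\eta \mid y]\big) + E\big[\operatorname{Var}(\eta \mid y)\big]
\quad\Longrightarrow\quad
\operatorname{Var}(\hat\eta)
  = \operatorname{Var}(\eta) - E\big[\operatorname{Var}(\eta \mid y)\big]
  \le \operatorname{Var}(\eta).
```

This is in line with the outputs posted above, where the saved scores for R have variance 0.332 while the model-estimated factor variance is 0.605.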
|
|
| Sanjoy posted on Saturday, April 30, 2005 - 4:41 pm
|
|
|
Thank you Professor. I think I understood your point, at least partially.

1. In our Model 1 we keep the idea of checking the correlation between R1 and R (which is the common factor of R7-R9), but we are not calculating the factor scores. Hence we are circumventing the problems associated with factor score calculation, such as Thurstone's validity maximization at the cost of non-orthogonality, or Anderson's procedure, which ensures orthogonality but lacks determinacy, and so on.

2. In Model 2, instead of R, we are using "estimated R", which itself incorporates some measurement error, and hence we end up with some bias when calculating the correlation between R and R1 in the second step. Am I right?

I have never studied psychometrics; my majors were Statistics and Economics, so my acquaintance with the psychometric literature is very minimal. Could you please refer me to one seminal article like yours (1984, 1983) so that I will be able to understand the basic nuances of, and the solution to, the factor score calculation problem? I'm relatively comfortable with mathematical rigor.

Thanks and regards |
|
|
| bmuthen posted on Sunday, May 01, 2005 - 5:05 pm
|
|
|
| Sounds like you got that right. As for factor score literature, search for Skrondal's Psychometrika article in the last 5 years. |
|
|
| Sanjoy posted on Sunday, May 01, 2005 - 6:02 pm
|
|
|
| Thank you professor ... I will look for his articles ... regards |
|
|
|
|
| If I specify all indicator variables as ordinal, does MPlus calculate (and perform all subsequent calculations on) polychoric correlation matrices by default? |
|
|
|
|
| Yes, for an unconditional model using weighted least squares regression. For a conditional model, the sample statistics used for model estimation are the thresholds, probit regression coefficients, and residual polychoric correlations. |
|
|
|
|
I am trying to estimate the polychoric asymptotic covariance matrix in text format in Mplus 5.21 and am wondering whether the following syntax is appropriate. Thanks in advance.

   TITLE: This is the Mplus syntax to extract polychoric asympt cov matrix in text format
   DATA: FILE IS c:\tetrad\file.txt;
   VARIABLE: NAMES ARE q83 q84 q85 q88 q89 q90 q91;
   CATEGORICAL q83 q84 q85 q88 q89 q90 q91;
   ANALYSIS: TYPE = GEN;
   ESTIMATOR = WLS;
   MODEL:
   q83-q90 WITH q91;
   q83-q89 WITH q90;
   q83-q88 WITH q89;
   q83-q85 WITH q88;
   q83-q84 WITH q85;
   q83 WITH q84;
   SAVEDATA: tech3 is Jason22.acm;
   OUTPUT: SAMPSTAT; |
|
|
|
|
| That should do it. |
|
|
|
|
Hi, I was wondering whether there is an adequate procedure to obtain the polychoric correlation between two variables with underlying non-normal distributions that have, in addition, been censored in the middle (so that only the extremes are used) and dichotomized? Thanks a lot, Sanja |
|
|
|
|
| I am not aware of such a procedure. |
|
|
| Cecily Na posted on Tuesday, December 14, 2010 - 9:41 pm
|
|
|
Dear Linda, I did an SEM with WLSMV. I suppose the correlation matrix in the output before the model estimation is the polychoric matrix of the variables? Why are the correlations on the diagonal not 1, but very close to 1? I am copying the diagonal of the correlation matrix from the output, all with non-1 values: 0.851 0.993 0.998 0.994 0.747 0.744 0.985 Thank you very much! |
|
|
| Cecily Na posted on Tuesday, December 14, 2010 - 11:47 pm
|
|
|
Dear Linda, A follow-up of my previous post. I think I mistook the covariance coverage of data for correlation matrix. So there shouldn't be any confusion regarding it. I would like to know what the covariance coverage of data in the output is. Thank you very much for your time. |
|
|
|
|
| It tells you the proportion of observations with data present for each variable (diagonal) and for each pair of variables (off-diagonal). |
|
|
| Cecily Na posted on Wednesday, December 15, 2010 - 8:34 am
|
|
|
Dear Linda, Thanks! When I use WLSMV, the correlation matrix printed in the output before the model estimates should contain the polychoric correlations of the observed variables, right? Thanks! |
|
|
|
|
| If you ask for SAMPSTAT and put the ordered polytomous variables on the CATEGORICAL list, the correlations for those variables are polychoric correlations. |
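A minimal sketch of an input along these lines; the file name, variable names, and the one-factor model are hypothetical and stand in for whatever model is actually being fitted:

```
TITLE: Sample statistics for ordinal indicators
DATA: FILE IS items.dat;       ! hypothetical file name
VARIABLE: NAMES ARE u1-u6;     ! hypothetical variable names
  CATEGORICAL ARE u1-u6;       ! ordered polytomous items
ANALYSIS: ESTIMATOR = WLSMV;
MODEL: f BY u1-u6;             ! hypothetical one-factor model
OUTPUT: SAMPSTAT;              ! request the sample statistics
```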
|
|
| Cecily Na posted on Saturday, February 05, 2011 - 11:11 am
|
|
|
Dear Linda, I used WLS to generate polychoric covariance matrix. Why couldn't I get the covariance matrix, but only the correlation matrix? What is the command I can use? Thanks a lot! |
|
|
|
|
| There is no such thing. |
|
|
|
|
Hi Dr. Muthen, I have one simple question on obtaining tetrachoric/biserial correlations. I have tried the "type=basic" command, as you pointed out. In addition, I also tried the "model" command along with "sampstat." The correlation matrices are somewhat different, and I was wondering why this occurs? Thank you. |
|
|
|
|
| Please send the two outputs and your license number to support@statmodel.com. |
|
|