There's been a great deal of chat on SEMNET about single indicator factors. I'm running a model and thought about using a single indicator for an ordered categorical smoking variable. Is constraining the factor variance to equal 1 the best approach for this latent variable?
If you are not adjusting the indicator to reflect reliability, there is no difference between having a factor with one indicator or using the indicator itself in the analysis.
This single indicator issue comes about historically because in some programs you must do this because of how the program is designed. It is simply making the factor equivalent to the observed variable.
Then the idea came about to correct the variable for reliability by adjusting the residual variance of a continuous variable. I personally don't think in many cases the reliabiliy is trustworthy enough for these purposes. This is just my opinion
I am running into the same dilemma. I am being advised to adjust my LV to reflect reliability, but I don't know of the exact method (e.g., syntax) to carry that out.
I've brought up the idea of just using the observed variable in the analysis, but my mentor is concerned about unreliable variance ending up in the residual term. I haven't totally absorbed his argument, but can you weigh in and possibly guide me as to which syntax to use to include a single-indicator LV adjusted for reliability?
Slide 44 of the Topic 1 course handout shows how to do this in Mplus.
Leslie vdL posted on Tuesday, March 16, 2010 - 5:32 pm
I have a stupid question, but I am new to SEM: I do not understand why you would want to use a latent variable for a single indicator. Why would you not just use the single indicator? Really only because the program does not allow you to use the indicator?
I ask this because I have to study an article in which the authors looked at possible physical attributes that could predict whether a bird survived yes or no. All their physical attributes eventually (through direct and indirect events) lead to the 'dependent variable' - which they call fitness. Fitness is a single-item factor, with survival as indicator. Survival is measured as: did the bird die (yes=1, no=0). The residual variance was set to 0, because no measurement error was assumed (no live birds considered dead and vice versa).
Why would they not simply use the observed variable survival?
See the left margin of the homepage under Short Course Handouts.
Kate Sawyer posted on Wednesday, March 31, 2010 - 5:25 pm
I have a similar question regarding syntax for a latent variable with a single indicator.
Here is my situation: I have a longitudinal data set that includes measures of both psychiatric symptoms and cognitive functioning. I want to evaluate a model in which psychiatric symptoms predict cognitive functioning at follow-up. Unfortunately, half of the individuals in the data set were given one questionnaire to measure psychiatric symptoms, and half were given a different questionnaire. Both questionnaires have good demonstrated validity and can reasonably be assumed to measure the same construct; however, they are different questionnaires.
I'm interested in creating a latent variable "Psychiatric symptoms" that is simply the score for whichever psychiatric symptom questionnaire the individual completed and corrects for reliability. Thus, it would be a latent variable with two indicators in that "questionnaire 1" and "questionnaire 2" would both load on "Psychiatric symptoms". But would have one indicator in that any individual would contribute only one questionnaire score (either "questionnaire 1" or "questionnaire 2" depending on which they were administered) to the variable.
Is this possible, and if so would you be able to provide example syntax?