I have measured variables with missing data and I wish to create aggregate scores using DEFINE. I specified missingness using MISSING ARE ALL (-99). How does FIML work in this case? Does FIML estimate values for missing data in the level of the variables defined in NAMES before computing the aggregate scores in DEFINE? Or means are computed first in DEFINE and missing values are estimated after? Any insight regarding this is welcome. Thanks very much.
VARIABLE: NAMES ARE X1-X5 Y1-Y5 Z1-Z5; USEVARIABLES ARE MEANX MEANY MEANZ; MISSING ARE ALL (-99);
I have 3 consecutive waves. I would like to define people who sustain their good scores during those 3 waves, and those that show a relapse at any of those 3 waves. I have 3 variables for which I would like to define the 'sustainers' and 'relapsers'; two of them are binary variables. The third variable is continuous. Here, a relapse is defined as a score below a certain threshold.
As I have missing data on these waves, and as the DEFINE command is run before FIML is being used, I wanted to use multiple imputation. However, I can't run the DEFINE command in combination with type=imputation either, because I get a different number of 'sustainers' and 'relapsers' per imputed dataset. Therefore, I thought of defining the sustainers and relapsers (1 and 0) in each imputed dataset and then averaging these values across all imputed datasets to get a variable similar to 'chance of being a sustainer'.
thank you for your reply. However, I am not quite sure I understand what you mean. I impute the values for the three variables (2 binary and 1 continous) which have been measured over 3 waves (let we call them a, b, and c resp). Then, outside of Mplus, I compute whether a participant declined or sustained on these three variables (a, b and c), thus creating 3 binary variables (variable d, e, and f). Still outside of mplus, I then compute an average across all 40 imputed datasets. So, if a participant is a decliner on variable d in 30 of the datasets and a sustainer in 10 of them, he would get a score of .75 (d_mean). This d_mean score is used in mplus as a continuous dependent variable. Is this what you suggested in your post? If not, could you explain which continous variable you refer to?