Mplus Discussion >> Hamaker RI-CLPM input

Topics
Last Day
Last 3 Days
Last Week
Tree View

Edit Profile


Hamaker RI-CLPM input

Mplus Discussion > Structural Equation Modeling >

Message/Author

Bengt O. Muthen posted on Friday, March 16, 2018 - 4:05 pm

The topic of cross-lagged panel modeling has come up a couple of times recently. I recommend the article by Hamaker et al (2015) in Psych Methods: A critique of cross-lagged panel modeling. Here is Hamaker's Mplus input for the proposed RI-CLPM in Figure 1:

MODEL: ! Create two individual factors (random intercepts kappa and omega)
mu_x BY x1@1 x2@1 x3@1 x4@1;
mu_y BY y1@1 y2@1 y3@1 y4@1;

! Create within-person centered variables
cx1 BY x1@1; cx2 BY x2@1;
cx3 BY x3@1; cx4 BY x4@1;
cy1 BY y1@1; cy2 BY y2@1;
cy3 BY y3@1; cy4 BY y4@1;

! Constrain the measurement error variances to zero
y1-x4@0;

! Optional: Constrain observed means per variable over time
! [x1 x2 x3 x4] (mx);
! [y1 y2 y3 y4] (my);

! Specify the lagged effects between the within-person centered variables
! Optional: constrain them to be invariant over time
cx2 ON cy1 cx1;
cx3 ON cy2 cx2;
cx4 ON cy3 cx3;
cy2 ON cy1 cx1;
cy3 ON cy2 cx2;
cy4 ON cy3 cx3;

! Within-person centered variables at the first wave correlated
cx1 WITH cy1;

! Allow the residuals (dynamic errors) at subsequent waves to be correlated
cx2 WITH cy2;
cx3 WITH cy3;
cx4 WITH cy4;

! Fix the correlation between the random intercepts and the within-person centered
! variables at the first wave to zero (as by default these would be estimated)
mu_x WITH cx1@0 cy1@0;
mu_y WITH cx1@0 cy1@0;

Note that the cx and cy factors behind each outcome are used to represent the within-level (within-subject) part of the outcomes - the between-subject part is captured by the random intercepts - so that the cross-lagged regressions refer to relationships on the within level.

claudia cs posted on Thursday, July 05, 2018 - 6:31 am

Hi there,

I am running the RI-CLPM for motivation and different emotions. I have two questions/concerns:

a) I am confused why some autoregressive paths are not significant (I also ran the models according to the traditional CLPM, and they were all significant). I am aware that the autoregressive parameters in the traditional CLPM are usually higher, however, I would not know how to interpret these findings.

b) How do I account for measurement invariance using the RI-CLPM? Can I still do latent modelling for the scales using the items (and enforcing equal factor loadings across the three measurement points).

Your help is much appreciated!
Thank you

Bengt O. Muthen posted on Thursday, July 05, 2018 - 5:48 pm

a) This is surprising - are you sure you set it up correctly? Or, perhaps your time points are not close in time so that the intercept factor captures most of the correlation across time.

b) Yes.

claudia cs posted on Friday, July 06, 2018 - 5:39 am

Thank you for your response.

I agree, this is a surprising result - especially given that the traditional CLPM show significant effects. Unfortunately, my syntax is too big to paste in here. May I send it?

Thank you very much.

Bengt O. Muthen posted on Friday, July 06, 2018 - 5:57 pm

Send your output - and data if possible - to Support along with your license number.

lisanne@versteegt.com posted on Thursday, July 19, 2018 - 3:01 am

Hello,

Thank you very much for posting the code to apply RI-CLPM in Mplus. It was very helpful, and I got it working for my panel (longitudinal) data.

I am used to analyzing longitudinal data in multilevel models; therefore, the terminology that I will use below will be quite similar.

For my model, I would like to add a moderator for the cross-lagged paths (e.g., for cx1 on cy2). In multilevel terms, this would be an interaction between a level 2 variable and a level 1 variable (or: an interaction of a between-person variable that has been measured once and the within-person part of a repeated measure).

I checked the multilevel moderation and mediation sources for Mplus. However, as the RI-CLPM does not seem to take the typical multilevel approach, I'm having a hard time to integrate the moderation models with RI-CLPM.

I was wondering whether someone has any thoughts/ideas on this.

Thank you for your consideration.

Bengt O. Muthen posted on Thursday, July 19, 2018 - 7:47 am

I hope to have some thoughts on that from Hamaker herself before too long.

Ellen Hamaker posted on Thursday, July 19, 2018 - 12:43 pm

When the moderator is invariant over time, you have several options:
a) a multiple group approach based on a median split of the moderator, and then compare parameters across the two groups; this does involve overruling some of the Mplus multiple group defaults though;
b) a DSEM (i.e., multilevel time series) approach (this requires your data to be in long format, rather than in wide format); adding a cross-level interaction is really simple then (you just add the moderator as a between level predictor for the random slopes); the disadvantage is that it automatically assumes that all the parameters are invariant over time (e.g., the lagged relationships between waves, or the residual variances), which may be problematic when the intervals between the waves vary, and/or if the time intervals between the waves are relatively large and developmental changes are may have occurred during the study; however, some of these constraints may be overcome by adding dummy variables for the different waves, and interaction terms between these and the lagged predictors;
c) add the actual interaction terms to the RI-CLPM (with data in wide format); this would require the interaction between the moderator and the within-person centered latent variables (e.g., cy1); I believe this could be done using the XWITH statement.

Note that the latter would also be the go-to option if you have a time-varying moderator.

lisanne@versteegt.com posted on Monday, July 23, 2018 - 12:14 am

Thank you very much for your response, it was very helpful.

I worked on option c) and started with just one moderated path, however I got the following message:

THE ESTIMATED COVARIANCE MATRIX COULD NOT BE INVERTED.
COMPUTATION COULD NOT BE COMPLETED IN ITERATION 1.
CHANGE YOUR MODEL AND/OR STARTING VALUES.

I added the following lines to the code in the opening post, to show how I defined my time-invariant moderator and interaction terms.

analysis: TYPE=RANDOM;
ALGORITHM = INTEGRATION !(to use the XWITH statement)

cm1 BY m1@1
m1@0
cx3 ON cy2 cx2 m1;
cy2xcm1 | cy1 XWITH cm1;
cx3 ON cy2xcm1;

---

I'm already quite skeptical that I can reach convergence with this particular model with my current data (917 observations), however I can not pinpoint at the moment whether that is an issue right now or whether I overlooked something with my code.

Thank you for pointing me into the direction of option b), which could also be viable in my case.

Bengt O. Muthen posted on Monday, July 23, 2018 - 10:21 am

Send your output to Support along with your license number so we can take a close look at it.

Otherwise, option a) is down to earth.

Claire Johnston posted on Friday, July 27, 2018 - 12:49 am

Thank you for this code, very helpful. I have a model that does not converge (I have tried increasing the number of iterations already) and I think that the problem is with one of the new within person centered variables. I had previously run a normal CLPM with no problems.

What can I try to get convergence?

Bengt O. Muthen posted on Friday, July 27, 2018 - 12:16 pm

That depends. Send your output to Support along with your license number.

Martijn Van Heel posted on Monday, July 30, 2018 - 7:29 am

Dear

Is it a necessary part of the model to constrain the measurement error variances to zero? (in the example: y1-x4@0;)
Would it bias the model if they were not constrained?
Many thanks in advance.

Bengt O. Muthen posted on Monday, July 30, 2018 - 1:24 pm

Here is Hamaker's answer on this:

You can estimate measurement error variance, although you need som constraint for identification(e.g., all measurement error variances are held equal; or the first and last are equal to the second and second last respectively.

Adding measurement error results in Kenny and Zautra�s trait state error (TSE) model (formerly known as the STARTS model). In general it requires a larger number of repeated measurements (say 8 waves or more), to be empirically identified.

Borja Del Pozo Cruz posted on Sunday, August 26, 2018 - 6:45 pm

Dear Muthen,
I am running an RI-CLPM model with two variables and two waves, and 3 co-variates. I wonder if my syntax is correct? model cant converge. what can i do? thanks!
MODEL:
RI_PORCSB BY PORCSB1@1 PORCSB2@1;
RI_FTS BY FTS1@1 FTS2@1;
cPORCSB1 BY PORCSB1@1; cPORCSB2 BY PORCSB2@1;
cFTS1 BY FTS1@1; cFTS2 BY FTS2@1;
PORCSB1-FTS2@0;
cPORCSB2 ON cPORCSB1 cFTS1 AGE1 SEX1 BMI1;
cFTS2 ON cPORCSB1 cFTS1 AGE1 SEX1 BMI1;
cPORCSB1 WITH cFTS1;
cPORCSB2 WITH cFTS2;
RI_PORCSB WITH cPORCSB1@0 cFTS1@0;
RI_FTS WITH cPORCSB1@0 cFTS1@0;
OUTPUT: TECH1 STDYX SAMPSTAT;

Ellen Hamaker posted on Monday, August 27, 2018 - 10:10 am

A RI-CLPM with only two waves is not identified (as the traditional CLPM is
already saturated then, so adding two latent variables with a covariance is
not possible then). While having some covariates here may ensure that the
number of parameters does not exceed the number of sample statistics (such
that the model is not resulting in a negative number of df), the model is
probably still not identified, because with only two waves, it is not
possible to tell the difference between stability due to autoregression
versus stability due to a trait. In contrast, when you have three waves, you
can tell the difference between these two forms of stability, because they
imply different covariance structures (i.e., the typical simplex structure
versus the one factor structure).

Borja Del Pozo Cruz posted on Tuesday, August 28, 2018 - 2:14 pm

Thank you for your response. My understanding is that i have to go with a traditional clpm.
here is what i have now:
MODEL:

PORCSB2 ON PORCSB1 FTS1 AGE1 BMI1 SEX1;
FTS2 ON PORCSB1 FTS1 AGE1 BMI1 SEX1;
PORCSB1 WITH FTS1;
PORCSB2 WITH FTS2;

However, age and bmi are time variant covariates. how do i account for this in the model?
my alternative would be:
PORCSB2 ON PORCSB1 FTS1 AGE2 BMI2 SEX1;
FTS2 ON PORCSB1 FTS1 AGE2 BMI2 SEX1;
PORCSB1 ON AGE1 SEX1;
FTS1 ON AGE 1 SEX1;
PORCSB1 WITH FTS1;
PORCSB2 WITH FTS2;
is correct? thanks

yuxiong posted on Tuesday, October 02, 2018 - 9:40 am

I am trying to use RI-CLPM in multigroup comparison, however I got identification issues. I read the post in the comment that "a) a multiple group approach based on a median split of the moderator, and then compare parameters across the two groups; this does involve overruling some of the Mplus multiple group defaults though"

can I ask what specifically needs to be overruled? I only know that reference factor mean is set to 0 by default but it does not seem to influence this one.

Bengt O. Muthen posted on Tuesday, October 02, 2018 - 5:25 pm

Here is Hamaker's answer:

Mplus will impose the following default constraints (related to strong factorial invariance):

a) equal factor loadings across groups; this is no problem here, because all factor loadings are constrained to be 1 anyway

b) equal intercepts across groups; this you typically do not want, so you need to free the intercepts in the second group; you can simply do this by specifying them as free parameters; when you have x1 to y3, you simply include for the second group: [x1-y3];

c) free latent means in the second group; this leads to trying to estimate more parameters for the mean structure than that there are observed means (hence the identification problems); you need to constrain all the means of the latent variables in the second group to zero; this includes the means of the random intercept factors, and the means of the within-person centered variables per occasion

yuxiong posted on Tuesday, October 02, 2018 - 6:14 pm

Thanks! It works!

Jeremy Stevenson posted on Thursday, October 04, 2018 - 10:23 pm

Hi there,
I�m running the RI-CLPM for 16 separate models: 8 different predictors, and 2 different outcome variables. When I run the 8 models for the first outcome variable, everything goes smoothly. When I run the models with the second outcome variable, nearly all of the models produce the following error: THE LATENT VARIABLE COVARIANCE MATRIX (PSI) IS NOT POSITIVE DEFINITE. THIS COULD INDICATE A NEGATIVE VARIANCE/RESIDUAL VARIANCE FOR A LATENT VARIABLE, A CORRELATION GREATER OR EQUAL TO ONE BETWEEN TWO LATENT VARIABLES, OR A LINEAR DEPENDENCY AMONG MORE THAN TWO LATENT VARIABLES. CHECK THE TECH4 OUTPUT FOR MORE INFORMATION. PROBLEM INVOLVING VARIABLE MU_Y.
It is generally the same variable (the random intercept of the second outcome variable) that has a negative variance. Any thoughts why this might be the case? Both outcome variables are measures of social anxiety, so I�m not sure why this is occuring - they are very similar.
One idea I�ve had is to fix the variance of the random intercept of the second outcome variable to be similar to that of the first outcome variable. I know it�s recommended to fix it to zero, but doesn�t that defeat the whole purpose of the RI-CLPM?

Ellen Hamaker posted on Friday, October 05, 2018 - 12:37 am

A negative variance estimate typically means a model is too complex for the data you have. In this case, I would conclude that the second outcome variable is not characterized by stable between-person differences; rather, everyone varies around the same mean (or trend) on this variable. Hence, you can do one of two things:
1) You can set the variance of this random intercept to zero, and also set the covariance between this random intercept and the other to zero, while still estimating the random intercept of the predictor freely; this will still result in a warning about the covariance matrix, but then you can just ignore it.
2) You can adjust the model by taking the random intercept for this outcome variable out of the model, and model the lagged relationship between the within-person part of the predictor variable (note that for that variable you keep the random intercept in the model), and the original (i.e., non-decomposed) outcome variable.
These two options are statistically identical (same fit etc.), and should lead to the same lagged parameter estimates.

EH posted on Sunday, October 07, 2018 - 1:00 am

Dear dr Muthen,
for RI-CLPM with latent variables, can i (1) use factor scores as person centered variables, or should i make a new by-statement of the factors?(2)does my input seem right?

1) WRLFT1, T2,T3 by the items (factor loadings = across time) same for WRLIT1,2,3; PEET1,2,3 and for PEIT1,2,3
2) Intercepts of items = across time
3) RI_WRLF by WRLFT1@1,T2@1,T3@1;(same for WRLI, PEE and PEI)
4) [RI_WRLF]; [RI_WRLI]; [RI_PEE]; ...
5) Intercept of all factors = 0: [WRLFT1@0 WRLFT2@0 WRLFT3@0]...;
6) All measurement error variances =0: WRLFT1@0 WRLFT2@0 WRLFT3@0 PEET1@0�
7) Aur effects: WRLFT3 ON WRLFT2 (WRLF); WRLFT2 ON WRLFT1 (WRLF); WRLIT3 ON WRLIT2 (WRLI)...;
8) All possible cross-lagged effects: even if not hypothesized
9) Corr within person variables: PEET1 WITH PEIT1 WRLFT1 WRLIT1 WRLNT1; PEIT1 WITH WRLFT1 WRLIT1 WRLNT1;...;
10) Corr residuals at subsequent waves
11) Corr between the RI's and the other exogenous var =0: e.g., RI_WRLF WITH PEET1@0 PEET2@0...;

Ellen Hamaker posted on Sunday, October 07, 2018 - 7:04 am

You can find a pdf that explains how to specify a multiple indicator RI-CLPM (including Mplus input files and simulated data) here: https://www.statmodel.com/RI-CLPM.shtml

Philipp Alt posted on Wednesday, February 13, 2019 - 6:42 am

Hello,

I am tinking about setting up an RI-CLPM to study the development of two processes.
However, I am not quite sure if this approach is quite fitting, because of my data-structure:

I have data from 9 waves with an age-range within each wave from 8 to 15 years. Because I want to study the processes as a function of age rather than wave, I restructured all the data and basically pooled the different age groups from all waves and then remerged them into one wide dataset, giving me a large dataset with age in the columns rather than wave.

But this leaves me with a couple of question before setting up a RI-CLPM:

1) Is there a possibilty to control for the cohort, that the person is from with the RI-CLPM? I would think that this mandatory.

2) Is there a way to control for the fact that people had different participation rates? Some people were measured in all 9 waves, some just once. This would also have to be addressed, I think. Is this possible wih the RI-CLPM approach?

3) I also have siblings in the data set. Rather than excluding them, I was wondering if I could keep them in the data set and control for the non-indepence via TYPE=COMPLEX,for example, in the RI-CLPM approach?

Kind regards and thank you in advance,

Philipp

Bengt O. Muthen posted on Wednesday, February 13, 2019 - 11:47 am

1) Right, age and not wave should be the time axis. You can handle this in 2 different ways, either using a dummy variable influencing the person-specific intercept or using a multiple-group approach where group correspond to cohort. The UG ex 6.18 shows how to do the latter approach which is very flexible so that you can consider which RI-CLPM parameters are cohort invariant. The ex6.18 approach is also discussed in our Short Course Topic 4 on our website; see slides 48 and on.

2) Yes, this would be handled by standard ML under the usual MAR assumption, also called FIML. Just give missing data flags for the missing values.

3) Right, Type=Complex can be used to adjust the SEs. This assumes of course that subjects with siblings have the same parameter values as subject without siblings (Complex does not allow for different parameter values).

Niyantri Ravindran posted on Friday, April 05, 2019 - 7:39 pm

Hello,

I am testing an RI-CLPM model but the random intercept for one of my variables does not have significant variance across individuals. The covariance between my two random intercepts is also not significant. If I estimate variances of both random intercepts and the covariance, I get an error message that says that the latent variable covariance matrix is non-positive definite. If I constrain the variance (and covariance) to 0 for the one variable, the error message disappears. I am still estimating variance in the random intercept for my other variable (which is significant). I understand that if I constrain both random intercepts my model is identical to the CLPM. But in this case, I am only constraining one. Is it okay to estimate the RI-CLPM in this way? It seems to make no substantive difference to the results.

In general, if there is no trait-like aspect in one variable but there is in the other, is it okay to still use RI-CLPM and just constrain the variance of the intercept for that variable?

Thanks in advance.

Ellen Hamaker posted on Saturday, April 06, 2019 - 12:18 pm

When the variance of a random intercept is not significant, this implies there is not really evidence that there are individual differences in this term. Hence, fixing the variance to zero is a reasonable next step. Alternatively, you can decide to remove the entire random intercept from your model: This is actually the same thing, but it will also make the error message disappear. Either way, it means the new model does not include stable, time-invariant differences between individuals in that particular variable, while there may still be time-invariant, trait-like individual differences on the other variable, which are adequately captured by the remaining random intercept.

Yeonjeong Kim posted on Sunday, June 02, 2019 - 11:33 am

Hi

When we run a RI-CLPM using mplus with an option of STDYX, are the resulting standardized coefficient from within-person standardization (or values from BP or GP)?

Thank you in advance,
Yeonjeong

Ellen Hamaker posted on Sunday, June 02, 2019 - 11:54 am

STDYX will standardize each regression coefficient using the variances (or standard deviations) of the predictor and the outcome variable that are associated with this regression coefficient.

Since in the RI-CLPM, the lagged coefficients are included between the within-person components, the standardization also occurs using only within-person variance. Hence, this implies it is within-person standardization.

dummyvariable123 posted on Thursday, July 04, 2019 - 7:25 am

Hello,

I wonder whether if it's possible to run RI-CLPM with 3 levels (L1 within-person, L2 between-person, and L3 between-classroom)?

If yes, how does the syntax looks like for a 3-level model with a random slope at L2?

Thank you in advance.

Ellen Hamaker posted on Thursday, July 04, 2019 - 12:23 pm

I haven not done or seen this done before, but it should be possible in Mplus with multilevel SEM (use TYPE = TWOLEVEL, possibly also RANDOM if you want random regression parameters).

Note that the regular RI-CLPM is in wide-format (meaning: it is not specified as a multilevel model but as a regular SEM model). In your case you would thus have time points (level 1) as variables and persons (level 2) as cases (i.e., rows in your datafile), just as in the regular RI-CLPM.

Then you can include classroom (level 3) as the between level clusters, and decide whether you want random lagged parameters or not. You can check UG example 9.5 for an illustration of a 2-level path model with random regression coefficients.

dummyvariable123 posted on Monday, July 08, 2019 - 2:28 am

Thank you for your response. Would RI-CLPM syntax look like this?:

USEVARIABLES IS x1 x2 x3 z1 z2 z3;
CLUSTER=id classroom;

ANALYSIS:
TYPE=threelevel random;

MODEL:
%within%
Ix1 BY x1@1;
Ix2 BY x2@1;
Ix3 BY x3@1;
Iz1 BY z1@1;
Iz2 BY z2@1;
Iz3 BY z3@1;

x1@0;
x2@0;
x3@0;
z1@0;
z2@0;
z3@0;

Ix3 ON Ix2;
Ix2 ON Ix1;
Iz3 ON Iz2;
Iz2 ON Iz1;

Ix3 ON Iz2;
Ix2 ON Iz1;
Iz3 ON Ix2;
Iz2 ON Ix1;

Ix3 WITH Iz3;
Ix2 WITH Iz2;
Ix1 WITH Iz1;

dummyvariable123 posted on Monday, July 08, 2019 - 2:32 am

continued:

%between id%
RIx BY x1@1 x2@1 x3@1;
RIz BY z1@1 z2@1 z3@1;

SLx BY x1@0 x2@1 x3@2;
SLz BY z1@0 z2@1 z3@2;

RIx;
RIz;
SLx;
SLz;

RIx WITH Ix1@0 Iz1@0;
RIz WITH Ix1@0 Iz1@0;

%between classroom%
RIx BY x1@1 x2@1 x3@1;
RIz BY z1@1 z2@1 z3@1;
RIx;
RIz;
RIx WITH Ix1@0 Iz1@0;
RIz WITH Ix1@0 Iz1@0;

Bengt O. Muthen posted on Sunday, July 14, 2019 - 11:44 am

We need to see your full output - please send your output to Mplus Support along with your license number.

We ask that postings be limited to one window.

ywang posted on Tuesday, July 23, 2019 - 1:25 pm

Dear Dr. Muthen,

We used a CLPM model to assess two variables across three waves. The two interval is about 10.5 and 13.5 months, separately. Is there a way to account for the differences in the length of the time? Any Mplus syntax example for continuous-time CLPM?
Thanks!

Ellen Hamaker posted on Wednesday, July 24, 2019 - 10:24 am

When intervals in a CLPM design are of different length, the parameters should not be constrained to be identical over time. Such constraints are only sensible when the intervals are identical, and you assume the underlying process remains the same over time.
To determine whether the underlying dynamics remain the same even though the intervals are different, a continuous time perspective is needed. This is explained in more detail here: https://ryanoisin.github.io/files/RyanKuiperHamaker_preprint.pdf
To summarize the main issue: The constraints that are needed are on the matrix with lagged parameters, rather than on separate lagged parameters, making it difficult to impose them in Mplus. Alternatively, one could first estimate the model in the conventional way (without the constraints), and then convert the parameters obtained for the two intervals to refer to an interval of the same length (e.g. 12 months, see also https://ryanoisin.github.io/files/KuiperRyan_2018_DrawingConclusions_SEM.pdf). However, there is at this point no test to determine whether these converted parameters are significantly different from one another.
Alternatively, you could use software that was specifically designed for contrinuous time modeling, such as ctsem in R.

Anna MacKinnon posted on Wednesday, August 07, 2019 - 2:01 pm

Hello,

I have tried the code above to model RI-CLPM with 2 variables (anxiety and insomnia) across 4 time points (equal 10 week intervals), using Estimator = ML, and did not constrain observed means per variable over time - but receive the following message:
NO COVERGENCE. NUMBER OF ITERATIONS EXCEEDED.
This persists even when I increase iterations to 20000.
Is there anything else I can adjust in the code to get it to converge?

Thank you

Bengt O. Muthen posted on Wednesday, August 07, 2019 - 5:29 pm

We need to see your full output - and data if possible - send to Support along with your license number.

shonnslc posted on Wednesday, October 02, 2019 - 9:27 am

Hi,

I am wondering if it is necessary to specify the dynamic errors in RI-CLPM:

cx2 WITH cy2;
cx3 WITH cy3;
cx4 WITH cy4;

What happens if this part is not specified in the model? I am doing power analysis for RI-CLPM and I encountered replication error when I specified dynamic errors but when I removed this part, there was no error message for each replication. Thanks.

Bengt O. Muthen posted on Wednesday, October 02, 2019 - 5:09 pm

Those error covariances are standard and should not cause the problem you are seeing. But we need to see your full output to see what's going on - send to Support along with your license number.

shonnslc posted on Monday, November 04, 2019 - 9:56 am

Hi,

I am wondering if this is a correct way to add covariates to RI-CLPM (https://www.statmodel.com/download/RI-CLPM%20Hamaker%20input.pdf):

RI_x RI_y on sex income; #A1
cx1 cy1 on sex income; #A2

Because if I only added covariates to either A1 or A2, my model did not converge. Thanks!

Bengt O. Muthen posted on Monday, November 04, 2019 - 2:11 pm

We need to see your full output to say - send to Support along with your license number.

Philipp Alt posted on Tuesday, January 21, 2020 - 4:59 am

I have more of a conceptual question about the RI-CLPM:

My understanding of the RI-CLPM is that you control for stable between person differences in the between part of the model. Therefore it does not make sense to control stable covariates (variables that do not change) in the within part of the model anymore, as they are already controlled for in the between-part. Is this assumption right?

Ellen Hamaker posted on Tuesday, January 21, 2020 - 5:19 am

Hi Philipp, your reasoning is mostly correct. However, you could also consider regressing the observed variables on a time-invariant covariate directly (rather than through the random intercepts); this would allow for the effect of this covariate to change over time. If you�d constrain the regression parameters in this model to be invariant over time, the model ibecomed identical to the model in which the random intercept is regressed on the time-invariant covariate. Hence, you can do a chi-square test to compare these two options (time-varying effect vs constant effect).

Adrienne D. Woods posted on Thursday, January 23, 2020 - 1:59 pm

Dear Drs. Muthen,

I�m analyzing data for ~14,000 weighted cases across three timepoints using Hamaker�s RI-CLPM. I�m hoping to break this sample into subgroups to ascertain whether the cross-lagged paths operate in the same way across groups. My code for the full sample works perfectly fine, and if I create separate datasets by subgroup the code again works just fine (e.g., dataset for White, dataset for Black, etc.). However, when I use the GROUPING command, I get warnings that the model is not identified. Is it possible to use the GROUPING option in order to use DIFFTEST without splitting the sample into separate datasets?

Thank you!

Here's the error message:

THE MODEL ESTIMATION TERMINATED NORMALLY
THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES COULD NOT BE COMPUTED. THE MODEL MAY NOT BE IDENTIFIED. CHECK YOUR MODEL.
PROBLEM INVOLVING THE FOLLOWING PARAMETER:
Parameter 24, Group DISAB: [ CACH1 ]
THE CONDITION NUMBER IS -0.688D-12.
THE ROBUST CHI-SQUARE COULD NOT BE COMPUTED.

Ellen Hamaker posted on Friday, January 24, 2020 - 6:56 am

I believe you are trying to run a multiple group version of the RI-CLPM. This is a little tricky, as it requires you to overrule the multiple-group-factor-analysis-defaults that Mplus imposes. Specifically, Mplus will constrain the intercepts of observed variables to be identical across the groups, and free the latent means (i.e., for all the variables defined by a BY statement) in the second group; this leads to a model that is unidentified in this case.
You can find the correct code for this (and other extensions of the RI-CLPM) here:
jeroendmulder.github.io/RI-CLPM/mplus

Yeonjeong Kim posted on Thursday, February 20, 2020 - 3:27 pm

Hi,

I wonder whether there is a way to conduct statistical testing that compares the strengths of cross-lagged parameters (rather than just descriptively compare the standardized coefficients) when two sets of variables are not on the same metric?

I was thinking about running a model after standardizing all variables, and directly test whether the different score is greate than zero. But it is difficult to do WP standardization on variables because of the large amount of missing values.

If I use the BP standardization, I think I can interpret the results using the relative order changes, but not sure this would be a good approach.

Thanks in advance!

Bengt O. Muthen posted on Thursday, February 20, 2020 - 4:26 pm

You can express the standardized coefficients as new parameters in Model Constraint and then a difference parameter - that gives you a test. But it can be cumbersome to express those formulas.

Adrienne D. Woods posted on Friday, February 21, 2020 - 10:23 am

Hello Dr. Hamaker,

Thank you for your reply above. I am now able to run the multiple group analysis using the syntax on your website. I have two further questions:

1) why do the estimates differ when using the GROUPING command vs. when estimating separately by subsample (i.e., after creating separate datasets by subgroup)?

2) what is the syntax for testing whether, overall, the model fits differently across the subgroups?

Thank you again!

Bengt O. Muthen posted on Friday, February 21, 2020 - 3:51 pm

1) Grouping runs impose certain invariance constraints that your separate-group analyses won't have - check your output, e.g. Tech1 to see if you have any equalities between parameters.

2) You can only compare the fit indices - you can't test whether a model fits better in one group or another.

Craig Sewall posted on Tuesday, February 25, 2020 - 12:12 pm

Hello,
I am planning a 3-wave study and will be analyzing the data with a RI-CLPM. I would like to run a power analysis before collecting data, however, I am unsure how to run the RI-CLPM using Montecarlo simulation. Any suggestions or resources on how to do this are greatly appreciated!

Ellen Hamaker posted on Wednesday, February 26, 2020 - 3:00 am

You can find Mplus code for running the RI-CLPM (and several frequently asked for extensions) here: https://jeroendmulder.github.io/RI-CLPM/mplus You can quite easily adjust this code for a Monte Carlo study (we don't have that code up there--yet).
In some preliminary and unpublished simulations that we ran, we found that three waves of data can lead to considerable bias in the estimates of the lagged effects, although this also depends on the actual parameter values. My advice would be to opt for four waves of data, as this was much better in terms of reducing the bias (and increasing power).

Craig Sewall posted on Wednesday, February 26, 2020 - 8:43 am

Thank you, Dr. Hamaker, for your quick and helpful response. I have attempted to run a Monte Carlo simulation using the code provided in the link you sent. However, I get an error message saying that the population covariance matrix is not positive definite.
Here (and in the next post) is my code, can you identify where I am making an error?

MONTECARLO:
NAMES = x1-x5
y1-y5;
NOBSERVATIONS = 300;
NREPS = 100;

MODEL POPULATION:
! Create two individual factors (random intercepts)
RIx BY x1@1 x2@1 x3@1 x4@1 x5@1;
RIy BY y1@1 y2@1 y3@1 y4@1 y5@1;
! Estimate means of random intercepts
[RIx*0.5];
[RIy*0.4];
! Estimate covariance between the RIs
RIx WITH RIy*.3;
! Estimate variances for RIs
RIx*.1 RIy*.1

Craig Sewall posted on Wednesday, February 26, 2020 - 8:44 am

! Create within-person centered variables
wx1 BY x1@1;
wx2 BY x2@1;
wx3 BY x3@1;
wx4 BY x4@1;
wx5 BY x5@1;

wy1 BY y1@1;
wy2 BY y2@1;
wy3 BY y3@1;
wy4 BY y4@1;
wy5 BY y5@1;

x1-y5@0;

! Estimate the lagged effects between
! the within-person centered variables
wy5 ON wy4*.8 wx4*.3;
wx5 ON wx4*.8 wy4*.3;
wy4 ON wy3*.8 wx3*.3;
wx4 ON wx3*.8 wy3*.3;
wy3 ON wy2*.8 wx2*.3;
wx3 ON wx2*.8 wy2*.3;
wy2 ON wy1*.8 wx1*.3;
wx2 ON wx1*.8 wy1*.3;

wx1 WITH wy1*.5;

wx2 WITH wy2*.2;
wx3 WITH wy3*.2;
wx4 WITH wy4*.2;
wx5 WITH wy5*.2;

My MODEL section looks identical to the above. Thank you for your help!

Ellen Hamaker posted on Wednesday, February 26, 2020 - 10:10 am

I think there are two problems with your code. First, you cannot set the covariance between the random intercepts to 0.3 when you set the variances of the random intercepts to 0.1; that combination would imply a correlation larger than 1. I suggest setting the variances to 1.
Second, you have to specify the variances of the within-person components at wave 1; and the residual variances of the within-person components of all waves from wave 2 and onward. I suggest setting the variances at wave 1 to 1, and at later waves smaller (as part of the variances will be explained through the lagged relations).

Craig Sewall posted on Wednesday, February 26, 2020 - 2:02 pm

This worked! Thanks again for all your help, Dr. Hamaker. I look forward to reading your future work with these kinds of models.

KL posted on Saturday, February 29, 2020 - 4:57 pm

I am new to RI-CLPM and just have a few questions about the model specifications. In particular, I was wondering what 1) constraining observed means per variable over time and 2) fixing the correlation between the random intercepts and the within-person centered variables at the first wave to zero means for the assumptions made by the model? Further, I was wondering how constraining the means and fixing the correlation to zero affect the interpretation of the results?

Ellen Hamaker posted on Tuesday, March 03, 2020 - 6:21 am

Time-invariant group means are attractive because in these models, the random intercepts can be interpreted as individual-specific stable deviations from a constant, rather than as individual-specific stable deviations from a time-varying group mean. When you visualize this, it implies the group mean is a horizontal line over time, and an individual's random intercept is characterized by a horizontal line above or below this line. As stated in Hamaker, Kuiper and Grasman (2015, p. 210): "Models in which the group means do not change over time facilitate interpretation, although time-invariant means are no prerequisite for the models considered here."

Ellen Hamaker posted on Tuesday, March 03, 2020 - 7:03 am

When the measurements start at an arbitrary point in time during an ongoing process, there is no reason to assume that the temporary deviations from the persons' trait scores (i.e., the within-person components) at the first wave are related to these trait scores (i.e., the between-person components, or random intercepts). We need to fix the covariances to zero, as by default Mplus will allow the random intercepts and the within-person components at the first wave to be correlated (since these are all exogenous latent variables).
Note there are other models that are closely related, in which it is actually critical to include the covariances between the stable person-components and the temporary within-person deviation at the first wave (e.g., the Autoregressive Latent Trajectory model by Bollen and Curran, or the Cross-lagged Panel Models with Fixed Effects by Allison, Williams and Moral-Benito). This critically depends on whether the stable person-parts are separated from the within-person dynamics (as is the case in the RI-CLPM), or that the stable between-person parts have indirect effects through the lagged relations in the model. You can read more about this in Usami, Kou and Hamaker (2019). A unified framework of longitudinal models to examine reciprocal relations. Psychological Methods, 24(5), 637-657. http://dx.doi.org/10.1037/met0000210

KL posted on Tuesday, March 03, 2020 - 6:01 pm

Hi Dr. Hamaker,

Thanks so much for the very helpful responses! I ran several RI-CLPM analyses and have found that the chi-square test of model fit is not significant. In contrast, the chi-square test was significant when I just ran the CLPM analyses. I'm wondering if this means that the traditional CLPM is a better fit for the data (i.e., trait-like differences are not critical)?

Ellen Hamaker posted on Wednesday, March 04, 2020 - 2:03 am

As with any SEM analysis, a significant chi-test implies that the model fits significantly worse than the saturated model, whereas a non-significant test implies it does not fit significantly worse. In your case, it implies that the traditional CLPM does not provide an adequate description of the data (the model is rejected), whereas the RI-CLPM seems to describe the data well (that model is not rejected). You can also do a chi-square difference test, as these models are nested (see also the empirical illustration in Hamaker et al., 2015).

KL posted on Wednesday, March 04, 2020 - 8:30 pm

Thank you so much for all your help, Dr. Hamaker!

KL posted on Monday, March 09, 2020 - 1:35 pm

I have been referencing the RI-CLPM and Extensions resource posted here. I just have a quick question about adding time-invariant covariates. I was wondering if the following code would work to regress my observed outcomes on two separate covariates in the same model.

x1-x3 ON z1 (s1);
s1-s3 ON z1 (s2);

x1-x3 ON z2 (s3);
s1-s3 ON z2 (s4);

Aurelie Lange posted on Wednesday, March 11, 2020 - 2:16 pm

Hello,

I ran the Hamaker model today, but came across several problems:

1. THE STANDARD ERRORS FOR H1 ESTIMATED SAMPLE STATISTICS COULD NOT BE COMPUTED. THIS MAY BE DUE TO LOW COVARIANCE COVERAGE.
THE ROBUST CHI-SQUARE COULD NOT BE COMPUTED.
2. THE STANDARD ERRORS OF THE MODEL PARAMETER ESTIMATES MAY NOT BE TRUSTWORTHY FOR SOME PARAMETERS DUE TO A NON-POSITIVE DEFINITE FIRST-ORDER DERIVATIVE PRODUCT MATRIX.....THIS IS MOST LIKELY DUE TO HAVING MORE PARAMETERS THAN THE SAMPLE SIZE.

I expect these warnings to be a result of my small sample size (I have 15 time points and only 47 participants) as they disappear if I include only part of the time points (thereby having less parameters).

I have several questions following this output:

I. Is it correct to assume that problem 1 is also the result of having more parameters than the sample size ?

II. If so, is it ok� to ignore both warnings?
If not, would it be a good solution to include only every second time point, so as to have the whole range, but less parameters?

III. As the chi-square is not computed, I use AIC and BIC to compare model fit between different models (I try to make the model more parsimonious by constraining certain paths to be equal). Is this an appropriate solution?

Thanks for your help!

Kind regards,
Aurelie Lange

Bengt O. Muthen posted on Wednesday, March 11, 2020 - 4:17 pm

We need to see your full output - send to Support along with your license number.

Benjamin Walsh posted on Monday, March 30, 2020 - 10:30 am

In RI-CLPM, is indirect effect (ab): cm2 on cx1 (a path) and cy3 on cm2 (b path)?

MODEL:
RI_x BY x1@1 x2@1 x3@1;
RI_m BY m1@1 m2@1 m3@1;
RI_y BY y1@1 y2@1 y3@1;

cx1 BY x1@1; cx2 BY x2@1; cx3 BY x3@1;
cm1 BY m1@1; cm2 BY m2@1; cm3 BY m3@1;
cy1 BY y1@1; cy2 BY y2@1; cy3 BY y3@1;

x1-y3@0;

cx2 ON cx1 cm1 cy1; cx3 ON cx2 cm2 cy2;
cm2 ON cx1 cm1 cy1; cm3 ON cx2 cm2 cy2;
cy2 ON cx1 cm1 cy1; cy3 ON cx2 cm2 cy2;

cx1 WITH cm1 cy1;
cm1 WITH cy1;

cx2 WITH cm2 cy2; cx3 WITH cm3 cy3;
cm2 WITH cy2; cm3 WITH cy3;

RI_x WITH cx1@0 cm1@0 cy1@0;
RI_m WITH cx1@0 cm1@0 cy1@0;
RI_y WITH cx1@0 cm1@0 cy1@0;

Bengt O. Muthen posted on Monday, March 30, 2020 - 4:50 pm

Longitudinal mediation is a complex topic. You want to read Maxwell-Cole (2007) in Psych Methods and same authors in a 2011 MBR special issue. More has been written more recently but I don't have those refs handy right now.

Adrienne D. Woods posted on Monday, October 05, 2020 - 9:29 am

Dear Drs. Muthen & Hamaker,

I am using RI-CLPM to measure the cross-lagged relations between sleep and academic achievement in ~8,000 children across 3 timepoints (3rd, 4th, 5th grade). The autoregressive paths from T1-T2 and from T2-T3 for academic achievement are significant, which makes sense. However, only the T1-T2 path is significant for sleep, while the T2-T3 coefficient is negligible in size and reverses direction. This does not occur when I run a traditional CLPM (though only the cross-lagged paths from T2-T3 are significant). If this phenomenon only appears in the RI-CLPM and not the CLPM, does this mean that sleep duration at T2 does not affect within-person fluctuations in sleep duration at T3? If this is the case, is the RI-CLPM still appropriate, or should I switch to the CLPM?

Alternatively, might I be encountering a suppressor effect (e.g., Burkholder & Harlow, 2003)? I have ruled out sampling issues, missing data, multicollinearity, and outliers. The autoregressive paths are similar in the RI-CLPM even when I remove the cross-lagged paths from the model. If I follow Burkholder's example, I would design two separate models in which a) either Time 1 sleep constructs predict achievement constructs or b) achievement constructs predict Time 3 sleep duration, before performing a final model-testing procedure to ascertain the �true� direction of causation. Does this make sense to do with the RI-CLPM?

Ellen Hamaker posted on Tuesday, October 06, 2020 - 7:18 am

It is not uncommon to have significant autoregressive paths in a CLPM that become insignificant in the RI-CLPM. The reason for this is that in the CLPM the autoregressive parameters captures the stability of the rank order of individuals, while in the RI-CLPM this stability is attributed to two different sources: Stable between person differences that are captured by the random intercept, and carry-over from one occasion to the next within an individual as captured by the autoregressive parameter. The fact that the latter is zero would not be a reason to abandon the RI-CLPM; it simply implies that the stability from occasion 2 to occasion 3 is captured fully by the random intercept.
A reason for going back to the CLPM would be if the variances of all random intercepts in the model are negligible; that would imply there are no stable between person differences in the data, and in that case there is no reason to use a model that separates between person variance from within person dynamics.
What is important to keep in mind when you consider lagged relations is the time interval between the measurements: The fact that you get autoregressive parameters that are (close to) zero may also simply imply that the intervals between your measurements are rather long for the process you are observing, such that there is no within-person carry-over any more.

Luisa Liekefett posted on Tuesday, October 27, 2020 - 6:31 am

Hello everyone,

I am running a RI-CLPM for a longitudinal data set with 4 time points. Some participants did not complete all time points: there are some who only did t1, some who did t1 and t2, some who did t1, t3 and t4 and so on. Would you recommend to include all participants, and just flag the missing values as missing? Or would you recommend to include only those who did all 4 (or at least 3)?

I noticed that the results are quite different when I include all participants, as when I have only those who completed all 4 measurements.

Thank you in advance.

Bengt O. Muthen posted on Tuesday, October 27, 2020 - 10:42 am

Q1: Yes

Q2: No

The default ML estimation in Mplus uses ML under MAR, also referred to as FIML. This uses all available data. The MAR assumption says that data can be selectively missing if the missingness is predicted by the variables that are observed - primarily the earlier time points in your case. It's the standard assumption in statistics. Even if it is not completely true, evidence suggests that it is much better than the alternative you mention. A discussion of this is given at the end of our Short Course Topic 11 on our web site and also in Chapter 10 of our book Regression and Mediation Analysis Using Mplus.