The simplest varying intercepts/slopes model there is

I’m trying to find the absolute simplest/shortest Stan model (varying slopes and intercepts) to use as an example. This is the example from Ch. 1.13 in the User’s Guide, which I’ve tried to simplify even further:

data {
  int N;                   // num individuals
  int K;                   // num ind predictors
  int J;                   // num groups
  int L;                   // num group predictors
  array[N] int jj;         // group for individual
  matrix[N, K] x;          // individual predictors
  array[J] row_vector[L] u;  // group predictors
  vector[N] y;             // outcomes
}
parameters {
  corr_matrix[K] Omega;    // prior correlation
  vector<lower=0>[K] tau;  // prior scale
  matrix[L, K] gamma;      // group coeffs
  array[J] vector[K] beta;   // indiv coeffs by group
  real<lower=0> sigma;     // prediction error scale
}
model {
  array[J] row_vector[K] u_gamma;
  for (j in 1:J)
    u_gamma[j] = u[j] * gamma;
  beta ~ multi_normal(u_gamma, quad_form_diag(Omega, tau));
  for (n in 1:N)
    y[n] ~ normal(x[n] * beta[jj[n]], sigma);
}
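
In statistical notation, the two sampling statements are

beta[j] ~ MultiNormal(u[j] * gamma, Sigma)    for j in 1:J
y[n] ~ Normal(x[n] * beta[jj[n]], sigma)      for n in 1:N

where the covariance Sigma is built from the correlation matrix Omega and the vector of scales tau.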

I’ve removed the priors and most of the <lower> and <upper> checks.

Can we simplify it even further (but keep the correlation matrix)? Efficiency, validity, etc. are not the issue here; we want to use as few statements as possible.

Any input is much appreciated!

If your only goal is simplicity in terms of expressing the model, I would use a covariance matrix instead of the correlation matrix and scale vector. I can’t really think of another way to simplify this model.

Obviously - I forgot about that - thanks Maurits!

@MauritsM, but then we need a transformed parameters block? Having to introduce that here kind of defeats my aim… :)

No (if I understand correctly). My approach would use a single matrix D that replaces your Omega and tau, as well as the quad_form_diag call. Those three are only ever used to build one covariance matrix out of the individual components.
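
For context, quad_form_diag(Omega, tau) is defined as

diag_matrix(tau) * Omega * diag_matrix(tau)

i.e. the correlation matrix scaled by the standard deviations in tau. Declaring D as a cov_matrix parameter samples that covariance matrix directly, so no transformed parameters block is needed.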

Could you edit my example above and show?

This is what I had in mind

data {
  int N;                   // num individuals
  int K;                   // num ind predictors
  int J;                   // num groups
  int L;                   // num group predictors
  array[N] int jj;         // group for individual
  matrix[N, K] x;          // individual predictors
  array[J] row_vector[L] u;  // group predictors
  vector[N] y;             // outcomes
}
parameters {
  cov_matrix[K] D;    // prior covariance
  matrix[L, K] gamma;      // group coeffs
  array[J] vector[K] beta;   // indiv coeffs by group
  real<lower=0> sigma;     // prediction error scale
}
model {
  array[J] row_vector[K] u_gamma;
  for (j in 1:J)
    u_gamma[j] = u[j] * gamma;
  beta ~ multi_normal(u_gamma, D);
  for (n in 1:N)
    y[n] ~ normal(x[n] * beta[jj[n]], sigma);
}

Does this make sense? By the way, is this simplification for teaching purposes?

Oh oops, I missed the part about keeping the correlation matrix! Apologies, I don’t see any improvements in that case…

Thanks, Maurits, I think your example is clearer. Yeah, you could say it’s for teaching purposes, or at least for giving people quick access to the most common models written in Stan :) I’ll put you in the acknowledgments!

What code would you use for generating data for this model?

Thanks @torkar, although I don’t know if I deserve credit for such a small adaptation of your model :-)

I don’t really have a great example on hand that uses the full structure and all the parameters. I think that after removing all the priors it may be quite difficult to get this model to fit, even on simulated data.

When I have some more time I’ll try to simulate some fake data for this kind of model. Is R code ok for this?

The original model is from the User’s Guide. Sure, R code is great. I’ll see if I can have a look at it tomorrow too :)

Check out the make_data.r file here. I use different terminology, and use a hack to add a between-subjects variable/effect, but it’s the same model.
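
In the meantime, here is a minimal R sketch (not the make_data.r linked above) of one way to simulate data for the covariance-matrix version of the model. The dimensions and parameter values are arbitrary placeholders, and it assumes the MASS package for mvrnorm():

library(MASS)                                 # for mvrnorm()

set.seed(1)
N <- 500; K <- 2; J <- 20; L <- 2

jj <- sample(1:J, N, replace = TRUE)          # group for each individual
x  <- cbind(1, rnorm(N))                      # individual predictors (intercept + slope)
u  <- cbind(1, rnorm(J))                      # group predictors (intercept + slope)

gamma <- matrix(c(1, 0.5, -1, 0.3), L, K)     # group-level coefficients (placeholder values)
D     <- matrix(c(1, 0.3, 0.3, 0.5), K, K)    # covariance of group-level effects
sigma <- 0.5                                  # residual scale

beta <- t(sapply(1:J, function(j) mvrnorm(1, u[j, ] %*% gamma, D)))  # J x K coefficients
y    <- rnorm(N, rowSums(x * beta[jj, ]), sigma)                     # mean is x[n] * beta[jj[n]]

stan_data <- list(N = N, K = K, J = J, L = L, jj = jj, x = x, u = u, y = y)

The stan_data list is laid out to match the data block of the model above.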
