Hierarchical Categorical Logit with large predictor matrix fails to initialise

bbbales2 · February 4, 2020, 6:46pm

This is the way I’d do random effects in Stan.

vector[N] mu = X * B; // fixed effects
for(I in 1:N){
  mu[l] = mu[l] + theta[rel_id[l]]; // add on random effects
  y[l] ~ N(mu[I],sigma);
}

And non-center the thetas:

for(j in 1:Nj){
  theta[j] = theta_z[j] * sigma_theta;
}

Where theta is defined at the top of the model block and theta_z is a parameter. This mostly works well.

I think you can definitely code the random effects with a matrix like that, but then you’d need to be careful to use sparse matrix vector products to make it efficient. I’m not familiar with standardizing the matrix like that.

I asked @jonah/@Bob_Carpenter about the multinomial vs. Poisson thing. They said @bgoodri at some point thought the Poisson trick would be good, but I don’t know if anyone has actually tried.

So no solid advice there. Best we can say is try both things and see what happens. If you get a chance to compare them both, we’d be curious to hear how it works out.

Topic		Replies	Views
Failure to start because of initial values Modeling	16	3489	July 31, 2017
Initialization failed, initial values rejected Modeling	10	2379	October 17, 2018
RuntimeError: Initialization failed in categorical logit model PyStan	2	486	December 19, 2020
Initialization between (-2, 2) failed after 100 attempts Modeling	4	5998	April 24, 2018
Initialization between (-2, 2) failed after 100 attempts in Bass model fitting Modeling	3	692	March 14, 2020

Hierarchical Categorical Logit with large predictor matrix fails to initialise

Related topics