Reduce_sum() no time saving for multilevel model

George_Seah · December 24, 2020, 12:13am


I have a multilevel regression model (900+ groups and 50+ observations per group).
I am trying to use reduce_sum() to speed up within-chain sampling. However, in my testing so far it gives no performance gain. I have tried slicing over the group index, the predictors (X1, X2), and Y, and none of these runs any faster than a single-core run.

Any idea what the best way is to use reduce_sum() on a multilevel model?

Below is the code that slices over the group index. I have tried the same with X1, X2, and Y, but did not see any performance gain.

// do reduce_sum()
functions {
  real partial_sum(int[] group_slice, int start, int end,
                   vector Y, vector X4, vector X1,
                   vector X2, vector X3,
                   vector alpha, vector theta, vector gamma, vector delta, vector beta,
                   vector sigma) {
    // group_slice holds Group[start:end], so it picks each row's group-level parameters
    return normal_lpdf(Y[start:end] | alpha[group_slice]
                                      + theta[group_slice] .* X4[start:end]
                                      + gamma[group_slice] .* X1[start:end]
                                      + delta[group_slice] .* X2[start:end]
                                      + beta[group_slice] .* X3[start:end],
                       sigma[group_slice]);
  }
}


data {
  int<lower=0> N;      // No. of obs
  int<lower=0> J;      // No. of Groups
  vector[N] Y;   // outcome
  vector[N] X1; // predictor 1
  vector[N] X2;  // predictor 2   
  vector[N] X3;    // predictor 3  
  int Group[N];        // grouping
  vector[N] X4;      // predictor 4
}

transformed data {
  real meanY = mean(Y);
  int grainsize = 3000;  // for reduce_sum; I have about 13000+ data points, 255 groups, 50+ observations per group
  // int seq[N] = rep_array(1,N);
}

parameters {
  vector[J] alpha;           // Intercept
  vector[J] gamma;           // Slope on X1
  vector<lower=0>[J] delta_raw;  // Slope
  vector<lower=0>[J] beta_raw;   // Slope
  vector<lower=0>[J] sigma;
  real<lower=0,upper=2> tau;         // Rate of the exponential prior on beta_raw
  real<lower=0,upper=2> phi;
  
  vector<lower=0>[J] theta_raw;   
}

transformed parameters {
  vector<upper=0>[J] beta = - beta_raw;
  vector<upper=0>[J] delta = - delta_raw;
  
  vector<upper=0>[J] theta = - theta_raw; 

}

model {
  beta_raw ~ exponential(tau);    
  tau ~ normal(log(2),0.03);
  alpha ~ normal(meanY,1);
  gamma ~ std_normal();           
  delta_raw ~ exponential(phi);
  phi ~ normal(0,10);
  sigma ~ exponential(2);        
  
  theta_raw ~ normal(0,10); 

  target += reduce_sum(partial_sum, Group, grainsize,Y, X4, X1,X2,X3,alpha,theta,gamma,delta,beta,sigma );
}

wds15 · December 24, 2020, 8:59am

You should pack all the group-specific parameters into an array and then slice over that array with reduce_sum. That is far more efficient.

Other than that, the normal lpdf is hard to speed up, since it is cheap to calculate. You should also use the normal identity-link GLM function (normal_id_glm) for a further speed-up.
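
A minimal sketch of what these two suggestions could look like together. It is an illustration, not code from the thread. It assumes the rows are sorted by group and that two extra data arrays, here given the hypothetical names group_start and group_end, hold the first and last row of each group. The name partial_sum_groups and the packing order are also made up for the example. It keeps the array syntax used in the question; newer Stan releases write these declarations as array[...] instead.

```stan
functions {
  // Partial sum over a slice of groups. Each element of par_slice holds one
  // group's parameters packed as (alpha, theta, gamma, delta, beta, sigma).
  real partial_sum_groups(vector[] par_slice, int start, int end,
                          vector Y, matrix X,
                          int[] group_start, int[] group_end) {
    real lp = 0;
    for (j in start:end) {
      vector[6] p = par_slice[j - start + 1];
      int s = group_start[j];  // first row of group j
      int e = group_end[j];    // last row of group j
      // GLM form of the likelihood for this group's rows only
      lp += normal_id_glm_lpdf(Y[s:e] | X[s:e, :], p[1], p[2:5], p[6]);
    }
    return lp;
  }
}
data {
  int<lower=0> N;
  int<lower=0> J;
  vector[N] Y;
  vector[N] X1;
  vector[N] X2;
  vector[N] X3;
  vector[N] X4;
  // Hypothetical inputs: rows are assumed to be sorted by group
  int<lower=1, upper=N> group_start[J];
  int<lower=1, upper=N> group_end[J];
}
transformed data {
  real meanY = mean(Y);
  // Column order must match the packed slopes: theta, gamma, delta, beta
  matrix[N, 4] X = append_col(append_col(X4, X1), append_col(X2, X3));
  int grainsize = 1;  // assumption: now counted in groups, not observations
}
parameters {
  vector[J] alpha;
  vector[J] gamma;
  vector<lower=0>[J] delta_raw;
  vector<lower=0>[J] beta_raw;
  vector<lower=0>[J] theta_raw;
  vector<lower=0>[J] sigma;
  real<lower=0, upper=2> tau;
  real<lower=0, upper=2> phi;
}
model {
  // Pack the group-specific parameters so reduce_sum slices over them
  vector[6] pars[J];
  for (j in 1:J) {
    pars[j] = [alpha[j], -theta_raw[j], gamma[j],
               -delta_raw[j], -beta_raw[j], sigma[j]]';
  }

  // Priors as in the original model
  beta_raw ~ exponential(tau);
  tau ~ normal(log(2), 0.03);
  alpha ~ normal(meanY, 1);
  gamma ~ std_normal();
  delta_raw ~ exponential(phi);
  phi ~ normal(0, 10);
  sigma ~ exponential(2);
  theta_raw ~ normal(0, 10);

  target += reduce_sum(partial_sum_groups, pars, grainsize,
                       Y, X, group_start, group_end);
}
```

With this layout each chunk only receives the parameters of the groups it works on, rather than full copies of every parameter vector. Note that grainsize is now in units of groups, so a value like 3000 would likely leave all groups in a single chunk; it would need to be re-tuned.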
