Different results with poisson_log() and poisson_log_glm

FJCC · December 10, 2019, 3:29pm

I am using rstan (downloaded this week) to fit a toy data set with 200 rows, three predictors and a response generated from R’s rpois function. I gave the predictors coefficients of 0.3, -0.15, and 0.2, and the intercept is 1.4. I fit the data with the model below using either poisson_log or poisson_log_glm. The poisson_log version reports a Total time of 0.5s, Rhats of 1.000 and neff of 2000 to 4000. The version using poisson_log_glm reports a Total time of 11s, Rhats > 140, neff of 2 and a warning that “There were 3997 transitions after warmup that exceeded the maximum treedepth.” The treedepth limit was set to 10. Increasing max_treedepth to 15 results in a similar warning after a longer run time. The treedepth in the poisson_log version does not exceed 5.

I strongly suspect I am doing something very silly but I cannot see it. Any ideas why the performance is so different?

In case it matters, my R version and system are:
R version 3.5.2 (2018-12-20)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: Linux Mint 18.3

data {
  int N; //number obs
  int K; //number of columns in the model matrix
  int y[N]; //response
  matrix[N,K] X;
}

parameters {
  real alpha;
  vector[K] beta; //the regression parameters
}
model {  
  alpha ~ normal(0,1.5);
  
  beta ~ normal(0,1);
  
  //y ~ poisson_log(alpha + X * beta); //version 1
  y ~ poisson_log_glm(X, alpha, beta); //version 2
}

aleshing · December 10, 2019, 5:06pm

I’ve had a similar problem. I resolved it by directly updating the target:

target += poisson_log_glm_lpmf(y | X, alpha, beta);

rather than using the sampling statement. I think the sampling statement has a bug.

maxbiostat · December 10, 2019, 5:27pm

@bgoodri is this an RStan-specific issue or is it a problem with math?

FJCC · December 10, 2019, 5:34pm

@aleshing - Yes, that fixed the problem. Thank you! I searched for an issue on github but did not find one. Does one exist?

avehtari · December 10, 2019, 5:37pm

This is the first time I have seen this reported. It is most likely a math problem and not an RStan problem. Please make an issue (I’ll be offline for the next 14 hours).

FJCC · December 10, 2019, 6:08pm

I opened issue #722 in the rstan section.

bbbales2 · December 10, 2019, 6:09pm

Make one if you don’t see one. If it’s an accidental duplicate, it’s fine, we can clean it up. Post the models and such.

I think the error is here: https://github.com/stan-dev/math/blob/develop/stan/math/prim/mat/prob/poisson_log_glm_lpmf.hpp#L109

include_summand<propto, T_partials_return>::value should be include_summand<propto, T_x, T_alpha, T_beta>::value
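
As a rough illustration of why the template arguments matter, here is a simplified stand-in for include_summand. It is not Stan Math's actual code: `var` and `is_constant` below are placeholders written for this sketch, and it assumes T_partials_return resolves to double. Under those assumptions, a term guarded by include_summand<propto, T_partials_return> is dropped whenever propto is true, even when x, alpha or beta are parameters.

```cpp
#include <type_traits>

// Hypothetical placeholder for an autodiff (parameter) type.
struct var {};

// In this sketch, plain arithmetic types count as constants.
template <typename T>
struct is_constant : std::is_arithmetic<T> {};

// A term is kept when propto is false, or when at least one of the
// listed types is not a constant.
template <bool propto, typename... Ts>
struct include_summand;

// No types listed: the term is kept only when propto is false.
template <bool propto>
struct include_summand<propto> {
  static constexpr bool value = !propto;
};

// Check the first type, then recurse over the remaining ones.
template <bool propto, typename T, typename... Ts>
struct include_summand<propto, T, Ts...> {
  static constexpr bool value = !propto || !is_constant<T>::value
                                || include_summand<propto, Ts...>::value;
};

// Guarding on a double-only type list drops the term under propto = true...
static_assert(!include_summand<true, double>::value,
              "constant-only term is dropped");
// ...while listing the argument types keeps it if any of them is a parameter.
static_assert(include_summand<true, double, var, var>::value,
              "parameter-dependent term is kept");
// With propto = false nothing is dropped.
static_assert(include_summand<false, double>::value, "all terms are kept");
```
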

The two specifications compile to different C++. The ~ syntax calls:

lp_accum__.add(poisson_log_glm_lpmf<propto__>(y, X, alpha, beta));

and the target += syntax calls:

lp_accum__.add(poisson_log_glm_lpmf<false>(y, X, alpha, beta));
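
For context, the propto flag is meant to let a density drop terms that do not depend on any parameter. The sketch below is plain C++ written for this thread, not Stan Math code, and the function names are made up. It assumes the usual Poisson log pmf with log rate theta, y*theta - exp(theta) - log(y!), and shows that leaving out log(y!) shifts the value by the same amount for every theta.

```cpp
#include <cmath>

// Full log pmf of a Poisson count y with log rate theta:
// y * theta - exp(theta) - log(y!)
double poisson_log_full(int y, double theta) {
  return y * theta - std::exp(theta) - std::lgamma(y + 1.0);
}

// The same density up to an additive constant: log(y!) is omitted
// because it does not involve theta.
double poisson_log_propto(int y, double theta) {
  return y * theta - std::exp(theta);
}

// Gap between the two versions; it should equal log(y!) for any theta.
double dropped_constant(int y, double theta) {
  return poisson_log_propto(y, theta) - poisson_log_full(y, theta);
}
```

Dropping only that constant leaves the posterior unchanged. Dropping a term that involves theta would not.
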

Not sure if this affects any of the other GLMs.

avehtari · December 11, 2019, 3:13pm

Just adding here that this has been confirmed as a bug.

mcol · January 9, 2020, 5:01pm

Thanks to @bbbales2 and @rok_cesnovar, this has been fixed in math and will appear in the upcoming release. :)
