Linear excess risk model

SreenathMadathil · March 31, 2018, 3:50pm

I am trying to fit a linear excess risk model as mentioned here:
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3139969/

Briefly, this is the model:

   odds = exp(alpha)*(1+b1*x1+b2*x2+b3*x1*x2)
pr(y=1) = odds/(1+odds)

Stan model:

data{
  int N;               //sample size
  int ncol;          //number of columns of data
  matrix[N,ncol] X;   //matrix of data
  int<lower=0, upper=1> y[N]; //outcome
}
parameters{
  real a;
  vector[ncol] b;
}
transformed parameters{
  vector[N] xb1;
  vector<lower=0>[N] odds;                       //odds cannot be negative
  vector<lower=0,upper=1>[N] pr;             
  
  xb1 = X*b;
  odds = exp(a)*(1+xb1);

  for(n in 1:N){
    pr[n] = odds[n]/(1+odds[n]);
  }
}
model{
  a ~ cauchy(0,2.5);
  for(i in 1:ncol){
    b[i] ~ cauchy(0,2.5);
  }
  y ~ bernoulli(pr);
}
generated quantities{
  vector[N] yrep;
  for(n in 1:N){
    yrep[n] = bernoulli_rng(pr[n]);
  }
}

set.seed(4507907)

N <- 800
x1 <- rpois(N,10)
x2 <- rbinom(N,1,0.5)
x1x2 <- x1*x2

b <- c(-3,2,2,1)

odds <- exp(b[1])*(1+ b[2]*x1 + b[3]*x2 + b[4]*x1x2)
p <- odds/(1+odds)

y <- rbinom(N,1,p)
table(y)

X = cbind(x1,x2,x1x2)

dat = list(N=N, X=X, y=y, ncol=ncol(X))

fit <- stan("linOdds.stan",
            data=dat, control = list(adapt_delta=0.99, max_treedepth=14))

Tested it with a simulated data. However, many divergent transitions.
When I truncated the priors to be non-negative (see below).

parameters{
  real a;
  vector<lower=0>[ncol] b;
}

No divergent transitions, posterior looks okay.

Inference for Stan model: linOdds.
4 chains, each with iter=2000; warmup=1000; thin=1; 
post-warmup draws per chain=1000, total post-warmup draws=4000.
      mean se_mean    sd  2.5%   25%   50%   75% 97.5% n_eff Rhat
a    -3.05    0.03  0.85 -4.67 -3.61 -3.07 -2.48 -1.42   770 1.00
b[1]  3.38    0.26  5.56  0.34  1.19  2.20  3.94 11.95   474 1.01
b[2]  5.05    1.02 19.95  0.11  0.96  2.23  4.54 21.46   382 1.01
b[3]  1.57    0.06  1.85  0.06  0.49  1.05  2.00  6.02  1115 1.00

However, the pairs plots show some interesting shapes

Also there could be negative effects in real data examples.

The issue is the coefficients can be negative in real data.
So I guess my question is could any one suggest a reparameterisation?
To avoid divergent transitions?

Do I need to do something with the target distribution?

Thank you in advance.

jonah · April 7, 2018, 4:34am

Sorry for the slow response. Have you tried modeling on the logit scale? That is, using the log odds instead of odds. Stan has a special bernoulli_logit distribution that might be helpful.

SreenathMadathil · April 8, 2018, 4:23pm

Hi Jonah,

Thank you for the reply.
May be I am not understanding something, but if I model it on logit scale it would be simillar to logistic regression.

I found the soultion as suggested by: Haitao Chu et.al here https://insights.ovid.com/pubmed?pmid=21228700

There needs to be two more constrains

  real<lower=-1> b1;
  real<lower=-1> b2;
  real<lower=-(1+b1+b2)> b3;

Thank you again.

Sreenath

jonah · April 8, 2018, 6:47pm

Glad you found the solution. I suggested the log-odds because fitting a Bernoulli model is often more stable numerically using bernoulli_logit than bernoulli, but that may not make sense here (only had a chance to take a brief look). I haven’t had a chance to look into the paper you linked to but if it provides a solution that results in good sampling then that’s great.

Topic		Replies	Views
Divergent transitions Modeling	12	998	July 17, 2019
Divergent transitions in hierarchical logit Modeling	5	779	September 30, 2019
Physically-based model: How to parameterize program such that one variable must be larger than another Modeling	19	3291	March 26, 2018
High number of divergent transitions in ODE model with explicit constraints Modeling ecology , population-dynamics	55	3427	June 5, 2018
Regime Switching Model Modeling	4	1706	June 15, 2018

Linear excess risk model

Related topics