Error message: Initialization between (-2, 2) failed after 100 attempt

dsd353 · June 7, 2021, 3:04pm

Dear Stan developers,

Hi! I am fitting four parameters with Matt trick.
The four parameters are 1) s_dm_early, 2) s_dm_late, 3) s_em_early, 4) s_em_late.
When I ran the following Stan code in R, errors appeared as:

Chain 3:   Log probability evaluates to log(0), i.e. negative infinity.
Chain 3:   Stan can't start sampling from this initial value
Chain 3: Initialization between (-2, 2) failed after 100 attempts. 
Chain 3:  Try specifying initial values, reducing ranges of constrained values, or reparameterizing the model.
[1] "Error in sampler$call_sampler(args_list[[i]]) : Initialization failed."

The Stan code is as follows:

data {
  int<lower=1>                 nSub;
  int<lower=1>                 maxT;
  int<lower=0>                 SigmaBound;
  int<lower=1>                 T[nSub];            
  int<lower=0,upper=1>         Missing_dm[nSub,maxT];
  int<lower=0,upper=1>         Missing_em[nSub,maxT];
  real<lower=0,upper=1000>     st[nSub,maxT];    
  int<lower=-21,upper=1000>    rf[nSub,maxT];   
  real<lower=0,upper=1000>     dur_dm[nSub,maxT];    
  real<lower=0,upper=1000>     dur_em[nSub,maxT];   
  int<lower=0,upper=1000>      dm[nSub,maxT];    
  real<lower=-180,upper=1000>  em[nSub,maxT];    
}

transformed data {
}

parameters {
  vector[4] mu_p;
  vector<lower=0>[4] sigma;
  
  vector[nSub] s_dm_early_pr;
  vector[nSub] s_dm_late_pr;
  vector[nSub] s_em_early_pr;
  vector[nSub] s_em_late_pr;
}

transformed parameters {
  vector<lower=0,upper=SigmaBound>[nSub] s_dm_early;
  vector<lower=0,upper=SigmaBound>[nSub] s_dm_late;
  vector<lower=0,upper=SigmaBound>[nSub] s_em_early;
  vector<lower=0,upper=SigmaBound>[nSub] s_em_late;
  
  for (iSub in 1:nSub) {
    s_dm_early[iSub] = Phi_approx( mu_p[1] + sigma[1] * s_dm_early_pr[iSub]) * SigmaBound;
    s_dm_late[iSub] = Phi_approx( mu_p[2] + sigma[2] * s_dm_late_pr[iSub]) * SigmaBound;
    s_em_early[iSub] = Phi_approx( mu_p[3] + sigma[3] * s_em_early_pr[iSub]) * SigmaBound;
    s_em_late[iSub] = Phi_approx( mu_p[4] + sigma[4] * s_em_late_pr[iSub]) * SigmaBound;
  }
}

model {
  mu_p  ~ normal(0,1);
  sigma ~ normal(0,1);
  
  s_dm_early_pr ~ normal(0,1);
  s_dm_late_pr ~ normal(0,1);
  s_em_early_pr ~ normal(0,1);
  s_em_late_pr ~ normal(0,1);
  
  for (iSub in 1:nSub) {
    int inT = T[iSub];
    for (t in 1:inT) {
      if (Missing_dm[iSub,t]==0){
        
        real ip;
        real is_dm;
        real is_em;
        
        // DM 
        if (dur_dm[iSub,t] == 4.5){
          is_dm = s_dm_early[iSub];
        }
        else {
          is_dm = s_dm_late[iSub];
        }
        ip = normal_cdf(rf[iSub,t], 0, is_dm);
        dm[iSub,t] ~ bernoulli(ip);
        
        // EM
        if (Missing_em[iSub,t]==0){
          if (dur_em[iSub,t] == 4.5){
            is_em = s_em_early[iSub];
          } 
          else {
            is_em = s_em_late[iSub];
          }
          
          if (dm[iSub,t]==0){
            target += normal_lpdf(em[iSub,t] | 0, sqrt(is_dm^2 + is_em^2))
                    + normal_lcdf(em[iSub,t] - (is_dm^2 + is_em^2) * rf[iSub,t] / (is_dm^2)
                                 | 0, is_em);
          }
          if (dm[iSub,t]==1){
            target += normal_lpdf(em[iSub,t] | 0, sqrt(is_dm^2 + is_em^2))
                    + normal_lcdf(-(em[iSub,t] - (is_dm^2 + is_em^2) * rf[iSub,t] / (is_dm^2))
                                   | 0, is_em);
         } 
        }
      }
    }
  }
}

generated quantities {
}

You help will be really grateful to me!

caesoma · June 10, 2021, 3:31pm

Probably nSub is a large number and there are large regions of the parameter space with vanishingly low probability (hence the log(0), i.e. negative infinity). If you know this model can fit the data or got it working at least once, it indicates a problem with finding an initial value, so that’s the (relatively easy) problem to solve.

If you can try making nSub small for testing purposes and see if it manages to initialize the chain, or alternatively, if you can choose initial parameters init (maybe inits in rstan, I’m not sure) arguments passed on to the sampling function that will produce a nonzero likelihood (even if it’s a very bad fit, as long as it’s nonzero HMC will take it from there and move towards higher probability regions of parameter space).

If this doesn’t work it may indicate a problem or bug with the model, but that will require more investigation (and maybe some more detail about what the model is and what the functions in your code are supposed to do).

Topic		Replies	Views
Initialization failure in rstan Modeling	1	542	May 15, 2020
Initialization between (-2, 2) failed after 100 attempts Modeling	4	6024	April 24, 2018
Initialization and log probability problem RStan	1	422	April 8, 2020
Rstan Initialization error Modeling rstan	1	245	May 28, 2024
Debug help Modeling	2	796	November 29, 2017

Error message: Initialization between (-2, 2) failed after 100 attempt

Related topics