Error: Chain 1: Rejecting initial value: mu[1] is nan, but must be greater than or equal to 0

mitzimorris · June 30, 2020, 6:59pm

That’s the ICAR prior - pairwise distance - and it’s not identifiable - you can add any constant you want to sai and it washes out of the pairwise difference. adding a sum-to-zero constraint fixes this problem. we recommend a soft sum-to-zero constraint.

You can code up the ICAR prior as a function in Stan as follows:

functions {
  real icar_normal_lpdf(vector phi, int N, int[] node1, int[] node2) {
    return -0.5 * dot_self(phi[node1] - phi[node2])
      + normal_lpdf(sum(phi) | 0, 0.001 * N);
 }
}

When used in a sampling statement, you call it as follows:

sai ~ icar_normal(N, node1, node2)

Nazanin · June 30, 2020, 8:32pm

Many thanks for your comments @mitsimorris. Yes, I added it to the codes before but forgot to write it here. soft sum-to-zero constraint on sai in codes is:

target += -0.5 * dot_self(sai[node1] - sai[node2]); print("target = ", target());
  sum(sai) ~ normal(0, 0.001 * N);

But I think the problem is something else and I wonder what it is.

mitzimorris · June 30, 2020, 9:21pm

it looks like the BYM2 part is OK.
the complexity of your model and the fact that I haven’t yet implemented a spatio-temporal model
make it hard to see what’s wrong.
the standard debugging advice is to simplify - could you set up a basic spatio-temporal regression based on your code and share?

Nazanin · July 1, 2020, 6:26am

Thanks a lot for your reply @mitsimorris. I wrote the code more simply by distributing gamma. The result is the same as before! The codes:

data {
  
  int<lower=0> N;     
  int <lower=0> K;  
  
  int<lower=1> N_edges;
  int<lower=1, upper=N> node1[N_edges];  
  int<lower=1, upper=N> node2[N_edges];  
  real<lower=0.00001> scaling_factor; 

  vector [N] y;    
  matrix [N, K] X; 

 int <lower=1> S;     // Spaces
  int <lower=1> TJ;    // Times
}

transformed data {
  for (i in 1:N_edges) {
    if (node1[i] >= node2[i])
       reject("edge pair i=", i, ", node1[i] must be less than node2[i]");
  }
}

parameters {
  
  //distribution
  real<lower=0> inverse_phi; //the variance parameter

  //BYM2 model
  vector[N] theta;       
  vector[N] sai;         
  real<lower=0> sigma;        
  real<lower=0, upper=1> rho; 
  
	//Reg model
	vector[K] beta;
  
  // Random walk
  real<lower=0> sigmaRw;
  real<lower=0> sigmaR;
}


transformed parameters {
  
  vector[N] convolved_re;
  vector<lower=0>[N] mu;
  vector[N] gama; //rate parameter for the gamma distribution
	vector [TJ] Rw1;
	vector [TJ] R2; 
	
  convolved_re =  sqrt(1 - rho) * theta + sqrt(rho / scaling_factor) * sai;
	
	for (i in 1:S) {
		for (j in 1:TJ){
		                               
      mu[i*TJ-TJ+j]= exp(X[i*TJ-TJ+j, ] * beta + convolved_re[i] * sigma + Rw1[j] + R2[j] );
			}
   	}
   	
  gama = rep_vector(inverse_phi, N) ./ mu;
}  

model {

  inverse_phi ~ exponential(1); // prior on inverse dispersion parameter
  beta[1] ~ cauchy(0,10); 
  beta[2:] ~ cauchy(0,2.5);  
  theta ~ normal(0, 1);
  sigma ~ normal(0, 1);
  rho ~ beta(3, 3);  
  sigmaRw ~ gamma (1, 0.001);
    Rw1[1] ~ uniform(0,1) ;
  for (j in 2:TJ) {
   Rw1[j] ~ normal(Rw1[j-1], 1/sigmaRw);
}
  sigmaR ~ gamma (1, 0.001);
  R2 ~ normal(0, 1/sigmaR);

  
  target += -0.5 * dot_self(sai[node1] - sai[node2]); 
 sum(sai) ~ normal(0, 0.001 * N);
  
  y ~ gamma(inverse_phi,gama);
  
}

the error is at line 49 which is a constraint on mu:

vector<lower=0>[N] mu;

this is the error (as the same):

Chain 1: Rejecting initial value:
Chain 1:   Error evaluating the log probability at the initial value.
Chain 1: Exception: validate transformed params: mu[1] is nan, but must be greater than or equal to 0  (in 'model2d587ac1394c_spatialtemporal_gamma' at line 49)

Chain 1: 
Chain 1: Initialization between (-2, 2) failed after 100 attempts. 
Chain 1:  Try specifying initial values, reducing ranges of constrained values, or reparameterizing the model.
[1] "Error in sampler$call_sampler(args_list[[i]]) : Initialization failed."
[1] "error occurred during calling the sampler; sampling not done"

Thank you, I really appreciate your help.

nhuurre · July 1, 2020, 7:39am

I see that mu has length N but the loops assign to the first S*TJ places. I don’t know if those are equal but the error is what I’d expect if S*TJ=155 and N>155.

Nazanin · July 1, 2020, 8:21am

Tank you @nhuurre. No it’s not true. S=31 and TJ=8 and N=S*TJ=248 which is the sample size. It is still unknown where the number 155 came from!

nhuurre · July 1, 2020, 8:31am

Well, 155=31\times 5 so first NaN at index 156 looks a bit like j=6 is failing–but the loops are the other way around so 156 actually corresponds to i=20,j=4 and all i<20 must work for any j.

Nazanin · July 1, 2020, 9:18am

Great, many thanks @nhuurre. It really helped me find the problem. I had a mistake in specifying data in R before running the code, and set TJ=5! I modified the code to TJ=8, and also I changed constraint on lambda from 0.1 to:

vector<lower=0.000001>[N] lambda;

the error changed to this:

...
target = -inf
target = -inf
target = -inf
target = -inf
target = -inf
Chain 1: Rejecting initial value:
Chain 1:   Log probability evaluates to log(0), i.e. negative infinity.
Chain 1:   Stan can't start sampling from this initial value.
Chain 1: Rejecting initial value:
Chain 1:   Error evaluating the log probability at the initial value.
Chain 1: Exception: validate transformed params: gama[193] is 6.93183e-007, but must be greater than or equal to 1e-006  (in 'model2d585aa36987_BESAG_temporal_RW' at line 75)

Chain 1: Rejecting initial value:
Chain 1:   Error evaluating the log probability at the initial value.
Chain 1: Exception: validate transformed params: lambda[40] is 1.97203e-007, but must be greater than or equal to 1e-006  (in 'model2d585aa36987_BESAG_temporal_RW' at line 73)

Chain 1: target = -inf
target = -inf
target = -inf
target = -inf
target = -inf
target = -inf
target = -inf
...
target = -inf
target = -inf
target = -inf
target = -inf
target = -inf

Chain 1: Rejecting initial value:
Chain 1:   Log probability evaluates to log(0), i.e. negative infinity.
Chain 1:   Stan can't start sampling from this initial value.
Chain 1: 
Chain 1: Initialization between (-2, 2) failed after 100 attempts. 
Chain 1:  Try specifying initial values, reducing ranges of constrained values, or reparameterizing the model.
[1] "Error in sampler$call_sampler(args_list[[i]]) : Initialization failed."
[1] "error occurred during calling the sampler; sampling not done"

Lambda, alpha and gama are parameters of the distribution. The problem seems to be the constraint on lambda and gama. The constraint for these parameters are:

transformed parameters {
  
  vector[N] convolved_re;
  vector<lower=0>[N] mu;
  vector<lower=0.000001>[N] lambda;
  vector<lower=0.000001>[N] alpha;
  vector<lower=0.000001>[N] gama;
...

Nazanin · July 1, 2020, 9:35am

lambda is parameter of Poisson distribution, alpha and gama are parameters of Gamma distribution. for:

vector<lower=0>[N] lambda;

the code still didn’t work

Topic		Replies	Views
Error rejecting initial value but have looked at lower bounds of priors Modeling rstan , fitting-issues , specification	11	680	May 2, 2020
Divergent transitions & BFMI low in a state-space model Modeling	6	1064	August 17, 2017
Location parameter is nan, but must be finite! Modeling	8	7681	October 21, 2019
Error evaluating the log probability at the initial value Modeling	8	1269	September 2, 2020
Initial value rejected Modeling	21	8955	January 31, 2023

Error: Chain 1: Rejecting initial value: mu[1] is nan, but must be greater than or equal to 0

Related topics