Divergences during prior predictive check: why and how can the transformed parameters affect them?

anthony_462 · January 26, 2024, 10:36am

When running my model for prior predictive check, I ended up with a substantial amount of divergences (about 40%). However, I noticed that when I removed the ‘transformed parameters’ section, there was no divergence anymore. It seems that the way the parameters are linked in the expression contained in the ‘transformed parameters’ section induced some divergences. This is suprising to me because I naively thought that the sole purpose of the ‘transformed parameters’ section was to transform the parameters so that they can be used in the likelihood function. But as there is no likelihood here (as it is a prior predictive check), I thought they didn’t play any role.

Note that the model is a minimally reproducible example which contains expression that makes little sense mathematically (e.g., the “inv_logit(logit())” part).

Model

// load data objects
data {
  int N_x;//N_data/N_age
  array[N_x] real x;
}

parameters {
  real <lower=0,upper=1> nu;
  real<lower=0> delta;
}

transformed parameters {
  array[N_x] real theta;
  for(i in 1:N_x){
    theta[i] = inv_logit(logit(nu * x[i]^delta));
  }
}

model {
  nu ~ uniform(0,1);
  delta ~ exponential(1);
}

R script

data:
x.RData (19.2 KB)

#example
wd=getwd() #wd=paste0(getwd(),"/debugging/20240126_ppc_divergence/") #save(x,file=paste0(wd,"x.RData"))
library(cmdstanr)
load(paste0(wd,"x.RData"))
data_list = list(x=x,N_x=length(x))
fit <- mod2$sample(data = data_list,
                   chains = 4,
                   parallel_chains = 4,
                   seed = 1:4,
                   iter_warmup = 500,
                   iter_sampling = 300,
                   refresh = 40)
fit$diagnostic_summary()

Here is a bivariate plot of the two parameters, with divergences in red.

nhuurre · January 26, 2024, 5:25pm

That’s correct but the section still runs even if there results aren’t used in the model block. And if the transform fails, the sampler rejects the sample, so you get a divergence. Your inv_logit(logit(...)) transform requires the input to be between 0 and 1; it works if x < 1 but judging from the plot, the largest x in your data set is about 1.23.

Topic		Replies	Views
Help with my divergent transitions, part 999 Modeling	9	1053	May 6, 2021
Help reparameterize GP model to remove divergent transitions Modeling rstan , techniques , fitting-issues , performance	33	1748	February 22, 2022
Choosing correct non-centered parametrization Modeling techniques , specification	9	740	October 2, 2020
Understanding the cause of divergent transitions (no apparent funnel behavior) Modeling	4	1134	January 22, 2020
Looking for an advice regarding divergent transitions General rstan , fitting-issues , performance	7	583	October 1, 2020

Divergences during prior predictive check: why and how can the transformed parameters affect them?

Related Topics