Bayesian Fraction of Missing Information was low

Hi,

I am trying to use the blocker example in https://github.com/stan-dev/example-models/tree/master/bugs_examples/vol1/blocker but I get the warning message in the title. The convergence is not great either. Any advice?

Thanks for the help.
CTbinomial1.stan (535 Bytes)


The R code is:
n.draws <- 1000
warmup <- 6000
n.chains <- 4
init <- list()
for (i in 1:n.chains)
  init[[i]] <- list(d = 0, delta = rep(0, N), sigmasq_delta = 1,
                    mu = rep(0, N))
fit <- stan(file = stan.model.file, data = model,
            warmup = warmup, iter = n.draws + warmup,
            chains = n.chains,
            pars = pars, init = init, open_progress = TRUE,
            control = list(adapt_delta = 0.999, stepsize = 0.001,
                           max_treedepth = 20))

Help us reparameterize those simply translated BUGS models. We’ve been meaning to do it for ages. This particular example, blocker.stan, is very badly coded given what we know now.

  1. Those inverse-gamma priors are terrible. Use something at least weakly informative that is consistent with zero.

  2. The normal priors are bad, but not as harmful as the inverse gamma. Use something weakly informative.

  3. The hierarchical part of the model uses a centered parameterization—the critical change will be to make that non-centered.
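For point 3, the non-centered trick looks roughly like this (a sketch with hypothetical names, not the blocker code itself):

```stan
parameters {
  real d;                      // population mean
  real<lower=0> sigma_delta;   // population scale
  vector[N] delta_raw;         // standardized effects
}
transformed parameters {
  // non-centered: the group effects are a deterministic transform
  // of standard-normal draws, which decorrelates them from sigma_delta
  vector[N] delta = d + sigma_delta * delta_raw;
}
model {
  delta_raw ~ normal(0, 1);    // implies delta ~ normal(d, sigma_delta)
}
```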

Thanks. I will do that and let the forum know about the results.


As a heads-up, we kept going back-and-forth about whether we should try to implement the same bad model as was being used in BUGS, but in the end, we decided we should just apply Stan best practice. That would include putting a prior directly on sigma rather than on sigma_sq, for example.
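Concretely, the difference looks something like this (a sketch, not the exact code from the repo):

```stan
parameters {
  real<lower=0> sigma_delta;   // prior goes on the scale, not the variance
}
model {
  // weakly informative half-normal on sigma itself
  // (the <lower=0> constraint makes normal(0, 1) a half-normal),
  // instead of the BUGS-style inv_gamma(0.001, 0.001) on sigma_sq
  sigma_delta ~ normal(0, 1);
}
```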


This one did amazingly well on my data.

a. I am not completely sure that the priors for mu are good enough, since the 95% credible region spans from -24 to 4.

b. The priors on d and sigma_delta might be too informative as well, based on the chart below.

c. I also wonder whether it makes sense to estimate nu instead of fixing nu = 4 in delta ~ student_t(4, 0, 1);
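On (c): one option is to give nu a lower bound and a weakly informative prior; gamma(2, 0.1) is a commonly suggested choice. A sketch, not tested on this data:

```stan
parameters {
  real<lower=1> nu;   // degrees of freedom, estimated rather than fixed at 4
}
model {
  nu ~ gamma(2, 0.1);            // weakly informative, favors moderate tails
  delta ~ student_t(nu, 0, 1);
}
```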

[chart attachment]

CTbinomial2.stan (445 Bytes)

I’m just pasting this in here:

data {
  int<lower=0> N;       // number of studies
  int<lower=0> nt[N];   // treatment-group sample sizes
  int<lower=0> rt[N];   // treatment-group event counts
  int<lower=0> nc[N];   // control-group sample sizes
  int<lower=0> rc[N];   // control-group event counts
}
parameters {
  real d;                      // mean treatment effect (log odds ratio)
  real<lower=0> sigma_delta;   // between-study scale
  vector[N] mu;                // per-study baseline log odds
  vector[N] delta;             // standardized study effects (non-centered)
}
model {
  rt ~ binomial_logit(nt, mu + d + sigma_delta * delta);
  rc ~ binomial_logit(nc, mu);
  delta ~ student_t(4, 0, 1);
  mu ~ normal(0, 10);
  d ~ normal(0, 10);
  sigma_delta ~ student_t(4, 0, 1);
}

Go non-centered parameterization! This is why slight parameterization changes can make a huge difference. And just for the record, this isn't just for Stan; you get the same advantages from reparameterizing with Gibbs samplers like BUGS/JAGS.
