When can I trust results from a short run?

ignacio · July 16, 2019, 1:04pm

I recently ran a model for only 200 iterations per chain, and the only warning that Stan gave me was about exceeding the maximum tree depth. I was not very concerned about this warning because I understand it to be an efficiency concern rather than a validity one. Moreover, the effective sample size for the parameter of interest was over 100. So I was confident that things would work fine when I ran the same model for 2000 iterations per chain, with adapt_delta=0.99 and max_treedepth=15. Alas, I woke up to 4k DTs (divergent transitions). This brings me to my question: When can I trust results from a short run?

sakrejda · July 16, 2019, 5:14pm

You need to observe the longer runs first and see them complete successfully and consistently. Then it’s OK to cut back on post-warmup iterations until you get to your desired smaller effective sample size. If you shorten warmup, I would suggest revalidating.
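As a rough illustration of the "cut back on post-warmup iterations" step, here is a minimal sketch in Python. It assumes that effective sample size grows roughly in proportion to the number of post-warmup draws, which is only an approximation. The function name, its arguments, and the `safety` margin are hypothetical and are not part of Stan or any Stan interface.

```python
import math


def draws_per_chain_for_target_ess(ess_long, total_draws_long, target_ess,
                                   n_chains=4, safety=1.5):
    """Estimate post-warmup draws per chain needed for a target ESS.

    Assumption (not from the thread): ESS scales about linearly with the
    number of post-warmup draws, so the ESS-per-draw ratio observed in a
    validated long run carries over to a shorter run.
    """
    if ess_long <= 0 or total_draws_long <= 0:
        raise ValueError("ess_long and total_draws_long must be positive")
    # Total draws needed across all chains, inflated by a safety margin.
    total_needed = safety * target_ess * total_draws_long / ess_long
    # Spread the total over the chains, rounding up.
    return math.ceil(total_needed / n_chains)


# Hypothetical numbers: a long run of 4 chains x 1000 draws gave ESS = 800
# for the parameter of interest, and an ESS of about 100 would be enough.
print(draws_per_chain_for_target_ess(800, 4000, 100))
```

The estimate only makes sense after the long runs have completed cleanly, and the shortened run should still be checked for divergences, Rhat, and ESS.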

mariel · July 16, 2019, 7:16pm

Thanks @sakrejda! In some cases, though, each iteration takes a long time. Are there instances when it’s OK to trust results from a short run (assuming no DTs, big effective sample sizes, good Rhats)? If not, how long is long enough? @shira, any thoughts?

sakrejda · July 18, 2019, 10:23am

In many models you find the main mass of the distribution within a hundred (?) iterations and get decent sampling efficiency soon after, but larger sample sizes are what give you confidence that the sampler is mixing, that it is not going to get stuck in a corner, that multiple chains are sampling from the same distribution, etc. I know you can make that search wider (more parallel short chains), but it’s limited by the sample size required to compare samples (for Rhat/ESS/etc.).
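To make the "compare samples" point concrete, here is a minimal sketch of the classic split-Rhat calculation in plain Python. It is an illustration only: it is not the code Stan runs, and it omits the rank-normalization step that newer versions of the diagnostic add. The function name and the simulated chains are made up for the example.

```python
import random
import statistics


def split_rhat(chains):
    """Classic split-Rhat for one parameter.

    chains: list of lists of post-warmup draws, one inner list per chain.
    Each chain is split in half, then between-half and within-half
    variances are compared.
    """
    n_draws = min(len(c) for c in chains)
    half = n_draws // 2
    if half < 2:
        raise ValueError("need at least 4 draws per chain")
    halves = []
    for c in chains:
        c = list(c)[:n_draws]
        halves.append(c[:half])             # first half
        halves.append(c[n_draws - half:])   # second half (middle draw dropped if odd)
    means = [statistics.fmean(h) for h in halves]
    variances = [statistics.variance(h) for h in halves]
    between = half * statistics.variance(means)   # B
    within = statistics.fmean(variances)          # W
    var_plus = (half - 1) / half * within + between / half
    return (var_plus / within) ** 0.5


# Simulated draws, not real Stan output.
rng = random.Random(0)
mixed = [[rng.gauss(0, 1) for _ in range(1000)] for _ in range(4)]
stuck = [row[:] for row in mixed]
stuck[0] = [x + 3 for x in stuck[0]]  # one chain sitting in a different region

print(split_rhat(mixed))  # close to 1 when chains agree
print(split_rhat(stuck))  # well above 1 when one chain disagrees
```

Both variance estimates are built from the draws within each half-chain, so with very short chains they are noisy, and a value near 1 is weaker evidence that the chains really agree.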

shira · July 18, 2019, 11:05pm

I wish I knew! Great question :)

Reading this now! https://betanalpha.github.io/assets/case_studies/divergences_and_bias.html#21_a_dangerously-short_markov_chain
