Valid to use more chains to increase posterior sample size?

Gang · September 5, 2017, 5:47pm

So far I’ve fitted a model to the data using 4 chains of 1000 iterations each: no warnings, and all Rhat values are 1.0. I guess the resulting 2000 post-warmup posterior samples are good enough to obtain the central 95% posterior interval.

Suppose that I would like to obtain the central 99% posterior interval. How many samples would be considered reasonable? Is it valid to obtain more samples by simply increasing the number of chains, for example, 20 chains with 1000 iterations each for 10,000 samples?

sakrejda · September 5, 2017, 5:54pm

As long as the multi-chain R-hat is good (and you don’t see other convergence issues) you can combine chains. In many ways it’s better because it’s more likely to show you issues with the posterior. That said, out of curiosity, why 99% posterior intervals?
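
For readers who want to see what the multi-chain R-hat check amounts to, here is a minimal sketch of the classic split R-hat computed across pooled chains. It is not Stan's own implementation (recent Stan versions report a rank-normalized variant), and the simulated chains and the `split_rhat` helper are assumptions made up for illustration.

```python
import random
import statistics


def split_rhat(chains):
    """Classic split R-hat for one scalar parameter.

    chains: list of equal-length lists of post-warmup draws.
    """
    half = len(chains[0]) // 2
    # Split each chain in two so drift within a chain is also detected.
    halves = []
    for chain in chains:
        halves.append(chain[:half])
        halves.append(chain[half:2 * half])
    means = [statistics.fmean(h) for h in halves]
    # W: average within-chain variance; B / n: variance of the chain means.
    within = statistics.fmean([statistics.variance(h) for h in halves])
    between_over_n = statistics.variance(means)
    var_plus = (half - 1) / half * within + between_over_n
    return (var_plus / within) ** 0.5


rng = random.Random(1)
# Hypothetical example: 20 well-mixed chains of 500 post-warmup draws each.
good = [[rng.gauss(0.0, 1.0) for _ in range(500)] for _ in range(20)]
# Same chains, but one of them is stuck in a different region.
bad = [list(c) for c in good]
bad[0] = [x + 3.0 for x in bad[0]]

rhat_good = split_rhat(good)
rhat_bad = split_rhat(bad)
print(f"well-mixed chains: {rhat_good:.3f}")
print(f"one stray chain:   {rhat_bad:.3f}")
```

In this toy setup a single stray chain pushes R-hat well above 1, which is the sense in which running more chains can help expose problems before the draws are pooled.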

Gang · September 5, 2017, 6:09pm

Thanks for the confirmation!

> out of curiosity, why 99% posterior intervals?

The customer complained that the current results from the central 95% interval are too overwhelming, and would like to prune the results to some extent. I don’t know how to address the issue, and would like to hear any suggestions. The 95% interval seems to be the custom, but does such a cutoff invite the same criticism that conventional statistics receives for the p-value threshold of 0.05: binarized decisions and arbitrariness?

sakrejda · September 5, 2017, 7:01pm

Depending on context, I would resort to ranking, especially if this is something where statistically identified leads are further confirmed. For example, in genome-wide association studies (GWAS), any particular lead might be followed up by a manipulative study to confirm the association. In drug development you might need a set of candidates to put through further screening. What these have in common is that you need to identify the most promising set of candidates to follow up with a limited set of resources. So think top 10 rather than a discretized yes/no.

betanalpha · September 5, 2017, 7:34pm

Just be careful – you’ll need about 10,000 effective samples to pin down the 1% and 99% quantiles well enough, not just 10,000 samples.
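
A rough sketch of why the effective sample size matters for tail quantiles: the Monte Carlo error of an estimated quantile shrinks with the square root of the number of effectively independent draws, and it is larger in the tails where the density is low. The code below assumes a standard normal posterior and truly independent draws (so the effective sample size equals the number of draws); `quantile_mcse` and `simulated_quantile_sd` are hypothetical helpers, not Stan functions.

```python
import random
import statistics


def quantile_mcse(p, ess, dist=statistics.NormalDist()):
    """Asymptotic Monte Carlo standard error of the estimated p-quantile,
    given `ess` effectively independent draws from `dist`."""
    q = dist.inv_cdf(p)
    return (p * (1 - p) / ess) ** 0.5 / dist.pdf(q)


def simulated_quantile_sd(p, n_draws, n_reps, rng):
    """Spread of the empirical p-quantile over repeated independent runs."""
    estimates = []
    for _ in range(n_reps):
        draws = sorted(rng.gauss(0.0, 1.0) for _ in range(n_draws))
        estimates.append(draws[int(p * n_draws)])  # simple order statistic
    return statistics.stdev(estimates)


# Theoretical error of the 99% quantile for a few effective sample sizes.
for ess in (250, 2000, 10000):
    print(f"ESS {ess:>5}: MCSE of 99% quantile ~ {quantile_mcse(0.99, ess):.3f}")

# Simulation check with independent draws.
rng = random.Random(2017)
sd_2000 = simulated_quantile_sd(0.99, 2000, 100, rng)
sd_10000 = simulated_quantile_sd(0.99, 10000, 100, rng)
print(f"simulated sd, 2000 draws:  {sd_2000:.3f}")
print(f"simulated sd, 10000 draws: {sd_10000:.3f}")
```

With autocorrelated MCMC draws the relevant number is the effective sample size for the quantity of interest, not the raw number of draws, so the same error level can require many more iterations.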

Gang · September 5, 2017, 8:27pm

> What these have in common is that you need to identify the most promising set of candidates to follow up with a limited set of resources. So think top 10 rather than a discretized yes/no.

Thanks for the suggestion! I’ll discuss this possibility with the customer.

> Just be careful – you’ll need about 10,000 effective samples to pin down the 1% and 99% quantiles well enough, not just 10,000 samples.

A very good point, Mike! I guess I’m fine so far with the current result: the number of effective samples was 2000 out of 2000 draws for the effect of interest. However, the number of effective samples was pretty low (250-400) for an effect I’m not interested in. Does this indicate a problem with the model or parameterization overall?

sakrejda · September 5, 2017, 8:36pm

One of the cool things about Bayesian effect estimation is that you can easily do a top ten adjusted for uncertainty.
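
One possible way to do this, sketched under assumptions: rank the effects within every posterior draw and report, for each effect, the share of draws in which it lands in the top k. The posterior draws below are simulated from made-up means and standard deviations; in practice they would come from the fitted model. The helper `top_k_probability` is hypothetical, and "top" here means largest value (ranking by absolute value may suit other problems better).

```python
import random


def top_k_probability(draws, k):
    """draws: list of posterior draws, each a list with one value per effect.

    Returns, for each effect, the fraction of draws in which it is among
    the k largest effects.
    """
    n_effects = len(draws[0])
    counts = [0] * n_effects
    for draw in draws:
        order = sorted(range(n_effects), key=lambda j: draw[j], reverse=True)
        for j in order[:k]:
            counts[j] += 1
    return [c / len(draws) for c in counts]


rng = random.Random(42)
# Hypothetical posterior: 12 effects, 2000 draws, independent normals.
post_mean = [0.1 * j for j in range(12)]
post_sd = [0.3] * 12
draws = [
    [rng.gauss(m, s) for m, s in zip(post_mean, post_sd)]
    for _ in range(2000)
]

probs = top_k_probability(draws, k=3)
ranking = sorted(range(12), key=lambda j: probs[j], reverse=True)
for j in ranking[:5]:
    print(f"effect {j}: P(in top 3) = {probs[j]:.2f}")
```

Because the ranking is recomputed per draw, an effect with a large but very uncertain estimate gets a lower top-k probability than a point-estimate ranking would suggest.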

Bob_Carpenter · September 10, 2017, 12:09pm

What is the inferential goal?

I don’t think there’s an established custom as there’s no established “significance level”. If you choose 95% intervals, you are calibrated if 95% of the true values fall in those 95% intervals; same for 10%, 50%, or 99% intervals.
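
A small simulation can illustrate this notion of calibration. The sketch below assumes a toy conjugate model chosen only because its posterior is known in closed form: the true value is drawn from a N(0, 1) prior, one observation y is drawn from N(theta, 1), and the posterior is then N(y / 2, 1 / 2). The function name `coverage` and the number of simulations are arbitrary choices.

```python
import random
import statistics


def coverage(level, n_sims, rng):
    """Share of simulations in which the central `level` posterior interval
    contains the true value, for theta ~ N(0, 1) and y | theta ~ N(theta, 1)."""
    z = statistics.NormalDist().inv_cdf(0.5 + level / 2)
    post_sd = 0.5 ** 0.5  # posterior is N(y / 2, 1 / 2) for this model
    hits = 0
    for _ in range(n_sims):
        theta = rng.gauss(0.0, 1.0)  # true value drawn from the prior
        y = rng.gauss(theta, 1.0)    # a single observation
        post_mean = y / 2
        if abs(theta - post_mean) <= z * post_sd:
            hits += 1
    return hits / n_sims


rng = random.Random(10)
for level in (0.10, 0.50, 0.95, 0.99):
    print(f"{level:.0%} interval covers the truth in "
          f"{coverage(level, 20000, rng):.1%} of simulations")
```

When the model matches the data-generating process, as it does here by construction, each interval's coverage should be close to its nominal level, whichever level is chosen.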
