Summarising Rhat values over multiple variables/fits

avehtari · August 17, 2021, 7:53am

In the end you should care about MCSE, but as a quick scale free diagnostic Rhat is useful, but any Rhat threshold not derived from MCSE is ad hoc
1.01 was chosen assuming one or a few Rhats are examined and the chains are run long enough to be able to infer autocorrelations well, too.
In the new Rhat paper we didn’t explicitly discuss multiple comparisons, but what you write is the natural way to think about it.
Multiple comparison correction as you describe is one way. When there are many variables, a more fancy approach would be to use a model to learn the variation.
Looking at just the percentage exceeding is not enough as those exceeding might exceed a lot, so it’s better to assume some distribution for the Rhats and compare to that.
For repeated automated testing a single binary decision can be useful, but in case of triggering the threshold, there should be more information available. I usually just eyeball the Rhats, but plotting a histogram of Rhats with a assumed distribution overlaid could be a useful way to look if the highest Rhats are suspiciously high.
If in doubt, run more iterations
If more iterations would be very costly, look at the other diagnostics such as ESS, MCSE, R*, etc.

Topic		Replies	Views
Rhat values from a single chain vs multiple chains Modeling	2	729	December 30, 2022
Rhat of one estimate always close to 1 Modeling fitting-issues	2	183	February 29, 2024
Divergent Transitions with R_hat < 1.1 Modeling fitting-issues , performance , shinystan	14	2343	June 13, 2019
Quick question about cutoff values Modeling	2	610	February 24, 2021
Cherry-picking Rhat values Modeling	16	1274	October 6, 2017