Understanding Rhat with respect to the results from Bayesian Hierarchical modeling

Murali_037 · March 19, 2020, 5:13pm

Hi everyone,

I know That close to 1 and 1.1 is good and means the chains are mixing well and they converge.

Does that mean if we use no of chains =1, Rhat values doesn’t matter?

Can someone explain how to interpret Rhat values and what it actually means?

Thanks a lot in advance,

torkar · March 19, 2020, 5:41pm

Hi, I think that \widehat{R} looks at both the within and between chain variance. Hence, setting chains = 1 would probably make \widehat{R} results questionable at best (btw, it should be \widehat{R}<1.01.

Bob_Carpenter · March 19, 2020, 6:09pm

The definition we use for R-hat is in the Stan reference manual. It splits all chains in half before applying the “old” definition of R-hat. How good an R-hat value you need will depend on how good you need the results to be. We typically start with shorter chains while developing models then run longer until they stop griping before publishing. What you’re really looking for is a high enough effective sample size, as that takes R-hat-like cross-chain info into account (definition also in the manual). But the effective sample size estimates are unreliable when they’re small. You can’t trust an effective sample size of 10 to truly be 10. It needs to be in the neighborhood of 100 per chain before it becomes reliable.

Topic		Replies	Views
Traceplot vs Rhat for convergence General	11	2480	April 3, 2024
Rhat < 1 (as low as 9.94e-01). Why? General	6	4045	June 19, 2019
Rhat values from a single chain vs multiple chains Modeling	2	825	December 30, 2022
Potential Scale Reduction and chains=1 General	7	859	November 6, 2017
R hat with only one chain General	11	429	October 15, 2024

Understanding Rhat with respect to the results from Bayesian Hierarchical modeling

Related topics