How much is enough (neff)

timdisher · May 2, 2019, 11:06pm

I am trying to develop some rough guidelines around how many samples after convergence are required for analyses at my work. Does anyone here have good references on how I can use the new mcse outputs from monitor() to help with this? We typically will report things like odds ratios out to a maximum of three decimal places, so for the tails would it make sense to suggest MCSE <= 0.0002? My thinking is that this would put 2sds at 0.0004 which would give me a relatively stable third decimal place? If I am way off base here it might just be easiest to point me towards a useful reference if you’re aware of one. The BUGS book suggests a general 0.5% of posterior SD rule but that doesn’t seem fit for purpose here.

 param      Q5     Q50     Q95    Mean     SD MCSE_Q5 MCSE_Q50 MCSE_Q95 MCSE_Mean MCSE_SD   Rhat Bulk_ESS Tail_ESS
 OR[1,2]  0.7635  0.8073  0.8540  0.8078 0.0276  0.0003   0.0003   0.0004    0.0002  0.0002 1.0000    13267    23131

sakrejda · May 3, 2019, 10:13am

You’ve got the logic down right (MCSE’s aren’t mysterious). This is the writeup for the new diagnostics: https://arxiv.org/pdf/1903.08008.pdf

The 0.5% rule is fine IFF you’re doing things like estimating the mean effect of daily temperature on ice melting speed. Nobody’s going to argue much that it exists, the only questions is how big and getting the SD roughly right is much more important than the edges. When you’re going to put thresholds on parameters and make decisions based on those thresholds like people in medical fields like to do, then it’s important to get the right digits in there.

Whether it’s worth digging more depends on how important these decimal places are to you. They could be off if some other model parameter is not-quite-mixing but it’s really rare that I see people examining that in detail. I usually try to understand if the well-behaved parameters are related to the questionable ones and fit an alternative model that might highlight the issue (e.g.-now mixing is good for all parameters did those Q95/MCSE95 values budge?).

avehtari · May 3, 2019, 11:59am

That seems too much. I know it’s common, but it’s also as common complain that there are unnecessary many decimals reported. I don’t think there is any practical benefit to report more than two significant digits for odds ratios.

timdisher · May 3, 2019, 12:12pm

Totally agree but that is based on clinical obsession with intervals excluding unity and it will take some time to nudge people away from it. I’ve been asked for intervals down to 4 decimals and that I pushed back on, but it’s fighting against the inertia of a whole industry.

betanalpha · May 3, 2019, 3:04pm

The precision necessary of the MCMC estimators is determined by the comparisons you want to make between expectation values. If you want to compare differences of odds ratios to the third decimal place than your logic is sound. If you need only the first or second decimal place then you can get by with many fewer effective samples. If you know that the necessary precision is pretty small but people want many decimal places reported then you can go with fewer effective samples and report something like 0.583 +/- 0.022 – it will upset some people but it many be the path of least resistance given the inertia in the field.

Topic		Replies	Views
Reasonable success rate in MCMC sampling Modeling	6	125	February 9, 2025
Improve warnings for low ESS Developers	31	4099	August 5, 2020
Tail Effective Samples Size (ESS) is too low, indicating posterior variances and tail quantiles may be unreliable General rstan	6	4452	September 25, 2020
How many iterations to run and how many digits to report (notebook) General	0	503	June 7, 2022
Resources of desirable MCMC representativeness General shinystan	2	826	April 18, 2019

How much is enough (neff)

Related topics