I’m finishing up a draft manuscript that I’ve been working on for some time. I’ve got some fairly large stanfit objects saved from models I ran previously under rstan 2.17.3. In the meantime, I’ve updated to rstan 2.18.
I’ve got a sentence in my manuscript where I reported the average effective sample size (n_eff) for a set of parameters. I wanted to double-check those numbers, but I wasn’t able to replicate what I had reported earlier when using rstan 2.18 to summarize a saved stanfit object that was fit under rstan 2.17.3. Even more odd, I now get n_eff values that are greater than the number of post-warmup samples, and some that are even greater than the total number of samples.
I’ve still got rstan 2.17.3 on another machine, so I’m able to directly compare the results from the same model object summarized under the two different versions of rstan. I can confirm that for parameters where n_eff was previously reported as equal to the number of post-warmup samples (in this case 5000), it is now often reported as greater. For parameters that previously had n_eff less than the number of post-warmup samples, the reported n_eff also differs between 2.17.3 and 2.18, but not in a systematic way. For example, the first two parameters had n_eff of 3580 and 2646 as reported by 2.17.3, but are now 3516 (lower) and 2661 (higher) under rstan 2.18.
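For reference, this is roughly how I’m extracting the numbers on each machine (a minimal sketch; the filename `fit.rds` is a stand-in for my actual saved object):

```r
library(rstan)

# Load the same saved stanfit object on each machine
# ("fit.rds" is a hypothetical filename for the saved model)
fit <- readRDS("fit.rds")

# summary() on a stanfit returns a list whose $summary element
# is a matrix with an "n_eff" column, one row per parameter
neff <- summary(fit)$summary[, "n_eff"]

# Record which rstan version produced these numbers,
# then compare the first few parameters across versions
packageVersion("rstan")
head(round(neff))
```

Running this on the 2.17.3 machine and the 2.18 machine against the identical `.rds` file is what produces the discrepancies described above.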
I’d like to avoid rerunning the models, since that would delay me another week, but I want to be sure I can trust those previous runs. Why would n_eff change between versions? Which numbers should I report?