Same model, same data, same seed, different computers, different number of divergences

ramiro · September 6, 2023, 6:42pm

A model is being run on the same dataset on two different computers (my mac laptop vs the unix cluster where I work, R4.3.1 the former, R4.2.2 the latter, I am using cmdstanr). I use the same seed but I am getting a different number of divergences and slightly different results. Just noticed this as I was trying to understand the reason for the divergences and wanted to reproduce the result on my laptop so I can graph things. Do different number of divergences for the same data with the same seed make any sense or is this a red flag?

amas0 · September 6, 2023, 7:00pm

Of interest may be the reproducibility section in the Stan reference manual. In particular:

Stan is designed to allow full reproducibility. However, this is only possible up to the external constraints imposed by floating point arithmetic.

Stan results will only be exactly reproducible if all of the following components are identical:
Stan version
Stan interface (RStan, PyStan, CmdStan) and version, plus version of interface language (R, Python, shell)
versions of included libraries (Boost and Eigen)
operating system version
computer hardware including CPU, motherboard and memory
C++ compiler, including version, compiler flags, and linked libraries
same configuration of call to Stan, including random seed, chain ID, initialization and data

That should, at least partially, explain the discrepancies between the outputs. As to mitigating this and/or whether the number of divergences is meaningfully different, I can’t say.

ramiro · September 6, 2023, 7:11pm

Thank you, this does explain the differences in the number of divergences. Thanks for pointing to the reproducibility section.

Topic		Replies	Views
Same code (with the same seed) but different results on different platforms? Why? General rstan	2	1522	August 29, 2021
Rstan : Could the output of the stan model (each post warmup iteration draws) be different between linux and windows? Modeling rstan	2	325	June 30, 2023
Different results on the same data with the same seed Modeling fitting-issues	1	319	May 18, 2023
Question about the Reproducibility of Stan Results Algorithms cmdstan , cmdstanr	6	1651	January 10, 2022
Different Outputs in RStan vs. PyStan Interfaces	28	3005	April 19, 2020

Same model, same data, same seed, different computers, different number of divergences

Related topics