Running multiple chains on different computers

jsocolar · September 21, 2020, 2:49pm

Are there any known pitfalls or best practices for ensuring independent RNG behavior in completely separate instances of Stan running on different computers (some of which might be virtual machines)? Specific instructions/code greatly appreciated, especially if it’s something I have to do outside of R/cmdstanR.

Motivation:
I am using within-chain parallelization to fit a model with very long compute time. Because I have access to multiple computers, I think the best way to use the available resources is to run different chains on different computers. I am using cmdstan via cmdstanR, and I have access to cores on Mac, Linux, and Windows machines.

jsocolar · September 24, 2020, 3:40pm

Is it uncontroversially ok to select different arbitrary seeds for the different chains and assume that the behavior is independent?

mike-lawrence · September 24, 2020, 5:21pm

Yes. Indeed, it’s tough to get precise replicability when you want it, so simply using a different seed on the different computers shouldn’t pose any risk of non-independence.

Topic		Replies	Views
Running cmdstanr in parallel on computing cluster General	6	1005	December 9, 2022
Within-Chain parallelization & Seed Modeling rstan	7	85	October 28, 2024
Advice for parallelizing many Stan models with multiple chains Modeling	1	614	September 20, 2022
Running chains on multiple cores Developers	2	899	January 30, 2023
Chain ID for Multiple Chains Algorithms rstan	2	684	August 23, 2018

Running multiple chains on different computers

Related topics