Different Outputs in RStan vs. PyStan

bukosabino · April 12, 2020, 8:58pm

I try to reproduce a rstan experiment using pystan, but the results are not identical.

Of course, I am sure about this:

All the variables have the same values.
I use the same OS and computer.
I set the same seed in the R and Python experiment (at the beginning of the scripts) and the same seed in sampling rstan/pystan functions.

The printed model summary should be identical? Or is it normal to get different results (beta, eta, pi, epsilon?

Clue: When I run big experiments (chain=4 and iter=1000), the results are pretty similar (but don’t match), and when I run toy experiments (chain=1 and iter=5) the results are different.

Best,
Dario

ahartikainen · April 15, 2020, 10:44pm

Ok, I have done some digging. With extra_compile_args=['-O3'] ( and RStan with -O3) and setting same seed, I’m able to get the same random init.

I also needed to set chain_id = [1, 2, …] for each chain, so it would follow RStan.

Adaptation info is already different, so I will need to find out where the difference comes from.

wds15 · April 15, 2020, 10:55pm

I am not sure it’s worth following this up. Rstan has some slight patches to make things work with CRAN…

ahartikainen · April 16, 2020, 4:19am

It’s more of a hobby for me.

I think pystan3 / rstan3 should fix this.

ahartikainen · April 16, 2020, 2:20pm

Currently the difference is created in https://github.com/stan-dev/stan/blob/develop/src/stan/mcmc/hmc/nuts/base_nuts.hpp

Somewhere in transition, before this rng values are same. I had 2 chains where 1st chain had 187 extra calls for rng in RStan compared to PyStan. The seconds chain had 550 (don’t remember accurate diff right now) extra calls in PyStan compared to RStan.

ahartikainen · April 17, 2020, 2:26pm

Ok, there are some different flags for RStan and PyStan

Following settings for PyStan gives same results as in RStan (with the same compiler)

model = pystan.StanModel(..., extra_compile_args=['-O3', "-march=native", "-mtune=native"])

fit = model.sampling(..., seed=123, chain_id=[1,2, ...])

I would also use file= option for StanModel to make sure the model is the same.

See seed example here:

Edit. this of course depends what flags you have used

andrjohns · April 17, 2020, 2:41pm

extra_compile_args=['-O3', "-march=native", "-mtune=native"]

Are these not already set by default? They could give some easy performance benefits to Pystan users, since the flags enable the maximum level of compiler optimisation and SIMD vectorisation.

(Apologies if explaining what you already know!)

ahartikainen · April 17, 2020, 2:56pm

These were chosen a long time ago. I think these are not the defaults in RStan, but just recommended in the install guide.

cc @ariddell

ariddell · April 19, 2020, 1:28pm

Here’s the relevant history: clang had problems compiling stanc with -O3, see https://github.com/stan-dev/pystan/pull/33 .

Topic		Replies	Views
Slightly discrenpancy between Rstan and Cmdstanr output General	2	499	February 1, 2021
Initializing Stan Model object with random_seed - all interfaces should use 3-arg constructor Developers	4	821	May 16, 2017
Same model, same data, same seed, different computers, different number of divergences General	2	667	September 6, 2023
Rstan : Could the output of the stan model (each post warmup iteration draws) be different between linux and windows? Modeling rstan	2	325	June 30, 2023
Same code (with the same seed) but different results on different platforms? Why? General rstan	2	1522	August 29, 2021

Different Outputs in RStan vs. PyStan

Related topics