Different Outputs in RStan vs. PyStan

Hi,

Using the same model, seed, iterations and warmup, is there a reason why RStan would yield different output than PyStan for the same input dataset?

Looking forward to any insights.

Thanks!
Ivan


Different implementation of the PRNG?

Avraham, thanks for the reply.

I used get_seed() in PyStan and RStan, and both return the same value as the seed I supplied.

Is there another check I can do?

Thanks.
Ivan

This is a bug if it is happening. Draws and optimization results should be identical given the same random seed. Other things, such as the printed summary of draws, are not yet identical.

Thanks for the feedback, @ariddell.

I wonder if anyone else had this same issue.

You need to set the seed in Stan, not in Python or in R.
At that point, everything should produce the same results.
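
For example, a minimal sketch of what that means (my illustration, not from this thread; school.stan and my_data are placeholders): pass the seed through the interface's seed argument so it reaches Stan's sampler, rather than calling set.seed() in R or random.seed() in Python, which only seed the host language's RNG.

# R: the seed is handed to Stan's sampler via the seed argument
fit_r <- rstan::stan(file = "school.stan", data = my_data,
                     iter = 2000, warmup = 1000, chains = 1, seed = 6789)

# Python (PyStan 2): same idea with sampling()
import pystan
sm = pystan.StanModel(file="school.stan")
fit_py = sm.sampling(data=my_data, iter=2000, warmup=1000, chains=1, seed=6789)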

If it doesn’t, it should all get fixed after the refactor
lands in 2.15 and we call exactly the same underlying code
for sampling (rather than having a lot of complicated duplicated
code across interfaces).

  • Bob

Also check whether the chain_id is the same, as it strides the RNG so that multiple chains don’t overlap. For example, in RStan chains=1 sets the default chain_id to 1, whereas the default chain_id in CmdStan is 0. I ran into this issue recently and had to resolve it by setting id=1 in CmdStan.
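
A sketch of pinning the chain ID explicitly on both sides (my illustration, not from this thread; the file and data names are placeholders):

# RStan: chain_id is accepted among the extra arguments to stan()
fit <- rstan::stan(file = "school.stan", data = my_data,
                   chains = 1, seed = 6789, chain_id = 1)

# CmdStan: id sets the chain ID on the command line, e.g.
# ./school sample id=1 random seed=6789 data file=school.data.R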

Thanks for the response.

Update: I ran both models with a reduced warmup-to-sampling ratio and got closer outputs; initially the warmup/sampling ratio was 1:1, now it’s 1:4.

@Bob_Carpenter: I set the seed within the stan() call, with the seed argument; this should rule out the issue you noted.

@betanalpha: Similar to the seed, I set the chains argument in the stan() call to 1 for both; hence, I shouldn’t run into the issue you noted.

Are you running on the same hardware with the same underlying C++
compiler and compiler options? They really should produce
exactly the same answer in that situation.

We’re not going to try to debug this now as everything’s getting
changed in Stan 2.15 and that should make everything consistent.
For now, there are probably variations in the algorithms that
aren’t updated the same way across RStan, PyStan and CmdStan, which
is one of the main reasons we pulled all the important code down
into C++.

I don’t know what you mean by 1:1 in warmup/sampling ratio.
Is that just the number of iterations you chose?

  • Bob

Same machine for running both.

Regarding the warmup/sampling ratio, the initial run on both had 20000 total iterations with a warmup of 10000 (i.e. 10000 sampling iterations as well). This gave different output when extracting quantile values (e.g. the 10% quantile).

The second run had the same number of iterations, but warmup set to 5000 (i.e. 15000 sampling iterations); this gives closer output for both.

As Michael’s suggesting, I think it’s probably a count from 0 vs.
count from 1 issue on the chain ID. Have you tried controlling for
that? Also, I think your reply is truncated.

In general, you just want to be able to replicate within MCMC
standard error. The whole process is probabilistic and there’s nothing
special about any given seed and run.

And as I said, we’re bringing all the implementations exactly in
line with the forthcoming Stan 2.15 release.

  • Bob

@Bob_Carpenter: I set the chains argument in the stan() call to 1 for both; hence, I shouldn’t run into the issue you noted.

With the lower warmup:sampling ratio, we’re within the MCMC standard error, as measured by RMSE.

Looking forward to 2.15.

Thanks for all the feedback.

Ivan

@leungi, with the same compiler, same settings, same hardware, same operating system, same versions of Stan, same random seed and chain id, the runs will be identical. Not just within MCMC SE, but identical.

I think PyStan, by default, compiles with an optimization level of -O2, which could keep it from running exactly the same way. But I’m not absolutely sure.
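
If the optimization level is the suspect, one way to check (my suggestion, not something prescribed in this thread) is to force the same flags on both sides; extra_compile_args exists in later PyStan 2 releases, and RStan picks up flags from ~/.R/Makevars:

# PyStan 2: pass compiler flags when building the model
sm = pystan.StanModel(file="school.stan", extra_compile_args=["-O2"])

# RStan: put the matching flag in ~/.R/Makevars before compiling, e.g.
# CXXFLAGS = -O2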

You might have to explicitly set the seed to make them match. Just setting the number of chains might not be good enough.

As noted in a previous post, the seed and chains were explicitly set to be the same, hence the confusion.

Thanks for the feedback.

Up until recently (current develop; not sure if it’s in a release yet), the different interfaces did not all initialize things in exactly the same way, so you might end up with one more or fewer PRNG calls in one interface vs. another. When Daniel says same x/y/z, that includes the interface and (as Daniel mentions above) the optimization level.

I have the same problem of getting different results in PyStan and RStan.

RStan code:

library(rstan)

schools_dat <- list(J = 8,
                    y = c(28, 8, -3, 7, -1, 1, 18, 12),
                    sigma = c(15, 10, 16, 11, 9, 11, 10, 18))

fit <- stan(file = "school.stan", iter = 500, warmup = 400,
            data = schools_dat, cores = 2, seed = 6789, chains = 1)

la <- extract(fit, permuted = TRUE)  # return a list of arrays
mu <- la$mu
length(mu)
mu[1:10]

Python Code:

import os
import pystan

schools_code = """
data {
  int<lower=0> J;            // number of schools
  vector[J] y;               // estimated treatment effects
  vector<lower=0>[J] sigma;  // s.e. of effect estimates
}
parameters {
  real mu;
  real<lower=0> tau;
  vector[J] eta;
}
transformed parameters {
  vector[J] theta;
  theta = mu + tau * eta;
}
model {
  eta ~ normal(0, 1);
  y ~ normal(theta, sigma);
}
"""

schools_dat = {'J': 8,
               'y': [28, 8, -3, 7, -1, 1, 18, 12],
               'sigma': [15, 10, 16, 11, 9, 11, 10, 18]}

os.environ['STAN_NUM_THREADS'] = "2"
sm = pystan.StanModel(model_code=schools_code)
fit = sm.sampling(data=schools_dat, iter=500, warmup=400, chains=1, seed=6789)

=================

Both use the identical Stan model. The first 10 values of mu are quite different.

Did you solve your problem?

RStan, PyStan, and CmdStan should produce identical outputs.

Is the underlying version of the Stan library the same?

What happens if you replace that Python code with the following?

sm = pystan.StanModel("school.stan")

Doing that should guarantee that the same Stan code is being used with both RStan and PyStan.
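
Once both interfaces read the same file, a sketch for comparing the draws themselves (my suggestion, not from this thread; note that permuted=TRUE shuffles the draws, so comparing in sampling order with permuted=FALSE is less ambiguous). Here fit_r and fit_py stand for the fit objects from the R and Python runs above:

# R: draws in sampling order, shape iterations x chains x parameters
mu_r <- rstan::extract(fit_r, pars = "mu", permuted = FALSE)[1:10, 1, 1]

# Python (PyStan 2): mu is the first declared parameter, hence index 0
mu_py = fit_py.extract(permuted=False)[:10, 0, 0]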

Same issue here.

Why is the printed summary not yet identical?

What problems do you have with the printed summary?