I have a model where I do something like (dumbed down):

```
parameters {
  vector[nrow_y] mu;
}
model {
  for (col in 1:ncol_y) {
    for (row in 1:nrow_y) {
      y[row, col] ~ normal(mu[row], 1);
    }
  }
}
```

Which of course can be sped up by vectorizing:

```
parameters {
  vector[nrow_y] mu;
}
model {
  for (col in 1:ncol_y) {
    y[, col] ~ normal(mu, 1);
  }
}
```

Using cmdstanr with the data and seed held constant, I expected that the only consequence of switching from the first model to the second would be faster computation, and that the actual course of adaptation and sampling would remain identical, since the computations should yield the same values. However, this seems not to be the case: running with the same data and seed, I get entirely different values in the posterior (and noticeably different adaptation performance).
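One thing I considered (this is just a guess on my part about Stan's internals): a vectorized sampling statement might accumulate the per-observation log-density terms in a different order than the explicit loop, and floating-point addition is not associative. A minimal Python sketch of that non-associativity:

```python
# Floating-point addition is not associative, so the same set of
# log-density terms summed in a different order need not produce
# bit-identical results.
a, b, c = 0.1, 0.2, 0.3

left = (a + b) + c    # one accumulation order
right = a + (b + c)   # a different accumulation order

print(left == right)   # False
print(left - right)    # on the order of 1e-16
```

If something like that is going on, even a last-bit difference in the log density or its gradient would presumably get amplified over many leapfrog steps and iterations, so the chains would drift apart despite the identical seed.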

Now, this arose while debugging a more complex model that definitely has its own pathologies I'm still working out, so maybe I've done something else wrong that leads to the non-equivalence. But I wanted to check first: am I wrong to even expect equivalence? Does vectorization change something about the compute state that would lead the adaptation paths to diverge?