Start up takes "forever" after slight rewrite of model

dlakelan · June 21, 2017, 5:49pm

I have a situation where I’m using a LOT of data (tens of thousands of survey responses) and I want to keep track of an intermediate quantity, so I modified my model.

Instead of having something like this in the model block:

for(i …)
data[i]/function(parameters[i]) ~ some_distribution()

I now have a transformed parameter:

for(i…){
intermediateval[i] = data[i]/function(parameters[i]);
}

Then, in the model block:

intermediateval ~ some_distribution()

The point is that I now have a big vector that stores the intermediate value, and the sampling statement is vectorized.
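For concreteness, a minimal sketch of what the rewritten model might look like. The names `N`, `y`, `theta`, and `sigma`, the priors, and the use of `exp()` and `normal()` in place of `function()` and `some_distribution()` are all assumptions made for illustration; the actual model is not shown in this thread.

```stan
data {
  int<lower=1> N;              // number of survey responses (assumed name)
  vector[N] y;                 // stands in for data[i]
}
parameters {
  vector[N] theta;             // stands in for parameters[i]
  real<lower=0> sigma;         // assumed scale parameter
}
transformed parameters {
  vector[N] intermediateval;   // saved for every draw, all N cells
  for (i in 1:N) {
    // exp() is a placeholder for function()
    intermediateval[i] = y[i] / exp(theta[i]);
  }
}
model {
  theta ~ normal(0, 1);        // placeholder prior
  sigma ~ normal(0, 1);        // placeholder prior (half-normal via the lower bound)
  // normal() is a placeholder for some_distribution()
  intermediateval ~ normal(1, sigma);
}
```

Because `intermediateval` is declared in `transformed parameters`, all `N` of its cells become part of the saved output, in addition to the parameters themselves.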

After making this change and running Stan, it takes “quite a while” (minutes?) to get the first message about the speed of the gradient calculation. Once that appears, sampling runs at roughly the same speed as before.

Is there some massive one-time calculation that takes place before sampling that would get substantially longer in the case where I have this large intermediate value?

dlakelan · June 21, 2017, 5:51pm

“Quite a while” might be between 10 and 30 minutes for that first gradient-speed message, and then sampling takes something like 4 or 5 hours to get 70 iterations.

Before the change, it took only a few seconds to get that initial gradient-speed message, and then several hours to get the iterations.

Hey, at least at the end of the sampling I’m getting a good fit after all my playing with the model specification and parameterization!

bgoodri · June 21, 2017, 7:01pm

Yes, it takes RStan an insane amount of time to allocate storage for a big vector because it makes one list element per cell of the vector.
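If the intermediate vector does not need to appear in the output, one possible workaround is to compute it as a local variable in the model block, since Stan does not save model-block locals. The sketch below assumes hypothetical names (`N`, `y`, `theta`, `sigma`), placeholder priors, and `exp()` and `normal()` standing in for the unspecified `function()` and `some_distribution()`. Whether this removes the startup delay in a given RStan version is an assumption that has not been verified here.

```stan
data {
  int<lower=1> N;              // number of survey responses (assumed name)
  vector[N] y;                 // stands in for data[i]
}
parameters {
  vector[N] theta;             // stands in for parameters[i]
  real<lower=0> sigma;         // assumed scale parameter
}
model {
  // Local variable: usable in the vectorized statement below,
  // but not written to the output, so no storage is allocated for it.
  vector[N] intermediateval = y ./ exp(theta);

  theta ~ normal(0, 1);        // placeholder prior
  sigma ~ normal(0, 1);        // placeholder prior (half-normal via the lower bound)
  intermediateval ~ normal(1, sigma);
}
```

The trade-off is that the intermediate values are no longer tracked per draw; they would have to be recomputed afterwards from the saved `theta` draws.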

dlakelan · June 21, 2017, 7:28pm

Thanks. I guess I really need to get CmdStan working. Can I use CmdStan from my existing RStan install, or do I have to install it separately?

bgoodri · June 21, 2017, 7:31pm

Separate.
