Simple Gaussian model: three variants with dramatically different sampling times

wds15 · May 31, 2021, 7:05am

The performance of any Stan program is limited by the size of its AD tree (the expression graph used to compute the gradient). In your last example you declare “d”, which has the size of the data (n entries). Try to avoid declaring “d” and instead store only reductions of it in variables within the model block: save sum(y-mu) and sum((y-mu)^2) and use these to get the final quantities you need. This avoids storing a full vector of size “n”. You should also consider using the profiling facility when experimenting with changes like this.
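To make the reduction idea concrete, here is a minimal sketch in plain Python (not Stan, and with no autodiff involved), so it only illustrates the algebra and not the speedup. It assumes a model of the form y ~ normal(mu, sigma) with scalar mu and sigma; the function names and the data values are made up for illustration. The point is that the Gaussian log-likelihood can be computed from the scalar sum((y-mu)^2) without keeping a length-n residual vector around; sum(y-mu) could be accumulated in the same way if a later quantity needs it.

```python
import math

def loglik_elementwise(y, mu, sigma):
    # Builds a full residual list "d" of length n, then sums per-point log densities.
    d = [yi - mu for yi in y]
    return sum(
        -0.5 * math.log(2 * math.pi) - math.log(sigma) - 0.5 * (di / sigma) ** 2
        for di in d
    )

def loglik_reduced(y, mu, sigma):
    # Keeps only a scalar reduction: the sum of squared residuals.
    n = len(y)
    ss = sum((yi - mu) ** 2 for yi in y)
    return -0.5 * n * math.log(2 * math.pi) - n * math.log(sigma) - 0.5 * ss / sigma ** 2

# Hypothetical data and parameter values, for illustration only.
y = [1.2, 0.7, 2.1, 1.5, 0.9]
a = loglik_elementwise(y, 1.0, 0.8)
b = loglik_reduced(y, 1.0, 0.8)
print(abs(a - b) < 1e-9)
```

Both functions return the same value up to floating-point rounding. Whether the reduced form is actually faster in Stan is something the profiler would have to confirm.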

My “?” was meant to signal that I am not sure the above will really help you. It is based on many years of Stan experience, but with performance work the results can be surprising. The “avoid declared parameters” rule is usually a good one for getting more speed (this is why the size of the AD tape is reported in the profiling output).

Does that make more sense?
