Has anyone tried coding a profilers for Stan programs?

syclik · May 15, 2018, 9:31pm

@yizhang asked this question and I thought I would put up the question and answer here so people can find it.

No. There’s a good reason. First, the only block that takes a while is the model block, unless there’s an error with an infinite loop or something really bad in transformed data, transformed parameters, or generated quantities. (Think <5% in those blocks. Maybe down to <1% for complicated models.)

Second, speeding up the model block doesn’t actually correspond to increased n_eff / time or even total wall time! Yes, it’s a good measurement to have, but not when trying to speed up a model.

Third, all of the time is in the computation of the log joint and computing the gradient with respect to the parameters. There once was a time when building up the expression graph (computing the function value) and applying the chain rule to that graph (computing all the gradients) were separated and measuring the two sweeps would have resulted in a good estimate of how much time was spent in computing the gradient. Not now. Now we greedily build the adjoints in the chain rule when we can reuse computations while computing the function. This makes the time to compute the value longer, but greatly decreases the time to compute the gradients. So, it’s tricky.

Hopefully that explains some of the reason why we haven’t tried to build one generally. If you want to do it, by all means, you have my support. If you have any more questions, fire away.

Bob_Carpenter · May 21, 2018, 2:06am

If there’s no mixing, the speed is zero, so no speedup. Otherwise, it’ll speed it up by the factor that it speeds up the evaluation of the log density. Do you mean that’s only part of the issue and there’s also mixing to consider?

The transformed parameters are transformed as part of the log density eval, so they’re hard to separate computationally from the rest of the model block.

syclik · May 21, 2018, 2:16am

Yes! (I typed that response on my phone in a few mins, so wasn’t really spelled out.)

Yup.

Topic		Replies	Views
Profiling gradient of model with transformed parameters block Modeling techniques	5	752	July 15, 2021
Stan and cmdstan running slow Modeling	8	729	March 24, 2022
Have you been using some of the latest features of Stan? General	14	2580	November 12, 2021
Seeking expert stan modeler for help speeding up a complex stan model Jobs fitting-issues , specification , performance	4	928	July 29, 2020
Stuck at warmup Modeling	11	3593	December 3, 2017

Has anyone tried coding a profilers for Stan programs?

Related topics