NUTS variation

betanalpha · August 16, 2019, 2:25am

Naming convention point: we are long past the “NUTS” algorithm. There will be a renaming around Stan3 but for now just refer to it as “dynamic HMC”.

In any case it’s better to think about the possible parallelization speedup this way. First speculatively integrate forwards and backwards in time and then run the sampler to consume those speculative trajectories. If you fully consume one of the trajectories then you can expand the speculation and continue.

Because the expansion is multiplicative, however, you’re likely to end up wasting a bunch of that speculative computation on both sides, yielding a much smaller speedup than 1.5 on average. Moreover you’ll have to keep all of those speculative states in memory which will be a significant burden for higher-dimensional problems. One of the big advantages of multiplicative expansion is needed only a logarithm number of states in memory at any given time.

Parallelizeable resources are much better spent on speeding the gradient evaluation or running multiple chains in memory to pool adaptation information.

Topic		Replies	Views
Within-chain parallelization idea (maybe crazy) Developers	35	2777	February 24, 2022
NUTS vs HMC Algorithms	8	6855	August 4, 2020
NUTS differences in Stan vs paper Algorithms	3	1123	February 2, 2017
Parallel dynamic HMC merits Developers features	38	3109	September 17, 2019
How to compile and profile NUTS in stan/src/stan/mcmc/hmc/nuts CmdStan	2	523	November 8, 2022

NUTS variation

Related topics