Accessible explanation to the No-U-Turn Sampler

Bob_Carpenter · January 9, 2018, 5:25am

adapt_delta just sets the target “acceptance rate” for the sampler. A higher target acceptance rate means adaptation will find lower step sizes. Once warmup’s done, these are locked in.

How adaptation works has changed over versions. But that target acceptance is now complicated as we’re not using the basic NUTS algorithm.

The main issue you run into is conditioning—the usual bugbear of any kind of gradient-based algorithm. If you get into a location in the posterior where the step size is too large, you get divergences. We only use gradient-based approximations (i.e., first order) of the real posterior curvature, so sometimes we need small step sizes to do that accurately.

Topic		Replies	Views
Adapt_delta Modeling	4	7020	March 7, 2023
Using samples from the adaptation / warmup phase in NUTS General	8	1151	December 21, 2022
NUTS misses U-turns, runs in circles until max_treedepth Algorithms	66	5512	August 31, 2019
What acceptance probability does `adapt delta` target Algorithms	5	790	March 7, 2023
FYI: BayesFlow (part of TensorFlow) Developers	2	2554	April 17, 2018

Accessible explanation to the No-U-Turn Sampler

Related topics