NUTS misses U-turns, runs in circles until max_treedepth

bbbales2 · July 25, 2019, 12:24am

Yeah, so this showed up in a hierarchical model here: Request for Volunteers to Test Adaptation Tweak - #33 by bbbales2

I think that model has 300+ parameters?

@avehtari has been talking about making the uturn criteria less stringent for a while. I think he’s seen suspicious behavior in other models. We were specifically looking at the uturn criteria here since it seemed like something interesting was happening. The clearest plot I think was to plot, for every uturn check done as part of the binary tree-building, the minimum of the two uturn criteria (there are two checks done between every two pairs of points) along with the number of leapfrog steps in that path.

The way things are plotted here going below 0.0 means uturn (the scores are dot products of normalized vectors – so take the inverse cosine to get the angle between them):

scores_vs_distance

These are the uturn criteria checked in one trajectory building. Soon after hitting a doubling to a length 128 trajectory, we found a uturn and jumped out. There are actually two checks here that count as uturns – normally there’d just be one before you break the loop but the way I wrote the debug code you might get two. That’s just an artifact or whatever.

Anyway you see that with 8 leapfrog steps we’re neeeeearly U-turning. In fact, when we keep building and building the tree we eventually do get an 8 leapfrog U-turn. But for 16 leapfrog steps and 32 we’re quite a bit worse off according to the Uturn criteria at least.

So @avehtari was talking about making the uturn criteria something smaller. So maybe not a full 90 degree turn. What we’ve talked about in this thread is adding extra checks of intermediate length (as low as just +1 to our previous checks). I think we could also do something based on the collection of uturn criteria here as well. Like, if we do a tree doubling and our uturn criteria for the combined tree is worse than the criteria for both of the subtrees, bail. And maybe mix in a threshold parameter too.

Code for making that plot coming eventually.

Topic		Replies	Views
Best Practice: Max Treedepth During Warmup General	3	626	February 21, 2021
HMC (jittered) vs. NUTS on 1000-dimensional standard normal Algorithms mcmc	9	3852	April 29, 2019
Acceptance ratio in NUTS Developers	4	909	June 28, 2021
Using samples from the adaptation / warmup phase in NUTS General	8	1130	December 21, 2022
The role of "max_treedepth" in No-U-Turn? General fitting-issues	9	5774	September 6, 2021

NUTS misses U-turns, runs in circles until max_treedepth

Related topics