Exit Stan on Divergences

jmh530 · February 16, 2018, 7:31pm

Would there be any way to have Stan exit upon getting a specific number of divergences? I would think it would be do-able on a chain-by-chain basis. Seems silly to have to sit through N number of chains running if I get 20 divergences per chain and it doesn’t tell me until the end.

bgoodri · February 16, 2018, 7:33pm

Possibly after a certain number of divergences after warmup. It almost always diverges the iteration after a window change during warmup and possibly many more times as it is trying to adapt. However, you often want to let it run to the end to see where the divergences tend to occur.

jmh530 · February 16, 2018, 7:35pm

I get that, but sometimes when you’re prototyping things you just want to
know if it’s working or not on a general basis. Like if you coded something
incorrectly.

Bob_Carpenter · February 17, 2018, 9:16pm

That’s an interesting idea. It’d be nice to have termination filters that’d let you specify this kind of thing. An even more ambitious plan that would use cross-chain information would terminate if the chains are moving internally but not mixing.

As is we just stream out the answer and don’t accumulate summary stats like number of divergences as we go. It would be possible, but would be some work with the current infrastructure.

sakrejda · February 17, 2018, 9:20pm

If you use CmdStan or save the rstan output to a .csv file as you go you can monitor this stuff yourself. That’s not a good general answer and I like the idea but it is a work-around that’s very functional at the moment.

Bob_Carpenter · February 17, 2018, 11:09pm

What we need is an online accumulator where you can monitor things like non-split R-hat and means and variances of chains. Wouldn’t be that hard with the streaming output. But it’d be hard to maintain and support cross-platform.

mike-lawrence · February 18, 2018, 3:48pm

This has been on my to-do list with my R package ezStan, which already does some minimal processing of the sample files for my alternative progress watcher. I happen to have been working on an update this week fixing a few things and could look at adding this finally. Is there a function in rstan to detect divergences?

jmh530 · February 18, 2018, 4:04pm

I’m glad this has garnered some interest. Sounds like I’m not the only one!

mike-lawrence · February 18, 2018, 4:13pm

Think I found what I need at https://github.com/stan-dev/rstan/blob/d84eb5ebd9bcfc0f3bfe6455402678b04a740485/rstan/rstan/R/check_hmc_diagnostics.R

Will report back in a day or two.

Bob_Carpenter · February 18, 2018, 4:26pm

The trick is to use Welford’s algorithm to accumulate sufficient statistics. That covers means and variances. I just added this to our new road map; I’m trying to keep it to big features we would actually like to build.

mike-lawrence · February 18, 2018, 4:32pm

Since I’m looking at the samples anyway, I should also be able to add the time-based and ess-based termination criteria. Did you have anything in mind for how to specify/guess the number of warm-up iterations for these?

Bob_Carpenter · February 18, 2018, 4:53pm

We were thinking that if we were timing, we’d try to split the time evenly between warmup and sampling.

As to the online things, I’m more worried about being able to diagnose when warmup has converged (the mass matrix and step sizes should converge and we should find the high probability mass of the posterior).

mike-lawrence · February 18, 2018, 4:56pm

Ah, presumably by doing a few iterations first to get a feel for the time/iteration?

mike-lawrence · February 18, 2018, 4:58pm

Is there any existing code in rstan to compute these checks?

Bob_Carpenter · February 18, 2018, 5:00pm

This is a problem because we adapt as we warmup, so early warmup iterations are in the wrong space of the posterior and haven’t fully adapted step size (integration time) and mass matrix (metric). If we knew how to do this easily, it’d already be done! But we haven’t really even begun experimenting.

Bob_Carpenter · February 18, 2018, 5:01pm

No. In the end we don’t really care if adaptation has converged if the chains mix well. It just seems like that it’s the only way to figure out when we should stop adapting. We could also run some real sampling in parallel and measure that at various points. That’ll give us a real read of how much time we have left and mixing from where we’re at.

mike-lawrence · February 18, 2018, 5:09pm

Ok, then I’ll focus on merely online reporting of the existing metrics (divergences, max tree depth, ess, rhat) post-warm-up for the near term. The termination stuff feels more like it should be in Stan rather than rstan or an R helper package anyway.

Bob_Carpenter · February 19, 2018, 1:05am

We’ve gone back and forth between designs where the sampler is a simpler iterator to the one we have now where everything gets controlled through a combination of services config and callbacks.

mike-lawrence · February 19, 2018, 6:45pm

Just pushed an update to ezStan that will show post-warmup divergences as they occur. Installation & usage demo here, let me know if you encounter any bugs!

mike-lawrence · February 19, 2018, 7:48pm

Note, I’m really looking forward to the sample storage refactor; what I do in watch_stan to watch the sample files is a rather fragile hack.

Topic		Replies	Views
Extending a non-converged run? General	4	423	March 23, 2020
Rstan swallows divergent transitions Developers	6	1443	April 21, 2017
Output divergences/max_treedepth hit during the run? CmdStan	14	958	February 25, 2022
Spawned processes not being shut down when running multiple chains in Rstudio RStan	3	979	August 20, 2019
Stop stan when it reaches convergence (Rhat = 1) RStan	1	449	May 31, 2021

Exit Stan on Divergences

Related topics