New guide to warnings: request for feedback

martinmodrak · January 14, 2022, 4:36pm

Some time ago, we had a discussion on updating the text of warnings for divergent transitions, treedepth etc (https://discourse.mc-stan.org/t/text-for-warning-message/), this led me @andrewgelman , @jonah and @avehtari to attempt to update the document that currently sits at https://mc-stan.org/misc/warnings.html (the link currently leads to the old version!) The idea is that this document could then be linked from warning messages in all interfaces and provide a basic overview of what the warnings mean and what one can do about them.
We’d be very happy to get feedback from broader community before making this part of the official docs.

A potential problem is that due to our background a lot of the linked resources are R centered. If you know of good resources using Python, please link them where relevant!

Beyond hopefully helping some users to figure out the problem with their model on their own, the aim is that a) it will be easier for people to find additional resources and b) even if the user is unable to resolve the problem on their own, it will be easier to help them (e.g. here on Discourse), because they’ll provide more relevant information - that’s a big reason why “Simplify your model” is the first hint provided.

Beydon the original warnings page, the text builds on Divergent transitions - a primer

The current version can be found at:

where you can directly provide comments and/or edit the text (but comments here on Discourse are also welcome)

Tagging people who participated in the previous discussions on this topic: @betanalpha, @bgoodri , @bbbales2 , @mitzimorris, @spinkney , @mcol, @Max_Mantei

BFiles · January 14, 2022, 6:29pm

As a user in constant need of help, I am hungry for more advice on how to make it easier to help me. The guide to warnings might not be the primary place to find such advice, but since there’s already a section at the end with some best practices, I wonder if this point could be expanded or maybe include links to more detailed advice on getting help.

Specifically, are there existing examples or other guidance on what it really means to “start simple” and slowly add complexity? Maybe a worked-through example of a moderately complicated model that, when implemented in a straightforward/obvious/naïve way leads to some warnings, along with a step-by-step of starting with simple models that leads to the revelation of where these warnings start showing up and how that would lead to a solution.

andrewgelman · January 14, 2022, 10:46pm

That’s a good idea. The Bayesian workflow article (http://www.stat.columbia.edu/~gelman/research/unpublished/Bayesian_Workflow_article.pdf) and forthcoming Bayesian workflow book is supposed to do this, but the examples there are kinda complicated, so we’re planning to set up a simpler example to demonstrate these issues.

Michelle · January 15, 2022, 1:28pm

Hi Andrew, may I ask when this book is coming? Can I preorder it?

jtimonen · January 15, 2022, 5:41pm

Good initiative. Here some feedback:

You might not be used to seeing so many warnings from other software you use, but that does not mean that Stan has more problems than that other software.

Very true.

As the warning message says, you should call pairs() on the resulting object

This is not really practical if you have > 100 parameters for example.

Red points indicate divergent transitions.

One point is not a transition, so what do these points really indicate? If I remember correctly someone said here on the forums that it is a point that is sampled from a trajectory which at some point diverged. I.e. the red point is not the point where the trajectory diverged, and it could have actually been anywhere. So this could be clarified.

In our experience, divergent transitions that occur above the diagonal of the pairs() plot — meaning that the amount of numerical error was above the median over the iterations

I don’t understand this. What diagonal is meant here and how is it connected to numerical error of the trajectories?

iter argument.

There’s no iter in some interfaces, only iter_warmup and iter_sampling. At some point the text seemed to implicitly start assuming that RStan is used.

it is essential that you follow these recommendations:

I would say that following the practices in this “getting help” section may help you but they are not really essential. Starting to use version control for example can be a big hurdle for some more applied users.

andrewgelman · January 15, 2022, 9:08pm

A bunch of us are working on it, led by Dan Simpson, Aki Vehtari, and myself. When it’s done, we’ll announce it here and elsewhere!

martinmodrak · January 18, 2022, 12:27pm

Thanks for the feedback everybody!

In fact between the last draft of this document and now, I’ve written one such example at Small model implementation workflow • SBC - it relies on the SBC package for some functionality (it uses simulation-based calibration to check for bugs/problems), but the core ideas are IMHO accessible even without understanding SBC. Just looking at the sequence of Stan models built there IMHO demonstrates the core principles quite well.

I added the case study as another reference in the document.

This phrase only exists in the current version (at Runtime warnings and convergence problems), but is not found in the proposed new version (at Runtime warnings and convergence problems - HackMD), so I fear you’ve been reviewing the old version - sorry for the confusion. In the new version, we removed most of the discussion of the details of the pairs plot as it is a bit interface specific. Instead we link to relevant documentation in the packages (which hopefully contains enough info for users to find this).

That’s a good point (this wording survived into the new proposed version), I adjusted it.

Thanks again!

WardBrian · January 18, 2022, 2:09pm

The old version of the document still references the warnings outputted by stanc2, which at this point only applies to RStan (and hopefully not for too much longer). I noticed the new version is exclusively runtime warnings, which is probably a good distinction.

We already have a section on the stanc3 warnings (and errors) in the doc here: 33.2 Understanding stanc3 errors and warnings | Stan User’s Guide, which might be good to cross-link with the runtime warnings

jtimonen · January 18, 2022, 2:55pm

Ah I see, dumb me.

martinmodrak · February 7, 2022, 8:16am

Just bumping this up to see if we can get more feedback before going live :-)

andrewgelman · March 3, 2022, 1:02pm

It looks good to me!

jonah · March 10, 2022, 11:37pm

The updates are now online at Runtime warnings and convergence problems. Thanks everyone for their feedback and @martinmodrak for taking the lead on this!

Topic		Replies	Views
Divergent transitions warning message Developers divergences	7	1267	March 23, 2021
Text for warning message General	46	6003	August 11, 2020
Max_treedepth warnings in rstan 2.15.x RStan	9	2886	May 16, 2017
Website page on Guide to Stan's Warnings Developers	3	515	April 14, 2020
Can I ignore Max_treedepth and adapt_delta warnings if I am getting desirable results? Modeling fitting-issues	1	1712	September 2, 2020

New guide to warnings: request for feedback

Related topics