This is a misunderstanding; I was not specific enough.
I meant that if a model does not even start sampling, you need to set chains = 1 in order to see all the error messages, which tell you which line of the model code has the problem.
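For example, in RStan a single-chain debugging run might look like this (the model and data object names are placeholders):

```r
library(rstan)

# Placeholder names; any model that fails to start sampling works here.
fit <- sampling(compiled_model, data = stan_data,
                chains = 1, cores = 1)  # one chain, one core: all messages reach the console
```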
It is of course an entirely different matter once the model produces samples. Then you need multiple chains to assess potential scale reduction, check that no chain produces divergences, etc. (I never thought of that part of the workflow as “debugging”.)
I also find that RStan swallows messages when running in parallel. I tend to debug in CmdStan for this reason. In R, I have the number of chains set to the number of cores whenever I start R and I can’t remember how to reset it off the top of my head.
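A sketch of how that setting is usually managed in R (the startup line typically lives in a `.Rprofile`; resetting it is just another `options()` call):

```r
# Typical startup setting: one chain per reported core.
options(mc.cores = parallel::detectCores())

# To get serial execution for debugging, drop back to one core:
options(mc.cores = 1)

getOption("mc.cores")  # check the current value
```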
Also, the detectCores() thing is wrong in that it picks up Intel’s “hyperthreading” feature as if each hyperthread were another core. So my four-core i7 reports 8 cores, even though there are truly only four physical cores. Running on about 5 cores is ideal for me when only running a Stan model (or a build process, which is where I usually set cores), but it will depend on what else is going on on the machine.
As much as we’d like to put all the gotchas in a “prominent” position in the doc, we don’t have that many prominent positions. Do we say “don’t install with a space on Windows” or “debug with one core”?
The problem with “recommend prominently” is that every time someone runs into a problem they ask us to put the solution at the “top of the web page”. Alas, there’s only one top of the doc.
I am not sure there is a bug. You can call parallel::detectCores(logical = FALSE) to get 4 rather than 8 on my laptop. But if you are only running 4 chains, it doesn’t matter. In the past, I found that running 8 chains with 4 cores and 2 threads each was slightly faster than running 8 chains on 4 cores used twice, but something in between may be faster still, depending on the model and on non-Stan activity.
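For reference, the two calls differ only in whether hyperthreads are counted; on the four-core i7 described above they return 8 and 4 respectively:

```r
parallel::detectCores()                 # logical cores, counting hyperthreads
parallel::detectCores(logical = FALSE)  # physical cores only
```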