Resume sampling after interuption

bbbales2 · May 12, 2020, 9:18pm

It’s starting off from something that finished sampling.

It looks reasonable, but a couple problems:

If warmup hadn’t finished, then you wouldn’t have a metric.
I don’t know if there’s any way to cleanly stop a cmdstanpy job if things get interrupted partway through (I don’t know how spot instances work).
I’m scared partial output might break this process.

There’s a checkpointing thread over here that has some info: Current state of checkpointing in Stan - #13 by bbbales2

There is something in cmdstan called the diagnostic file (I think recently [last few days] it has been renamed the latent dynamics file, but I don’t know if that’s made it to cmdstanpy yet) that could be used to do this.

@mitzimorris is there a way to provide a stepsize per-chain in cmdstanpy?

Topic		Replies	Views
Checkpointing with CmdStanPy General	2	550	September 24, 2020
Sampling chains from the middle in cmdstanpy General	10	515	October 8, 2020
Minimizing warmup iterations - Error reading step size from CmdStan output Interfaces bug , cmdstanpy	4	498	August 15, 2023
Saving & reusing adaptation in cmdstanr Interfaces cmdstanr	53	3932	June 8, 2022
Benchmarking and Resuming sampling via DMTCP after interruption Modeling	2	45	February 12, 2025

Resume sampling after interuption

Related topics