Pickling error?

bkaplowitz · August 15, 2018, 4:01am

So far I have been exploring pystan/stan and greatly enjoying it. However, I just tried to run a NUTS sampler with 2 million samples on 12 threads with 800,000 burn-in for around 500 variables (two latent variable time series that are difficult to sample directly). The sampling took a while but finished successfully. However, on attempting to save the fit results I got a long error (that looked kind of like a segfault with hexadecimal strings) ending in

error: 'i' format requires -2147483648 <= number <= 2147483647

Looking on github, it seems the error is due to limitations of pickling. https://github.com/joblib/joblib/issues/387 Is there any way around this error, either via some fix, option I’m not using or hack?

Thanks in advance!

bgoodri · August 15, 2018, 4:59am

Do a few hundred iterations and stop. 800,000 and 2,000,000 are only reasonable for sampling schemes that do not mix well, and the whole point of using NUTS is that it does mix well for lots of models.

bkaplowitz · August 15, 2018, 5:27am

Hi Bgoodri, normally I’d agree, but with this particular model, the posterior distribution does not appear to settle down until >400,000 burn-in. With MCMC, I think it’d be impossible to accurately sample. I wanted to make sure it was truly settled down at 400,000 so wanted to run for 800,000 burn-in. The 2 million is because I am thinning to be safe. That is the first thing I’d cut, however, if there is no option but to reduce sample size (which seems like a major software limitation if that is the case.)

bgoodri · August 15, 2018, 5:53am

If it takes 400,000 warmup iterations, you have bigger problems than pickling. I would say the same thing at 4000.

ahartikainen · August 15, 2018, 9:00am

There are some options:

Run your model with n_jobs=1
Use CmdStan
Manually fix multiprocessing-module (Currently I don’t remember how that was done)

Topic		Replies	Views
Stan sampler gets stuck General	15	1672	May 17, 2020
New to Pystan, Always get this error when attempting to sample: ModuleNotFoundError: No module named 'stanfit4anon_model...' Modeling pystan	9	5406	February 9, 2022
Inference of huge data causes an encoding error by PyStan PyStan	6	3232	March 25, 2022
PyStan with pickle only saves first 100 values? PyStan specification	4	847	August 23, 2020
"Bad message length" error PyStan	5	2557	June 21, 2017

Pickling error?

Related topics