NUL Characters in cmdstan output

Thomas · March 16, 2021, 10:53am

I analyzed a model using cmdstan and the output csv file (~250 Mb) has some NUL characters in it. Is this a recognized failure mode for cmdstan, i.e., an indication that cmdstan could not generate samples from the posterior distributions on that particular iteration? Or, is it telling me something else?

bbbales2 · March 16, 2021, 12:11pm

That seems strange to me. I don’t know of a reason an unusual character would be there (the csv is supposed to be human readable).

Do you have a model that is easy to share that reproduces this behavior?

Thomas · March 16, 2021, 12:35pm

The code takes ~35 days to run on the cluster at school, which may make duplication of the problem difficult. A cut down version of the problem with the code running half the data works fine. Two out of four chains have completed running and both have NUL characters in the csv output. Could this have been a glitch with the cluster?

bbbales2 · March 16, 2021, 12:39pm

Not sure. Let me get a 2nd opinion. @mitzimorris do you know if Stan ever emits nulls in the output csvs?

mitzimorris · March 16, 2021, 4:55pm

Stan doesn’t emit NUL.

something that runs ~35 days might have run up against either disk space, memory, or processing time limits on the cluster - if the process is terminated, no way to record the error, other than to check the process return code.

the two chains that return OK - how long did they take to run?

bbbales2 · March 18, 2021, 12:57pm

Sounds like this could be the case then. I haven’t (knowingly) hit this before myself with long running models and sounds like Stan doesn’t emit these things normally.

Thomas · March 18, 2021, 2:24pm

I spoke to the HPC folks about this and they said that the job did not hit any memory/time/space limits. They think it was a “noisy neighbor” problem because I was using only one core on the 25-core cpu and the job ran for a long time.

Tom

Topic		Replies	Views
Read_stan_csv error General	10	1139	August 26, 2021
Lack of precision/truncation of log posterior trace on cmdStan (but not PyStan) CmdStan	3	1036	January 18, 2019
Reproducibility of a non-linear model CmdStan cmdstanpy	3	207	April 23, 2024
CmdStan 2.30 is now available Announcements	4	693	July 27, 2022
Cmdstan cluster sampling speed CmdStan	3	78	January 10, 2025

NUL Characters in cmdstan output

Related topics