Double free or corruption (out) depending on Linux version

jfhawkin · March 15, 2022, 5:48pm

I am running a model on my university cluster. It’s my first time running my Stan code anywhere except my laptop. The computing center support team has helped me narrow the issue down to a difference between the interactive RStudio interface and the terminal. In response to my initial question, they stated:

It looks like this is a side effect of using the rocker containers to provide Rstudio. They are built on Ubuntu 20.04, which has a slightly newer glibc/libstdc++ version than CentOS 8 which is used for the cluster OS. The immediate workaround is to uninstall the rstan package and reinstall it using the terminal via the R/4.1 module. This will use the older glibc/libstdc++ from CentOS 8. Since those are forward compatible, it should work in the RStudio environment as well.

I reinstalled my packages through the terminal so they are compiled against CentOS 8. When I compile my model through RStudio, it works fine. However, it’s a fairly large dataset and the model takes a long time to run, so I need to run it through the terminal to make full use of the cluster (additional CPUs are available with this option). If I compile the model in the terminal (on CentOS 8), it gives me:

Chain 1 double free or corruption (out)
Warning: Chain 1 finished unexpectedly!

Any ideas I could try or bring back to the support team to get this working?

jfhawkin · March 16, 2022, 3:31pm

We’ve worked out a short-term solution to run the model on an Ubuntu virtual environment. It’s limited to 20 cores, which should be sufficient for me. I’d still like to get the initial problem sorted out though.

mike-lawrence · March 16, 2022, 6:30pm

Are you setting any custom compile flags?

jfhawkin · March 16, 2022, 6:42pm

No. Is that something I should be doing?

mike-lawrence · March 16, 2022, 6:44pm

No, I’ve encountered “double free or corruption” errors on Ubuntu with “-march=native” in my make/local and wondered whether you were hitting that too.
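For anyone who wants to check this on their own install, here is a minimal sketch. It assumes cmdstanr is installed and uses its cmdstan_make_local() function to read the current make/local contents; has_march_native is a hypothetical helper name, not part of cmdstanr.

```r
# Hypothetical helper: TRUE if any make/local line sets -march=native.
has_march_native <- function(make_local_lines) {
  any(grepl("-march=native", make_local_lines, fixed = TRUE))
}

# Only query the real installation if cmdstanr is available; errors
# (e.g. no CmdStan path set) are treated as "no flags found".
if (requireNamespace("cmdstanr", quietly = TRUE)) {
  flags <- tryCatch(cmdstanr::cmdstan_make_local(),
                    error = function(e) character(0))
  print(has_march_native(flags))
}
```

If this prints TRUE, removing the flag from make/local and recompiling the model is a cheap thing to try before digging further.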

Richard_Border · March 16, 2022, 11:23pm

Are you parallelizing across multiple compute nodes or are you just using multiple CPUs in a single machine? If the latter (and probably the former as well, just would require more effort), why not just write an R script and call it from the command line?
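A rough sketch of what such a script could look like, assuming cmdstanr and a single multi-core machine. The file name run_model.R, the helper names, and the SLURM_CPUS_PER_TASK variable are all assumptions for illustration; your scheduler may expose the core count differently.

```r
# run_model.R (hypothetical file name); launch with: Rscript run_model.R

# Hypothetical helper: read a core count from an environment variable,
# falling back to `default` when the variable is unset or not a positive integer.
cores_from_env <- function(var = "SLURM_CPUS_PER_TASK", default = 1L) {
  n <- suppressWarnings(as.integer(Sys.getenv(var, unset = NA)))
  if (is.na(n) || n < 1L) default else n
}

# Compile and sample, running as many chains in parallel as the cores allow.
# `stan_file` and `data` are placeholders for your own model and data list.
run_model <- function(stan_file, data, chains = 4L) {
  model <- cmdstanr::cmdstan_model(stan_file)
  model$sample(
    data = data,
    chains = chains,
    parallel_chains = min(chains, cores_from_env())
  )
}

# Example call (commented out because the file and data are placeholders):
# fit <- run_model("model.stan", list(ymean = 0))
```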

andrjohns · March 17, 2022, 4:05am

Are you using rstan or cmdstanr? If you aren’t already, I’d recommend trying your model with cmdstanr, since it tends to be more resistant to crashes and undefined behaviour than rstan.

There’s more background on getting things up and running in this article: Getting started with CmdStanR • cmdstanr

jfhawkin · March 17, 2022, 1:14pm

Thanks for the feedback. I should perhaps provide some more details. I am using cmdstanr. The model runs on Ubuntu (i.e., it compiles and generates samples). If I install cmdstanr on CentOS, it will compile the model but gives the error when I call model$sample(). If I compile the model on CentOS and run it in the Ubuntu interactive environment so I can use RStudio, it gives the same error. I’m setting up an Ubuntu environment on the HPC, which is an option they offer with some limitations. I was hoping to test it yesterday, but it ended up taking longer than expected and I should really be working on a proposal today…

@Richard_Border Yes, that’s exactly what I’m doing. I can either run an R script from the terminal (CentOS 8) or work interactively in RStudio (Ubuntu 20.04). Neither option works if I compile the model from the terminal (using CentOS 8).

andrjohns · March 17, 2022, 1:22pm

What compiler version is installed on the CentOS instance (e.g., what’s the output of g++ --version)?

jfhawkin · March 18, 2022, 9:35pm

@andrjohns the compiler version is 10.3.0

andrjohns · March 19, 2022, 6:39am

Alright, so that rules out the usual suspects! Let’s ignore anything Ubuntu/interactive for now, and just consider the CentOS instance.

The first step is to test whether it’s cmdstan in general, or if your model in particular has found a bug.

Can you run and post all output from the following code:

library(cmdstanr)

test_mod <- " data { real ymean; } parameters { real y; } model { y ~ normal(ymean, 1); }"
model <- cmdstan_model(write_stan_file(test_mod), quiet = FALSE)
sampled <- model$sample(data = list(ymean = 0), refresh = 0)

jfhawkin · March 21, 2022, 2:55pm

I guess the problem was that I was mixing too many different OSes and Stan implementations. I set up a basic model using brms, then switched to cmdstanr, whereas I had used rstan on past projects.

The key points:

Needed to ensure I compiled the model on CentOS and not Ubuntu.
Needed to ensure I ran install_cmdstan() and not just install.packages("cmdstanr"). I ran it when I initially tested things in the RStudio/Ubuntu environment, but I guess I missed something on the CentOS install.
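In case it helps someone else, a minimal sketch of a check for the second point, to run on the machine that will actually do the sampling. cmdstan_is_installed is a hypothetical helper name; cmdstan_path(), install_cmdstan(), and the force_recompile argument are from cmdstanr, and "model.stan" is a placeholder.

```r
# Hypothetical helper: TRUE if cmdstanr is installed and can locate a
# CmdStan installation directory on this machine.
cmdstan_is_installed <- function() {
  if (!requireNamespace("cmdstanr", quietly = TRUE)) return(FALSE)
  path <- tryCatch(cmdstanr::cmdstan_path(),
                   error = function(e) NA_character_)
  !is.na(path) && dir.exists(path)
}

if (!cmdstan_is_installed()) {
  message("CmdStan not found; run cmdstanr::install_cmdstan() on this machine.")
}

# After installing, rebuild the executable on the same OS that will run it
# (commented out because "model.stan" is a placeholder):
# model <- cmdstanr::cmdstan_model("model.stan", force_recompile = TRUE)
```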

Thanks for the help, everyone!
