Brm model running time

drnight · July 11, 2024, 10:10am

Hello,
I am running a brm model with longitudinal data (total of 17160 observations) after multiple imputation.

Model formula:

brm1 <- brm_multiple(
  diag ~ age + gender + educat + time*rte + time*dop + time*ztt + time*frt + (1|subj),
  family = bernoulli(link = "logit"),
  prior = set_prior("normal(0,100)", class = "b"),
  cores = 4, iter = 4000, warmup = 2000, seed = 1234, data = data)

My computer has 40 GB of RAM and 10 CPU cores. Previously, I ran the model with flat default priors and iter=10000, but after 18 hours the model was still running. Any help to speed this up would be really welcome!!

Thanks!

zacho · July 11, 2024, 1:43pm

In my experience, setting better priors often results in much greater sampling efficiency and less time to sample.
After the priors, you could consider specifying backend = 'cmdstanr' (and make sure you have cmdstanr installed) – that alone has helped my models speed up in some cases. Plus, that gives you the option to pursue within-chain parallelisation; you’ve got some extra cores, hopefully enough memory, and the Bernoulli likelihood might be expensive enough to warrant it.
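A minimal sketch of what that could look like for the model in the original post, assuming the cmdstanr package and CmdStan are installed. The wrapper name fit_threaded and its argument imputed are hypothetical, and 4 chains with 2 threads each is just one way to use 8 of the 10 cores, not a tuned setting. Everything else is copied from the original call.

```r
library(brms)

# Hypothetical wrapper around the original call; `imputed` stands for the
# multiply imputed data (a mice "mids" object or a list of data frames).
fit_threaded <- function(imputed) {
  brm_multiple(
    diag ~ age + gender + educat + time * rte + time * dop +
      time * ztt + time * frt + (1 | subj),
    data = imputed,
    family = bernoulli(link = "logit"),
    prior = set_prior("normal(0, 100)", class = "b"),
    backend = "cmdstanr",      # requires the cmdstanr package and CmdStan
    chains = 4, cores = 4,     # one core per chain
    threads = threading(2),    # 2 threads per chain: 4 x 2 = 8 cores in use
    iter = 4000, warmup = 2000, seed = 1234
  )
}
```

Calling fit_threaded(data) would then start the fit. Whether threading actually pays off depends on the model and the machine, so it is worth timing a short run first.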

Solomon · July 11, 2024, 6:34pm

Your cores=4, iter=4000 approach looks much better than setting iter=10000, which is a great start. @zacho made fine points, and I’d like to expand a bit on priors. It looks like you’re setting a generic prior on your \beta coefficients, but going with defaults for the other parameters.

1. Consider theory-based priors on your \beta coefficients.
2. Think about a better prior for your \sigma parameter (variation in random intercepts). My go-to for a multilevel Bernoulli model is prior(normal(0, 1), class = sd), to which brm() will assign a lower bound of zero by default.
3. Keep in mind that brm() sets priors for the intercept under the presumption you have mean centered all predictors. If this is not the case, consider either
a. mean centering all your predictors,
b. using the 0 + Intercept syntax (see the brmsformula and set_prior sections in the user guide), or
c. setting center = FALSE in bf() (see the brmsformula section in the user guide).
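As a rough sketch, options b and c above might be written like this for the formula in the original post. The object names (f_b, priors_b, f_c) are hypothetical, and the normal(0, 1.5) intercept prior is only an illustrative placeholder, not a recommendation from this thread.

```r
library(brms)

# Option b: `0 + Intercept` treats the intercept as an ordinary coefficient
# on the uncentered predictor scale, so it takes a class = b prior.
f_b <- bf(
  diag ~ 0 + Intercept + age + gender + educat + time * rte + time * dop +
    time * ztt + time * frt + (1 | subj)
)
priors_b <- c(
  prior(normal(0, 1.5), class = b, coef = Intercept),  # placeholder value (assumption)
  prior(normal(0, 1), class = sd)                      # SD of the random intercepts
)

# Option c: keep the usual formula but switch off internal centering.
f_c <- bf(
  diag ~ age + gender + educat + time * rte + time * dop +
    time * ztt + time * frt + (1 | subj),
  center = FALSE
)
```

Either formula object can be passed as the formula argument of brm() or brm_multiple() in place of the bare formula.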

Also, consider not only mean-centering but actually standardizing any continuous variables (perhaps age and time in your data). This will make it easier to assign better priors to the \beta coefficients. IMO, prior(normal(0, 1), class = b) is a great weakly-regularizing default prior for \beta coefficients on standardized predictors in a Bernoulli model, like yours. It’s a pretty good default for any dummy-coded categorical variables, too (possibly like gender in your data).
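A small sketch of those two priors together, plus plain standardization of one continuous predictor. The age values are made-up toy numbers, and the object names are hypothetical.

```r
library(brms)

# Weakly regularizing priors; these assume continuous predictors are
# standardized and categorical predictors are dummy coded.
priors <- c(
  prior(normal(0, 1), class = b),   # population-level coefficients
  prior(normal(0, 1), class = sd)   # SD of the random intercepts
)

# Toy stand-in for one continuous predictor (hypothetical values).
age <- c(61, 70, 55, 68, 74)
age_z <- as.numeric(scale(age))     # (age - mean(age)) / sd(age)
```

The priors object would go in the prior argument of brm() or brm_multiple(), replacing the normal(0,100) prior from the original call.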

Solomon · July 11, 2024, 6:38pm

Also, since you’re swimming in data, consider first fitting and fully debugging your model with a random subset of, say, 10% of your cases. The debugging process would not only include making sure all your syntax is correct, but also making sure your priors are working as intended. The 10% subset approach could save you a lot of time in this phase.
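One way to draw such a subset in base R, sketched on a toy long-format data frame. Sampling whole subjects rather than individual rows is an assumption on my part, made so that each person's repeated measures stay together. The data frame and its column values are hypothetical.

```r
set.seed(1234)

# Toy long-format data standing in for the real data set (hypothetical).
dat <- data.frame(
  subj = rep(1:200, each = 4),
  time = rep(0:3, times = 200)
)

# Sample 10% of the subjects and keep every row that belongs to them.
ids <- unique(dat$subj)
keep <- sample(ids, size = ceiling(0.10 * length(ids)))
dat_sub <- dat[dat$subj %in% keep, ]
```

With multiply imputed data, the same subject IDs would need to be kept in every imputed data set so the subsets stay comparable.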

drnight · July 12, 2024, 5:28pm

Many thanks Solomon! Will try that! :)

drnight · July 12, 2024, 5:29pm

Many thanks Zacho, will try that :)

drnight · July 31, 2024, 9:12pm

Hi Solomon,

Regarding the predictors’ standardization, I am wondering if you would suggest any particular approach as my dataset contains longitudinal / repeated measures data (it’s in long format originally). Would you standardize in wide format (by time point)?

Thanks

Solomon · August 1, 2024, 8:29pm

If I was fitting, say, a longitudinal growth model, I’d standardize my variables based on the first time point.

drnight · August 1, 2024, 8:53pm

Many thanks for your advice! Do you mean standardising as (x (observed value) - u (mean at baseline)) / sd (baseline)?

Thanks a lot!

Solomon · August 1, 2024, 9:05pm

yep
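For reference, a minimal base R sketch of the baseline standardization confirmed above. The toy data are made up, and treating time == 0 as the baseline and rte as a continuous predictor are both assumptions.

```r
# Toy long-format data (hypothetical); time == 0 is taken to be baseline.
dat <- data.frame(
  subj = rep(1:3, each = 3),
  time = rep(0:2, times = 3),
  rte  = c(10, 12, 15, 20, 19, 23, 30, 33, 31)
)

# Mean and SD computed from the baseline rows only.
base <- dat$rte[dat$time == 0]
mu_0 <- mean(base)
sd_0 <- sd(base)

# Apply the baseline mean and SD to every time point.
dat$rte_z <- (dat$rte - mu_0) / sd_0
```

Because every time point is scaled by the same baseline mean and SD, change over time stays visible in the standardized variable. Standardizing separately within each time point would remove it.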
