Fitting Multi-level models in batches?

OBidz · March 29, 2021, 6:24pm

I fit multi-level regression models in R using brms, with id-level slopes and intercepts. I have data for many individuals (1000s) and models take a while to fit (5+ hours), so recently I’ve thought it might be useful to fit models in ‘batches’ of individuals, so that maybe I can run them in parallel on many machines/nodes or run them sequentially and save progress.

My initial thought was to fit the models on subsets of the data, and then use brms’ combine_models() to combine them, but of course with the id-level parameters differing from batch to batch (because there are different IDs in there) I get Error: Models 1 and 2 have different parameters.

Does anyone know of a good way to tackle something like this? Would it be valid to concatenate the posterior samples from each batch to brute force it, or is there a way to chain these models so the population-level parameters of the first batch become the priors for the second? Is there potential to bias results here depending on the order of fitting these batches?

As always, thanks in advance for any help advice you can offer on this.

mike-lawrence · March 29, 2021, 8:33pm

With infinite individuals, yes, but with finite samples you’re not going to get posteriors that can/should be lumped together.

This is generally not recommended as you inevitably end up having to summarize the posterior to turn it into a prior causing information loss and unreliable propagation of uncertainty thereby.

Have you tried the options for within-chain parallelization? Also see here for a speed-up trick that’s often helpful for hierarchical models with highly-redundant design matrices.

OBidz · March 29, 2021, 11:46pm

I did look at within-chain-parallelization and that did speed things up. I was really looking for a way to save progress on the fitting periodically though. Perhaps a better strategy would be to fit a model for all IDs but for batches of iterations, and combine those models?

Topic		Replies	Views
Brms sampling in multi-level multinomial models Modeling specification , brms , meta-analysis	8	2078	March 16, 2021
Fitting multiple models with multithreading brms techniques	1	536	August 14, 2022
Possible to fit two models at once? General techniques , specification , performance	3	879	March 29, 2023
Multilevel model with responses on different levels? brms specification , multivariate-normal , brms	1	682	November 4, 2021
Correlated posteriors in a multilevel binomial regression brms fitting-issues	3	607	July 19, 2018

Fitting Multi-level models in batches?

Related topics