Reducing large model file size

I’ve fit a time series model like so:

# ARMA correlation over age within group, plus a quadratic age trend
fit <- brm(event_chances ~ arma(time=age, gr=group, cov=TRUE) + poly(age,2), 
           train_data, 
           family=exponential(),
           prior=priors,
           chains=4, cores=4, iter=2000,
           control=list(adapt_delta=0.95))

Here train_data is around 18,000 rows. This yields a saved model of around 1 GB, which presents some issues for me, as I run some parallelized prediction tasks that export the model N times. I was wondering if there is a way to reduce the model size, either before or after the model is fit?
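
For reference, a quick way to check the size (plain base R; the .rds path is just an example):

format(object.size(fit), units = "auto")  # in-memory size of the fit object
saveRDS(fit, "fit.rds")                   # example path
file.size("fit.rds") / 1024^2             # on-disk size in MB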

Look for the package shredder on GitHub.


Oh man, I’d forgotten about that awesome package. Thanks for the reminder!

Thanks for the suggestion, I wasn’t aware of shredder. My guess is that most of the file size comes from fit$fit@sim$samples, which holds a large number of residual draws (err[n]):

> object.size(fit$fit@sim$samples)
1177236424 bytes

Not quite sure how I should be using shredder to reduce the size of this object. Any thoughts?
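
In case it helps, here’s an untested base-R sketch of the kind of surgery I have in mind (not shredder; dropping draws this way will break anything downstream that needs err, and the stanfit metadata in fit$fit@sim, e.g. fnames_oi, would presumably need matching edits):

# Untested: drop the err[n] draws from each chain's sample list
for (i in seq_along(fit$fit@sim$samples)) {
  keep <- !grepl("^err\\[", names(fit$fit@sim$samples[[i]]))
  fit$fit@sim$samples[[i]] <- fit$fit@sim$samples[[i]][keep]
}
object.size(fit$fit@sim$samples)  # check the reduction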

I’d appreciate any pointers on this too (not to distract from the OP’s question), specifically on how to delete things like all the individual random-effect draws from a brms model object while keeping the ability to predict new levels from the population SD and correlation estimates. [EDIT: assuming I stupidly used save_ranef = TRUE initially and don’t want to take another few days to refit.]
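
For concreteness, the prediction path I’d want to keep working is something like this (new_data is hypothetical; allow_new_levels and sample_new_levels are standard brms arguments):

# Draw effects for unseen levels from the estimated population SD/correlations
pp <- posterior_predict(fit, newdata = new_data,
                        allow_new_levels = TRUE,
                        sample_new_levels = "gaussian")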


I have a similar question about the residuals in my model. Can they be safely deleted while still allowing prediction to work, and if so, can shredder or some other method be used to remove them from the model object?

Maybe tagging @yonicd will help; they may be able to advise on how to use shredder with brms.

Alternatively, I’d been planning to add keep & toss arguments to aria::compose() that would permit the desired behaviour, and I could bump that up in priority if you’re OK with a workflow whereby you use brms to specify the model and then export it to a .stan file (see the sketch below). I’m not sure the resulting rvars representation of the posterior samples would play nicely with any code you have that expects a brms fit, though.
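
The brms-to-.stan half of that workflow is already available via brms itself; a sketch using the model from above (the output path is just an example):

# Generate the Stan program and data that brm() would have used
code <- make_stancode(event_chances ~ arma(time=age, gr=group, cov=TRUE) + poly(age,2),
                      data=train_data, family=exponential(), prior=priors)
writeLines(code, "model.stan")
sdata <- make_standata(event_chances ~ arma(time=age, gr=group, cov=TRUE) + poly(age,2),
                       data=train_data, family=exponential(), prior=priors)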

My solution to this was to refit the model without saving the residuals. This reduced the file size from 1 GB to a much more manageable 28 MB:

fit <- brm(event_chances ~ arma(time=age, gr=group, cov=TRUE) + poly(age,2), 
           train_data, 
           family=exponential(),
           prior=priors,
           chains=4, cores=4, iter=2000,
           control=list(adapt_delta=0.95),
           save_pars=save_pars(group=FALSE))  # don't save per-level group draws
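
A quick way to verify that the residual draws are gone (assuming they were the err[n] terms as above):

any(grepl("^err\\[", fit$fit@sim$fnames_oi))  # should now be FALSE
object.size(fit$fit@sim$samples)              # compare to the ~1.1 GB above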

Thanks for all the suggestions!
