Help building a faster model

Cchambe12 · August 10, 2018, 9:29pm

I’m currently working on building a model that looks at the number of late spring freezing events based on mean spring temperature, NAO, and elevation in brms. I have about 1 billion rows of data so it is very, very slow. It can take up to a week to run.

Here’s a made up dataframe:

cc <-sample(c(0,1), replace=TRUE, size=150)
species <-sample(c("Acer", "Betula", "Quercus", "Fagus"), replace=TRUE, size=150)

df<-data.frame(freeze=rnorm(150, mean=3, sd=1),
mat=rnorm(150, mean=0, sd=5),
nao=rnorm(150, mean=1, sd=2),
elevation=rnorm(150, mean=4, sd=1),
cc=cc,
species=species)

And my current model:

fit<- brm(freeze ~ nao + mat + elevation + cc + nao:cc + mat:cc + elevation:cc +
(nao + mat + elevation|species), data=df, control = list(max_treedepth = 12,adapt_delta = 0.99),
chains = 4, cores = 4)

Do you have any suggestions to speed this up? Thanks!

bgoodri · August 10, 2018, 9:39pm

With 1 billion observations and a Gaussian likelihood, you should checkout stan_biglm in the rstanarm package, which uses sufficient statistics.

Cchambe12 · August 14, 2018, 11:25am

Thanks, Ben, it looks great. Is there any way to do a mixed effects model with stan_biglm?

bgoodri · August 14, 2018, 2:18pm

No, but you can do interaction terms with group indicators.

lizzieinvancouver · August 20, 2018, 10:21pm

Hi Ben, Just to be dense on language, do you mean adding a fixed effect for the group and interactions with it? So if species is the group, instead of:

fit <- brm(freeze ~ nao + mat + mat:cc +(nao + mat|species)...)

It would be something like:

fit <- brm(freeze ~ nao + mat + mat:cc + species + nao:species + mat:species)...)

Thanks,
L

bgoodri · August 20, 2018, 11:31pm

Yes

Cchambe12 · October 8, 2018, 3:26pm

Hi Ben, is there a way to use binomial or poisson models with stan_biglm?

Thanks!

bgoodri · October 8, 2018, 3:44pm

No. If the GLM is not Gaussian and / or the inverse link function is not an identity, then there are no sufficient statistics.

Topic		Replies	Views
Slower and freez machine when run brm model with brms package rstanarm hierarchical-model	24	1627	May 27, 2020
Brms sampling/speed errors in multi-level model brms	6	1224	February 16, 2020
How to speed up `brms::loo_subsample()` for large models brms loo , hierarchical-model , model-comparison	16	1538	October 25, 2022
Speeding up a Student model with 3 correlation matrices Modeling specification , performance	14	777	February 15, 2021
Running speed - phylogenetic models brms brms	17	998	February 23, 2024

Help building a faster model

Related topics