I have some candidate models and a large dataset. I want to compare these models using LOO-CV.
The dataset has ~20 natural subdivisions: different subjects in the same (very long) psychological experiment. Ideally, I would fit each candidate model as a multilevel/hierarchical model and then estimate LOOICs in the usual way. Unfortunately, the dataset is too big to feasibly fit any model to all of the data at once.
My current plan is to fit each candidate model separately to each subject (which isn't too taxing computationally), estimate a LOOIC for each subject-level model, and then combine the LOOICs across subject-level models of the same kind to estimate what the full model's ("supermodel's") LOOIC would be, roughly as in the sketch below.
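In code, the plan looks roughly like this (a Python sketch only; the fitting routine and PSIS-LOO computation are placeholder stand-ins I haven't settled on yet, included just so the sketch runs):

```python
import numpy as np

rng = np.random.default_rng(1)

# Placeholder stand-ins so the sketch is self-contained; in reality these would
# be a proper per-subject Bayesian model fit and a proper PSIS-LOO computation.
def fit_candidate_model(y):
    return {"y": y}  # stands in for a fitted model object

def psis_loo_summary(fit):
    # stands in for PSIS-LOO; returns (elpd_loo, se_elpd_loo, p_loo)
    pw = rng.normal(-1.4, 0.6, size=len(fit["y"]))  # fake pointwise elpd values
    return pw.sum(), float(np.sqrt(len(pw) * pw.var())), 6.0

# ~20 subjects, each with a long series of trials (fake data for illustration)
data_by_subject = {f"subj{i:02d}": rng.normal(size=800) for i in range(20)}

# Fit the candidate model to each subject separately and keep the per-subject
# LOO summaries, to be combined afterwards (see my second question below).
per_subject_loo = {
    sid: psis_loo_summary(fit_candidate_model(y))
    for sid, y in data_by_subject.items()
}
```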
My questions are:
- I feel like this approach is equivalent to fitting a single candidate model to all of the data, but without partial pooling. With so much data per submodel, I know that partial pooling doesn't make much of a difference to the submodel parameter estimates. Besides losing partial pooling, is there anything else disadvantageous about this approach?
- If I have the estimated LOOIC, the effective number of parameters, and the standard error of the LOOIC for each submodel, how do I combine them to get an estimate of the supermodel's LOOIC? (The combination I have been assuming is sketched just below.)
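For concreteness, the combination I have been assuming (treating subjects as independent, so point estimates add and variances add across subjects) is the following, although I am not sure it is valid:

$$\widehat{\text{elpd}}_{\text{super}} = \sum_{k=1}^{K} \widehat{\text{elpd}}_k,\qquad p_{\text{loo},\text{super}} = \sum_{k=1}^{K} p_{\text{loo},k},\qquad \text{LOOIC}_{\text{super}} = -2\,\widehat{\text{elpd}}_{\text{super}} = \sum_{k=1}^{K}\text{LOOIC}_k,$$

$$\text{SE}\big(\text{LOOIC}_{\text{super}}\big) \approx \sqrt{\sum_{k=1}^{K}\text{SE}\big(\text{LOOIC}_k\big)^{2}}.$$

Is this the right way to combine them, or is there a better approach?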