Repeated K-fold Cross Validation

itowers · April 19, 2021, 1:10am

Hi everyone,

First of all, thank you to the developers of BRMS for creating such an accessible package for Bayesian analyses!

I am currently attempting to perform model comparison between 9 models using K-fold cross validation (K=10) because performing LOO yields a high number of Pareto K > 0.7.

Presumably on account of the random data selection using the K-fold process, the ranked order of the models changes if one compares one run of the K-fold validation for all models to another (examples attached, where g.m1 is the simplest model). Thus, I was wondering if there is a way to “average” across multiple K-fold model comparisons in BRMS, which I believe is a process known as “Repeated k-Fold Cross-Validation” in other statistical softwares? I should note that, in most cases, the most simple model is not significantly different from the “best”-performing model. Ideally, my goal is to present an averaged ELPD_diff and SE_diff model comparison to readers. If there is no way to average across K-fold comparisons, are there any other recommended approaches for how to deal with randomness in the ELPD order of selected models?

Thanks again and please let me know if more information is required!

Example 1

Example 2

avehtari · April 21, 2021, 12:39pm

You can use kfold helper functions to make the data division once and then use the same data division for all comparisons. If you think the results are too sensitive given one way to split the data, you can generate several random splits with the helpers, run kfold with each split, but then you need to do the averaging of pointwise elpd values yourself (those are stored in kfoldobject$pointwise). By generating a new kfoldobject with averaged pointwise results, you can then use loo_compare() to compute diffs and se_diff’s (or you can write your script for that).

itowers · April 26, 2021, 12:56am

Thanks for your quick response Aki, and sorry for my delay in getting back to you!

I am now in the process of running the models using the same data divisions for each model as per your first recommendation. I am hoping that by using the same data divisions using the kfold helper (which I was not doing before) for each model, it will not be necessary to conduct repeated K-fold.

However, I would just like to clarify what you mean by averaging the pointwise elpd values. If I understand you correctly, in order to obtain a an average ELPD for an example model (in this model 1) using just two repeats (in reality this would be 5-10), I would first:

Create two random data splits using the k-fold helper function: split_1 and split_2

Perform 10-fold cross validation as such:

model_1_split_1 <- kfold(model_1, K=10, folds=split_1)
model_1_split_2 <- kfold(model_2, K=10, folds=split_2)

In order to get averaged pointwise_ELPD for model_1, I would simply average the pointwise estimates for each repeat. This is the part I just wanted to make sure about.

model_1_average_pointwise <- apply(cbind(model_1_split_1$pointwise[,1],model_1_split_1$pointwise[,1]), 1, mean)

Thanks in advance!

avehtari · April 26, 2021, 12:28pm

Yes, looks correct.

itowers · April 29, 2021, 11:27pm

Thanks again, this has been extremely helpful!

Topic		Replies	Views
Compare models with elpd calculated differently? brms loo	2	63	February 14, 2025
Model comparison for multiple imputation with brm_multiple Modeling loo , cross-validation , model-comparison , brms , missing-data	1	95	September 20, 2024
Compare models with k-fold cv General	1	1234	May 31, 2019
Model weighing using kfold brms loo	2	673	February 12, 2019
Model selection with loo and bridge sampling brms	1	1461	February 23, 2021

Repeated K-fold Cross Validation

Related topics