Cross-validation for hierarchical models (Replicating Aki's post)

mtenan · March 19, 2019, 4:03pm

Hello,
I am preparing to perform some cross-validation analyses for some multilevel models using rstanarm. As a part of this, I am first mimicking the very helpful blog post by Aki found here:
https://avehtari.github.io/modelselection/rats_kcv.html#setup

I’ve been able to reproduce the code/results up through section 5.4, at which point running the following code results in the following errors:

cv10xfolds <- kfold_split_grouped(K = 10, x = dfrats$rat)
cv10xfolds[dfrats$age==8] <- 11
cv30xfolds <- kfold_split_grouped(K = 30, x = dfrats$rat)
cv30xfolds[dfrats$age==8] <- 31
cv10x_1 <- rstanarm::kfold(fit_1, K=10, folds = cv10xfolds)
Error in rstanarm::kfold(fit_1, K = 10, folds = cv10xfolds) :
all(folds %in% 1L:K) is not TRUE
cv10x_2 <- rstanarm::kfold(fit_2, K=10, folds = cv10xfolds)
Error in rstanarm::kfold(fit_2, K = 10, folds = cv10xfolds) :
all(folds %in% 1L:K) is not TRUE
cv10x_3 <- rstanarm::kfold(fit_3, K=10, folds = cv10xfolds)
Error in rstanarm::kfold(fit_3, K = 10, folds = cv10xfolds) :
all(folds %in% 1L:K) is not TRUE

I have run this on two different Windows computers running similar versions. My session info is below:
R version 3.5.2 (2018-12-20)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows >= 8 x64 (build 9200)

Matrix products: default

locale:
[1] LC_COLLATE=English_United States.1252 LC_CTYPE=English_United States.1252 LC_MONETARY=English_United States.1252
[4] LC_NUMERIC=C LC_TIME=English_United States.1252

attached base packages:
[1] stats graphics grDevices utils datasets methods base

other attached packages:
[1] rstanarm_2.18.2 bayesplot_1.6.0 ggplot2_3.1.0 Rcpp_1.0.0 loo_2.1.0.9000 rprojroot_1.3-2

loaded via a namespace (and not attached):
[1] splines_3.5.2 gtools_3.8.1 StanHeaders_2.18.1 threejs_0.3.1 shiny_1.2.0 assertthat_0.2.0
[7] stats4_3.5.2 yaml_2.2.0 pillar_1.3.1 backports_1.1.3 lattice_0.20-38 glue_1.3.1
[13] digest_0.6.18 promises_1.0.1 minqa_1.2.4 colorspace_1.4-0 htmltools_0.3.6 httpuv_1.4.5.1
[19] Matrix_1.2-15 plyr_1.8.4 dygraphs_1.1.1.6 pkgconfig_2.0.2 rstan_2.18.2 purrr_0.2.5
[25] xtable_1.8-3 scales_1.0.0 processx_3.3.0 later_0.8.0 lme4_1.1-21 tibble_2.0.1
[31] DT_0.5 withr_2.1.2 shinyjs_1.0 lazyeval_0.2.1 cli_1.0.1 survival_2.43-3
[37] magrittr_1.5 crayon_1.3.4 mime_0.6 ps_1.3.0 nlme_3.1-137 MASS_7.3-51.1
[43] xts_0.11-0 pkgbuild_1.0.2 colourpicker_1.0 prettyunits_1.0.2 rsconnect_0.8.13 tools_3.5.2
[49] matrixStats_0.54.0 stringr_1.3.1 munsell_0.5.0 callr_3.1.1 compiler_3.5.2 rlang_0.3.1
[55] grid_3.5.2 nloptr_1.2.1 ggridges_0.5.1 rstudioapi_0.9.0 htmlwidgets_1.3 crosstalk_1.0.0
[61] igraph_1.2.4 miniUI_0.1.1.1 base64enc_0.1-3 boot_1.3-20 codetools_0.2-15 gtable_0.2.0
[67] inline_0.3.15 markdown_0.9 reshape2_1.4.3 R6_2.2.2 gridExtra_2.3 rstantools_1.5.1
[73] zoo_1.8-3 dplyr_0.8.0.1 shinystan_2.5.0 shinythemes_1.1.2 stringi_1.1.7 parallel_3.5.2
[79] tidyselect_0.2.5

Any idea where the issue may lie? I understand the error conceptually, but I’m not sure why I am hitting it when Aki’s R Markdown document is not.
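
For reference, the condition named in the error message can be checked without fitting any model. The following is a minimal base-R sketch: the `folds` vector is a made-up stand-in for `cv10xfolds` (fold ids 1 to 10 plus the extra id 11), not the real rats data, and `ok` and `extra` are illustrative names.

```r
# Minimal sketch, base R only; `folds` is a hypothetical stand-in for cv10xfolds.
K <- 10
folds <- c(rep(1:10, each = 2), 11, 11)  # ids 1..10 plus the extra id 11

# The condition quoted in the error message
ok <- all(folds %in% 1L:K)

# Fold ids that fall outside 1..K
extra <- setdiff(unique(folds), 1L:K)

print(ok)     # FALSE
print(extra)  # 11
```

With the extra fold id 11 and `K = 10`, the condition is FALSE, which is consistent with the error shown above.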

Thanks,
Matt

avehtari · March 19, 2019, 5:29pm

Sorry that it’s not mentioned in the actual notebook. I only mentioned in the Twitter and Discourse announcements (New tutorial: Cross-validation for hierarchical/multilevel models) that you need the latest version from GitHub.

mtenan · March 19, 2019, 5:40pm

Hi Aki,
I did notice that on Twitter, but it looked like the package versions in your environment:
loo_2.0.0.9000
rstanarm_2.18.2

Were earlier than my own, so I assumed I was safe:
rstanarm_2.18.2
loo_2.1.0.9000

Was that incorrect? I’m unable to load the github rstanarm version for reasons noted here:

I’m also not able to compile C code (i.e. use the GitHub rstanarm) on my work computer, so maybe relying on my later package versions was false hope. Is there an expected timeline for the updates to hit CRAN?
Thanks again,
Matt

avehtari · March 19, 2019, 5:48pm

The version number doesn’t go up with each commit to git. There will soon be an rstanarm 2.19 release.
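
Since the version string alone cannot tell two GitHub builds apart, one way to see which commit an installed package came from is to read its DESCRIPTION metadata. This is a rough sketch under an assumption: the `RemoteSha` field is normally written only when a package is installed through remotes/devtools, and it is absent for CRAN installs. `pkg_build_info` is a hypothetical helper name, and `"stats"` is used only because it is always installed.

```r
# Sketch: report the installed version and, if recorded, the source commit.
# Assumption: RemoteSha is present only for installs made via remotes/devtools.
pkg_build_info <- function(pkg) {
  desc <- utils::packageDescription(pkg)
  sha <- desc$RemoteSha  # NULL when the field is absent
  list(
    version = as.character(utils::packageVersion(pkg)),
    commit  = if (is.null(sha)) NA_character_ else sha
  )
}

info <- pkg_build_info("stats")  # swap in "rstanarm" or "loo" as needed
print(info)
```

If `commit` comes back as NA, the installed copy most likely did not come from a GitHub install, whatever the version number says.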
