Leave-one-group-out CV

Dillon_Fitch · September 20, 2018, 6:32pm

I’m curious to hear your general approach for doing leave-one-group-out k-fold CV in Stan. I’ve been trying to do the same thing, and my approach seems to be an ugly hack. I roughly follow this approach: https://datascienceplus.com/k-fold-cross-validation-in-stan/#share-wrapper, by including a holdout vector as data. I also include an index vector for the start locations (jj_start) of where the group index changes (i.e. first row for a new group) as data. I then fix all the groups level parameters of the group I’m holding out in transformed parameters to 0 (we need to do this right?), and do some ugly chopping of my data to get group summed log_lik in generated quantities for J groups (people in my case with repeated measures).

generated quantities{
vector[J] log_lik;
for (j in 1:J){
  {
    int start; 
    int end; 
    start = jj_start[j];
    if(j<J){
      end = (jj_start[j+1]-1);
    }else{
      end = N;
    }
      log_lik[j] = normal_lpdf(y[start:end] | a_person[j] + block(x,start,1,end-start+1,K) * beta, sigma);
  }
}}

I’m not even sure this is right, and I know it can’t be the best way to do leave-one-group-out CV. I tried to dig through brms code to get a better idea, but haven’t figured any of that out. Hopefully you’ve found a better way!

Topic		Replies	Views
To use or not to use the assignement to a hierarchical group as a feature for exact LOO and LOO-PSIS Modeling loo	6	970	October 14, 2020
LOGO for Stock Returns Interfaces loo	2	393	July 1, 2019
Loo and "logo" combination. Two data types, one model Modeling loo	6	754	May 17, 2022
Loo_model_weights( ..) throws error with leave-one-group-out cv brms loo	6	1000	November 16, 2020
Leave-one-group-out, but for multiple, different groups? brms loo , cognitive-science	3	871	November 20, 2020

Leave-one-group-out CV

Related topics