Stan model 'RL_RW' does not contain samples

haohao · June 12, 2022, 11:12am

I want to use the reinforcement learning_Rescorla Wagner model to explore how subjects learn the underlying 4 hierarchies. Left[nTrials] and right[n] are stimuli with potential hierarchies presented in pairs, and the subjects are asked to select the high-hierarchy stimuli (choice[nTrials]), if correct reward=1, incorrect reward=-1. But my model keeps getting errors. as follows:

‘Stan model ‘RL_RW’ does not contain samples.’

SAMPLING FOR MODEL ‘RL_RW’ NOW (CHAIN 1).
Chain 1: Rejecting initial value:
Chain 1: Error evaluating the log probability at the initial value.
Chain 1: Exception: categorical_logit_lpmf: categorical outcome out of support is 4, but must be in the interval [1, 2] (in ‘model21585021459_RL_RW’ at line 30)

This is my stan model code

data{
  int<lower=1>  nTrials;
  int<lower=1,upper=4> left[nTrials];
  int<lower=1,upper=4> right[nTrials];
  int<lower=1,upper=4> choice[nTrials];
  int<lower=-1,upper=1> reward[nTrials];

}

parameters{
  real<lower=0,upper=1> alpha;
  real<lower=0,upper=3> tau;

  
  vector[4] V_4;
  vector[2] V_2;
  real pe_l;
  real pe_r;
  V_4=rep_vector(0,4);
  
  for(t in 1:nTrials){
    
    V_2[1]=V_4[left[t]];
    V_2[2]=V_4[right[t]];
    
    choice[t]~ categorical_logit(tau*V_2);

    //value update
    if((choice[t]==left[t] && reward[t]==1) || (choice[t]==right[t] && reward[t]==-1)){
      pe_l=1-V_4[left[t]];
      pe_r=-1-V_4[right[t]];
    }else{
      pe_l=-1-V_4[left[t]];
      pe_r=1-V_4[right[t]];
    }
    
    V_4[left[t]]=V_4[left[t]]+alpha*pe_l;
    V_4[right[t]]=V_4[right[t]]+alpha*pe_l;
   
  }
}
}

Guido_Biele · June 13, 2022, 12:12pm

V_2 is a vector with only 2 entriea, which you seem to use in a choice rule for 4 options.
You need a vector of dour values to choose among 4 options.

haohao · June 13, 2022, 12:24pm

But for each trial， I only present 2 option for subjects.

Guido_Biele · June 14, 2022, 7:05am

If the a choice in a particular trial is 4, but V_2 has only to values, this wont work because the probability of choosing option 4 can’t be calculated.
It should work if you rewrite the model such that in each trials V_2 has the values of the two available options and choice is always 1 for the first and 2 for the second available option

haohao · June 14, 2022, 8:00am

Sorry, I just started learning stan, can you please help me to modify the code directly?

Guido_Biele · June 14, 2022, 10:08am

Sorry, I don’t have time to work directly with the code.

just shortly: It looks like as if here you are already taking care that only the two relevant action values are used:

One way to procede is to actually leave the Stan code unchanged and to mofidy the choice vector in the data, so that is always has 1 if people chose left and 2 if people chose right (assuming I understand your data structure correctly)

haohao · June 16, 2022, 7:30am

Thank you very much. Problem seems solved.

Guido_Biele · June 17, 2022, 3:26pm

If one of my answers put you on the right path, you could mark it as solution. 😀

Topic		Replies	Views
Bugs from A dynamic reinforcement learning model Modeling cognitive-science	7	663	January 6, 2020
Stan code for fitting simple RL model Modeling cognitive-science	2	1563	December 12, 2018
Stan model 'anon_model' does not contain samples Modeling	6	492	February 8, 2024
How can I simulate data for my model?(Reinforcement Learning Model) Modeling cognitive-science	7	1054	November 18, 2019
Repeated measure hierarchical reinforcement learning with groups (2 x 2 x 2 design) Modeling rstan , fitting-issues , hierarchical-model , model-comparison , cognitive-science	7	2015	July 28, 2020

Stan model 'RL_RW' does not contain samples

Related topics