Memory issues with Large Item Response Model

Hi, I am trying to fit a large item response model with 39,631 students and around 100 questions in total (the response matrix is sparse, since most students answer only a few questions).

I’m trying to run a single chain with 1000 samples, but I run into memory issues even though I have around 750 GB of RAM.

I assume that a draw of the response matrix gets stored at every iteration, which is likely what blows up memory. Is there some way not to store those draws, or some other best practice for scaling the model?

Welcome to the Stan community. Could you share your model code? That would help in diagnosing any issues.

Only draws of parameters, transformed parameters, and generated quantities are stored. The response matrix would not be stored (assuming it is passed as data).

Hi! Thank you!

So here’s the code. It’s just the boilerplate 2PL model from the Stan documentation:

data {
  int<lower=1> n_users;
  int<lower=1> n_items;
  int<lower=1> n_interactions;

  array[n_interactions] int<lower=1, upper=n_users> user_idx;         // student for each response
  array[n_interactions] int<lower=1, upper=n_items> item_idx;         // question for each response
  array[n_interactions] int<lower=0, upper=1> user_item_interaction;  // binary response, in long format
}
parameters {
  real category_appeal;                        // mean question difficulty
  vector[n_users] affinity_level;              // ability of student j, centered at 0
  vector[n_items] item_appeal;                 // difficulty of item k
  vector<lower=0>[n_items] item_polarization;  // discrimination of item k
  real<lower=0> sigma_appeal;                  // scale of difficulties
  real<lower=0> sigma_polarization;            // scale of log discrimination
}
model {
  affinity_level ~ std_normal();
  item_appeal ~ normal(0, sigma_appeal);
  item_polarization ~ lognormal(0, sigma_polarization);
  category_appeal ~ cauchy(0, 5);
  sigma_appeal ~ cauchy(0, 5);
  sigma_polarization ~ cauchy(0, 5);
  
  user_item_interaction ~ bernoulli_logit(
      item_polarization[item_idx]
      .* (affinity_level[user_idx] - (item_appeal[item_idx] + category_appeal)));
}

Thanks for sharing. Unfortunately, I don’t see any obvious ways to make your model more memory-efficient. Maybe someone else will have some suggestions.

You could always use the thin argument to save only every n-th draw. You could then run multiple thinned chains in sequence and combine the draws after the fact.
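For example, here is a minimal sketch of that idea using CmdStanPy (the file names irt_2pl.stan and irt_data.json are placeholders for your model and data):

import numpy as np
from cmdstanpy import CmdStanModel

model = CmdStanModel(stan_file="irt_2pl.stan")

draws = []
for seed in (1, 2, 3, 4):
    # One thinned chain per run: 1000 sampling iterations, keep every 10th draw.
    fit = model.sample(
        data="irt_data.json",
        chains=1,
        seed=seed,
        iter_sampling=1000,
        thin=10,
    )
    draws.append(fit.stan_variable("affinity_level"))  # shape (100, n_users)

# Combine the thinned draws from the separate runs after the fact.
affinity_draws = np.concatenate(draws, axis=0)

With thin=10, each run keeps only 100 draws per parameter, so each fit’s in-memory footprint is a tenth of an unthinned run’s.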

Do you use CmdStan or some other interface?

CmdStan streams your MCMC draws to a CSV file on disk, which might help with the memory issues.
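You can then read back only the columns you actually need, so the full draw matrix never has to sit in memory at once. A minimal sketch assuming pandas (the output file name is hypothetical):

import pandas as pd

# CmdStan prefixes configuration and adaptation info with '#', so skip those lines
# and load only two scalar parameters from the output file.
draws = pd.read_csv(
    "output_1.csv",
    comment="#",
    usecols=["sigma_appeal", "sigma_polarization"],
)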


I’ve been using PyStan. Do you recommend using CmdStan? I noticed that it allows configuring more parameters (including thin).

I would try to use CmdStan.

I think even CmdStanPy tries to read everything into memory, so vanilla CmdStan is the best option. (Cc @WardBrian)

You can set thin and other parameters with PyStan too.

I’m using PyStan 3.3, and thin is not one of the allowed keyword arguments to pass to the sampler. When I try to pass it I get ValueError: {'json': {'thin': ['Unknown field.']}}

I think num_thin should work, but I need to check this.
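Something like this, as a minimal sketch with PyStan 3 (program_code and data stand in for your model string and data dict):

import stan

posterior = stan.build(program_code, data=data)
fit = posterior.sample(
    num_chains=1,
    num_samples=1000,
    num_thin=10,  # keep every 10th draw
)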

It works! Thank you!