Stan files for models in PosteriorDB

charlesm93 · February 20, 2026, 10:25pm

I want to take a look at the models in PosteriorDB (GitHub: stan-dev/posteriordb, a database with posteriors of interest for Bayesian inference). First, I tried to find a list of all available models. In R:

library("posteriordb")
my_pdb <- pdb_local(path = "~/Code/posteriordb/")
pos <- posterior_names(my_pdb)
head(pos)
# [1] "arK-arK"                         "arma-arma11"                    
# [3] "bball_drive_event_0-hmm_drive_0" "bball_drive_event_1-hmm_drive_1"
# [5] "bones_data-bones_model"          "butterfly-multi_occupancy"

In the full list, I spotted “prostate-logistic_regression_rhs” and “ovarian-logistic_regression_rhs”, which I’m guessing are models with horseshoe priors. But I couldn’t find the specific Stan files under posterior_database/models/stan on the master branch of stan-dev/posteriordb. Am I looking at the wrong repo?

avehtari · February 21, 2026, 1:19pm

Maybe easier to use posteriordb package functions?

"prostate-logistic_regression_rhs" |>
  posterior(my_pdb) |>
  stan_code_file_path()

charlesm93 · February 23, 2026, 4:45pm

Thanks Aki. The command returns the following:

"/var/folders/ty/36ws994x4l33ksszcdwjtrv40000gp/T//RtmpwsryKD/posteriordb_cache/models/stan/logistic_regression_rhs.stan"

But I couldn’t locate the var folder. That said, I did find the stan model locally.

posteriordb/posterior_database/models/stan/logistic_regression_rhs.stan
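
For anyone searching a local clone by hand, here is a minimal base R sketch. It assumes the clone sits at a path like the `~/Code/posteriordb/` used in the first post, and `find_model_files` is a hypothetical helper name, not part of posteriordb. Note that the file is named after the model (`logistic_regression_rhs`), not after the full posterior name.

```r
# Hypothetical helper (not part of posteriordb): list Stan files in a local
# clone whose file names match a regular expression.
find_model_files <- function(repo_dir, pattern) {
  stan_dir <- file.path(repo_dir, "posterior_database", "models", "stan")
  list.files(stan_dir, pattern = pattern, full.names = TRUE)
}

# Assumed clone location, as in the first post; adjust to your setup.
repo_dir <- path.expand("~/Code/posteriordb")
if (dir.exists(repo_dir)) {
  print(find_model_files(repo_dir, "rhs.*\\.stan$"))
}
```

Searching on "rhs" rather than on "prostate" is the point here: the data set name does not appear in the model file name.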

avehtari · February 23, 2026, 5:16pm

It must be your operating system confusing you. You don’t actually need to see the path at all: you can assign the file path string to a variable (with <- or ->) and use that variable to refer to the file. Or, if you want to edit the file, just keep piping to copy it into your working directory:

"prostate-logistic_regression_rhs" |>
  posterior(my_pdb) |>
  stan_code_file_path() |>
  file.copy(".")
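
The "assign it to a variable" route could look roughly like the sketch below. It assumes the posteriordb package is installed and that a clone exists at `~/Code/posteriordb` (the path from the first post); `show_head` is a hypothetical helper, not a posteriordb function. The posteriordb part is guarded so the snippet does nothing if either assumption fails.

```r
# Hypothetical helper: return the first n lines of a text file.
show_head <- function(path, n = 10) {
  stopifnot(file.exists(path))
  readLines(path, n = n, warn = FALSE)
}

# Assumed clone location; adjust to your setup.
pdb_dir <- path.expand("~/Code/posteriordb")
if (requireNamespace("posteriordb", quietly = TRUE) && dir.exists(pdb_dir)) {
  library(posteriordb)
  my_pdb <- pdb_local(path = pdb_dir)
  # Store the path without ever printing it.
  model_path <- "prostate-logistic_regression_rhs" |>
    posterior(my_pdb) |>
    stan_code_file_path()
  # Use the variable wherever a file name is expected.
  writeLines(show_head(model_path))
}
```

Since the path is only ever passed around as a value, it does not matter that it points into a temporary cache directory.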

Great

ahartikainen · February 24, 2026, 7:00am

So check this posterior JSON:

github.com/stan-dev/posteriordb

posterior_database/posteriors/prostate-logistic_regression_rhs.json

master

{
  "name": "prostate-logistic_regression_rhs",
  "keywords": ["stan benchmark", "logistic regression"],
  "urls": "",
  "model_name": "logistic_regression_rhs",
  "data_name": "prostate",
  "reference_posterior_name": null,
  "references": ["piironen2017sparsity", "li2018feature"],
  "dimensions": {
    "beta0": 1,
    "z": 5966,
    "tau": 1,
    "lambda": 5966,
    "caux": 1
  },
  "added_date": "2020-02-29",
  "added_by": "Oliver Järnefelt"
}

And notice the model you should look at is

github.com/stan-dev/posteriordb

posterior_database/models/stan/logistic_regression_rhs.stan

master

data {
  int<lower=0> n; // number of observations
  int<lower=0> d; // number of predictors
  array[n] int<lower=0, upper=1> y; // outputs
  matrix[n, d] x; // inputs
  real<lower=0> scale_icept; // prior std for the intercept
  real<lower=0> scale_global; // scale for the half-t prior for tau
  real<lower=1> nu_global; // degrees of freedom for the half-t priors for tau
  real<lower=1> nu_local; // degrees of freedom for the half-t priors for lambdas
  // (nu_local = 1 corresponds to the horseshoe)
  real<lower=0> slab_scale; // for the regularized horseshoe
  real<lower=0> slab_df;
}
parameters {
  real beta0;
  vector[d] z; // for non-centered parameterization
  real<lower=0> tau; // global shrinkage parameter
  vector<lower=0>[d] lambda; // local shrinkage parameter
  real<lower=0> caux;
}

This file has been truncated.

and the data used there is "prostate", as given by the "data_name" field in the JSON above.

avehtari · February 25, 2026, 8:28pm

Happy to get feedback on the user interface and vignette so that we can make it easier to find model code and data without needing to look at the JSON.

Bob_Carpenter · March 10, 2026, 12:41am

I’ve found it easier to just grab the model/data pairs I need directly, either on GitHub or after cloning.

One thing I’d recommend is a naming convention where the data for a model has the same prefix as the model. As is, I find it a bit challenging to figure out which data goes with which model.

avehtari · March 10, 2026, 7:35am

Good point. We have to be more careful with the prefixes. The reason models and data don't share a prefix is that a single model can be used with data from different sources and vice versa. In posteriordb it is likely that a single data set is used only with variations of one model, so we could use the same prefix for all common model variations and all data sets that are used with those variations. Also, the posterior object in the database knows both the model name and the data name, so there is a benefit to using the database instead of looking directly at the model and data directories.

Niko · March 10, 2026, 10:55am

The posterior’s name (as listed, e.g., under posterior_database/posteriors on the master branch of stan-dev/posteriordb) is just always {dataset}-{model} though, right? I find this pretty unambiguous.
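
If that convention holds, the two halves can be recovered in base R. This is only a sketch: it assumes names have the form {dataset}-{model} and that the data set name itself contains no hyphen, which is true of the names shown in this thread but is not something checked against the whole database. `split_posterior_name` is a hypothetical helper.

```r
# Hypothetical helper: split "{dataset}-{model}" at the first hyphen.
# Assumes the data set name contains no hyphen.
split_posterior_name <- function(name) {
  data.frame(
    data_name = sub("-.*$", "", name),       # text before the first hyphen
    model_name = sub("^[^-]*-", "", name),   # text after the first hyphen
    stringsAsFactors = FALSE
  )
}

parts <- split_posterior_name("prostate-logistic_regression_rhs")
print(parts)

# Relative path of the Stan file inside the repository.
stan_file <- file.path("posterior_database", "models", "stan",
                       paste0(parts$model_name, ".stan"))
print(stan_file)
# "posterior_database/models/stan/logistic_regression_rhs.stan"
```

For the prostate example this agrees with the "data_name" and "model_name" fields of the posterior JSON quoted earlier in the thread.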

ahartikainen · March 10, 2026, 12:00pm

I think even something like this would be a nice addition to posteriordb.

Quick example made with ChatGPT (I think reference posteriors is still broken)

Edit: yes, I think the color theme is horrific.

avehtari · March 11, 2026, 7:57am

Ping @mans_magnusson
