Models where Stan outperforms Nutpie/Walnuts

ssp3nc3r · April 30, 2026, 3:03pm

I think it would be incredibly valuable to add more models to posteriordb. It has a lot of small models, and a few very big but close to intractable models. Adding more big …

I agree. Not sure whether this is the right place to chime in, but I’ve been excited at the idea of speeding up Stan using Nutpie, Walnuts, etc. For pretty much all the models I would want to reach for this, though, I haven’t actually been able to get these to do better and in most cases they don’t converge or have lower ESS/draw.

These models I’m referring can have 1+GB compressed data with several 100K parameters. So I think quite large for Stan models.

The Stan models converge but the Nutpie or Walnuts implementations would not. In a few models I was able to get Walnuts to converge, and although its time per draw was up to 2x Stan, it’s ESS/draw was far lower so I would need to double or more the iterations to get the same effective draws. I don’t have a strong enough grasp of the why and I’m not able to share these models, unfortunately, so having much larger complex models to test against would hopefully be a step in understanding what I’m seeing that doesn’t seem to match the performance charts floating around.

aseyboldt · May 1, 2026, 10:16am

This isn’t a baseball model by any chance? I still have scars from some of those. They break everything.

I’m definitely interested to hear more about that, but if you can’t share the model that of course makes it a bit tricky. Maybe you can open a separate thread with what info you can give? (and ping me there).

One thing I’ve seen before what that nuts was choosing a too short trajectory length for some reason. nutpie has an extra_doubling argument that you could set to 1 or so, which will make it run one extra tree doubling after it detected a u-turn. It would be interesting if that fixes the problem. Does stan get good ess (let’s say > 300) for all of the parameters?

ssp3nc3r · May 1, 2026, 2:01pm

Ah, those baseball models were a few of the models I thought of!

But it’s consistent with others, like most recently I have a large soccer model, hierarchical in several ways, 6 likelihoods, parameter sharing across them, 86 leagues of data, etc. Same issue.

In each of these cases ESS from Stan is fine.

If it is the trajectory being too short, one of the main differences in result between Stan and Nutpie is that Stan is more conservative in finding an optimal tree depth, and Nutpie is more aggressive in making the trajectories shorter, which is where much of the speed up comes from, to over simplify.

I could set extra_doubling, and if it fixes it, it would also make the speed per draw more similar to Stan. Which means that in the models that Nutpie succeed in posteriordb there is something about the geometry and model structures where Stan’s approach is too conservative, and in the models I’m running, Nutpie is too aggressive. But I don’t know of an obvious pattern to generalize.

I had also forked Flatiron Walnuts repo to give myself extra tuning parameters, one of which is a continuous --unit-mass to float between the two ideas but I haven’t had a lot of time to actually investigate:

github.com/ssp3nc3r/walnuts

SETUP_NOTES.md

stan-cli-diagnostics

# WALNUTS Setup and Installation Guide

This guide documents how to install WALNUTS (a C++ HMC sampler) and use it with Stan models via BridgeStan.

This is a fork of [flatironinstitute/walnuts](https://github.com/flatironinstitute/walnuts) with additional CLI features on the `stan-cli-diagnostics` branch: JSON initialization, diagnostic output, unit mass matrix option, and additional sampler configuration arguments.

## Overview

WALNUTS is an alternative sampler to Stan's default NUTS. It requires:
1. A compiled Stan model as a shared library (.so file) via BridgeStan
2. Data in JSON format
3. (Optional) Initial values in JSON format

## Prerequisites

- C++ compiler (clang or gcc with C++17 support)
- CMake (3.14+)
- Git
- R with bridgestan package (for compiling Stan models to .so files)

This file has been truncated. show original

Feel free to move this into a new thread if this is too adjacent to the Sparse Nuts.

WardBrian · May 1, 2026, 2:26pm

I think this discussion is interesting and could go on for some time, so I have split it off

Topic		Replies	Views
Sparse NUTS: preconditioning with sparse matrix operations General	15	353	May 2, 2026
Comparing Stan's adaptation phase to that of nuts-rs? Algorithms	20	1920	August 11, 2023
STAN for large model with a lot of parameter General performance	2	758	June 21, 2018
Comparing implementations of mixed logit Bayesian inference General	3	1102	July 14, 2020
Run time PyStan	14	2525	January 23, 2019

Models where Stan outperforms Nutpie/Walnuts

Related topics