Personal Stan Guide

Nicklaus_Millican · May 4, 2024, 1:34am

I’ve been a Stan user for a few years now, and one thing I’ve always struggled with is how to learn Stan. There is the documentation and the forums, and these are pretty great. But the documentation can be daunting and is best suited to those who already know what they are looking for. So my learning from these tends to be scattershot, and it can be hard to get a clear, systematic understanding of the software. I’ve always wished there were an Intro to Stan book that could get somebody pretty far and even serve as a bridge to the more technical Stan documentation. The best I’ve found is Ben Lambert’s Bayesian textbook. So first I ask: are there any resources I’m unaware of? Next, I thought I might build just such a guide for myself. Here is a rough outline:"

1. Introduction to Stan
   - Overview of Stan and its applications
   - Installation and setup
   - Integrating Stan with R: rstan and other interfaces
   - Understanding Stan syntax and model structure
2. Basics of Bayesian Statistics
   - Brief review of Bayesian concepts
   - Priors, likelihoods, and posteriors
   - Introduction to inference and sampling methods
3. Stan Model Structure
   - Overview of model components
   - functions {} block: Defining custom functions for model
   - data {} block: Defining data inputs and types
   - transformed data {} block: Data preprocessing and transformation
   - parameters {} block: Defining parameters and their types
   - transformed parameters {} block: Calculations involving parameters
   - model {} block: Specifying the model and likelihood
   - generated quantities {} block: Calculating derived quantities
4. Data Types and Parameter Types
   - Overview of data types: integers, reals, vectors, arrays, etc.
   - Understanding parameter types: real, vector, simplex, etc.
   - Choosing appropriate data and parameter types for models
   - Declaring bounds on data and parameters
5. Using Functions in Stan
   - General Overview
   - Broad Categories of Functions
6. Probability Distributions and Related Functions
   - Common probability distributions in Stan
   - Defining distributions: syntax and parameters
   - Functions for probability density, cumulative distribution, and random variates
   - Custom distributions and functions in Stan
7. Working with Stan Models
   - Preparing data and running models
   - Analyzing and interpreting results
   - Debugging models and addressing common issues
   - Best practices for efficient modeling and sampling
8. Algorithms in Stan
   - Markov Chain Monte Carlo (MCMC) and No-U-Turn Sampler (NUTS)
     - Basic concepts and use of MCMC/NUTS
     - Advantages and limitations of NUTS
   - Variational Inference (VI)
     - Overview of variational inference
     - Practical use of VI in Stan models
     - Comparing VI with MCMC
   - Optimization
     - Overview of optimization algorithms (MLE and MAP)
     - When to use optimization in Stan
     - Best practices for using optimization in Stan
9. Common Modeling Scenarios
   - Linear and logistic regression models
   - Hierarchical models and multilevel modeling
   - Time series and spatial models
10. Troubleshooting Stan Models
    - Common errors in Stan models and how to address them
    - Diagnosing and resolving issues with sampling (e.g., divergences)
    - Debugging variational inference and optimization
    - Tips for improving model convergence and efficiency
11. Advanced Topics and Model Building
    - Custom functions and distributions
    - Integrating Stan models with other software and languages
    - Advanced modeling strategies and techniques
12. Case Studies and Applications
    - Real-world examples of Stan models across different fields
    - Step-by-step guides for advanced model building
    - Practical applications of Stan in research and industry
13. Resources for Further Learning
    - Recommended readings and resources
    - Online tutorials and courses
    - Stan forums and communities for support
".

Does anybody have any feedback about this outline? What items are missing? Out of order? Unnecessary?

mhollanders · May 6, 2024, 10:02am

I like the outline and I think it will be very useful to those starting in Stan. As much as I love the User’s Guide, it’s not easy to get started with, and I spent years in BUGS before transitioning to Stan.

Bob_Carpenter · May 6, 2024, 3:05pm

Hi, @Nicklaus_Millican:

As your outline hints, there are really two things going on here: teaching Bayesian statistics and teaching Stan (and even teaching programming). There are two “official” resources that might help here:

Stan Reference Manual: full details on how the language works
Stan User’s Guide: a guide to programming in Stan

The Reference Manual is written for programmers and is rather dry and also not quite as precise as a formal specification. The User’s Guide is intended for people who already know Stan and already know Bayesian stats and want to know how to translate pieces of a model into Stan. It should have probably been called a “Programmer’s Guide”. As you point out, we don’t really have an intro and didn’t want to try to add too much tutorial material to either of the above docs. We really should put a short intro to the language into the User’s Guide. If you wanted to write that and submit it to Stan on GitHub (stan-dev/docs, written in Quarto), we could add it to the docs. We’re pretty picky in reviewing, but welcome contributions and will help with revisions.

CRC/Chapman & Hall keep asking us to write The Stan Book along the same lines as their The BUGS Book. So if you really do fill out this overview, you have a clear route to publication! The closest I’ve come is this introduction:

Carpenter. Getting started with Bayesian statistics using Stan and Python.

I plan to go back and fill in more material. And maybe have someone translate to R :-).

@andrewgelman, @avehtari, and a host of others are working on a Bayesian workflow book that is essentially unfolding and trying to make sense of the paper:

Gelman et al. Bayesian workflow (arXiv)

There are a lot of online resources including videos and replicable case studies on the Stan site (both just listed under case studies and in the StanCon proceedings, which also have videos linked in most cases).

There are also several books that are more tutorial oriented. We have a list here:

Stan web site: Books related to Stan

Richard McElreath’s book is a great place to start if you’re new to Bayesian statistics and MCMC, and I believe there are matching videos online.

McElreath. Statistical Rethinking

The book list on the Stan site is woefully out of date, though I occasionally update it. There’s also an intro book by Bruno Nicenboim, Daniel Schad, and Shravan Vasishth. I reviewed a large chunk for the publisher, CRC, and it’s both really solid and tutorial oriented (a rare combination):

Nicenboim et al. An Introduction to Bayesian Data Analysis for Cognitive Science
