Writing a Program to Calculate ESS from Samples

Corey.Plate · December 15, 2023, 9:59pm

Hey all,

I’m attempting to write a program to calculate ESS from the samples I get back from Stan because I run the program from the command line on the cluster, without R or python as an interface to generate a StanFit object.

I just want to make sure I understand the formula on 16.4 Effective sample size | Stan Reference Manual. I already know how to calculate W and var+_hat because I made an R_hat program using the Stan Reference Manual, but I’m still a little lost trying to figure out the rest of the term definitions for the ESS formula found in the Stan Reference Manual. Starting here…

phat

Does the circled area refer to the formula:

Screenshot 2023-12-15 at 4.47.23 PM

Where the covariance term is the normalized:

Screenshot 2023-12-15 at 4.50.43 PM ?

And, with respect to the rest of the formula…

var

What does this circled area refer to the variance of? Is it the variance of samples for a particular parameter for a single chain?

Thanks!

Sincerely,
Corey

Bob_Carpenter · December 17, 2023, 10:16pm

The circled \widehat{\rho}_{t, m} is the estimated lag-t correlation in chain m, whereas \rho_t is just the general definition of lag-t correlation for a Markov chain X. The expression \textrm{cov}_{x,y} is then an estimate given an observation of a subsequence of X. The x is an observed subsequence of X and the y is an observed sequence lagged by t.

s^2_m is defined where \widehat{R} is defined, back in section 16.3.1. It’s the estimated variance in chain m of the parameter whose ESS is being estimated in chain.

The easiest thing to do might be to read the Bayes-Kit code, which has a simpler version of the definitions that only work on a single chain at a time (and hence don’t adjust for lack of agreement among multiple chains). But it has the definitions of these key terms in a pretty simple form:

github.com

flatironinstitute/bayes-kit/blob/main/bayes_kit/ess.py

from .iat import iat, iat_imse, iat_ipse
from .typing import FloatType, VectorType


def ess_ipse(chain: VectorType) -> FloatType:
    """
    Return an estimate of the effective sample size (ESS) of the specified Markov chain
    using the initial positive sequence estimator (IPSE).

    Parameters:
        chain: Markov chain whose ESS is returned

    Return:
        estimated effective sample size for the specified Markov chain

    Raises:
        ValueError: if there are fewer than 4 elements in the chain
    """
    if len(chain) < 4:
        raise ValueError(f"ess_ipse(chain) requires len(chain) >= 4, but {len(chain)=}")

This file has been truncated. show original

That does everything according to these definitions. On the other hand, we’ve updated posterior (R) and arviz (Python) to use newer definitions from Vehtari et al., and you can look at their code for more details on that.

mitzimorris · December 18, 2023, 2:00pm

if you run Stan from the command line using the CmdStan interface, then you can use the accompanying CmdStan utilities 20 stansummary: MCMC Output Analysis | CmdStan User’s Guide and 21 diagnose: Diagnosing Biased Hamiltonian Monte Carlo Inferences | CmdStan User’s Guide to get ESS and more.

you can also write your own utility, in which case, I suggest using the above utilities to check your work.

Corey.Plate · December 18, 2023, 4:30pm

Thank you! That is a very helpful utility to have handy. I will likely still write a utility for it, but this is very helpful to have on hand right now

Topic		Replies	Views
Stan's ESS (and monotone autocorrelations) Algorithms mcmc	5	905	March 13, 2020
Does effective sample size estimation require independent chains? Algorithms	11	1917	August 20, 2019
Revised Rhatv5 and ESS paper Publicity	0	815	January 17, 2020
Computing effective sample size in R RStan	2	993	November 22, 2019
Empty stanfit Object For Computing MCMC Summaries RStan	6	707	March 29, 2019

Writing a Program to Calculate ESS from Samples

Related topics