A general approach to reparameterization

dmalison · September 27, 2017, 9:46pm

Hi all,

In finite dimensional models, the Central Limit Theorem can usually be applied to prove that the posterior distribution converges to a multivariate normal as the amount of data grows. However, Stan can have trouble sampling from multivariate normals when the parameters are highly correlated. My idea is to use draws from an initial “bad” parameterization to locate a better one.

Suppose f(y|theta) is the model and f(theta) is the prior. What do you think about the following general strategy:

Obtain an initial set of draws from the posterior f(y|theta)f(theta) in the usual fashion
Use the draws to compute a posterior mean vector mu and a posterior covariance matrix Sigma
Reparameterize the model as f(y|mu + L_Sigma * epsilon)f(mu + L_Sigma * epsilon), where L_Sigma is the Cholesky factor of Sigma
Obtain draws from the posterior of epsilon
Iterate to update mu and Sigma

I’ve had some success with this approach in practice, as the posterior of epsilon is usually “close” to a multivariate standard normal.

anon75146577 · September 28, 2017, 12:15am

That is computationally very expensive and would be better achieved by adapting the whole mass matrix (same cost), more efficient.

Bob_Carpenter · October 2, 2017, 8:18pm

This is similar to what Stan already does if you specify a dense mass matrix (Euclidean metric). It uses exponentially increasing blocks to alternatively draw a sample and update the covariance matrix estimate. We can then use that to adjust the metric over which we sample.

We essentially do what you’re suggesting informally by reparameterizing a model by eye to try to put all the parameters on the same scale. Once everything’s on a unit scale, correlation doesn’t matter. It’s not the correlation that hurts Stan, it’s the varying scales plus correlation when we use a diagonal metric. Also, varying curvature is a problem in that there’s no fixed mass matrix/metric that works everywhere, so we have to be conservative, which can result in slow sampling with a small step size (the alternative is to introduce more bias as you won’t get into the high curvature regions with larger step sizes).

Topic		Replies	Views
Multivariate Normal: Scale invariant priors for regularization? Modeling	2	387	August 2, 2022
Beginner request for reparameterization help Modeling	12	1066	January 3, 2018
Sampling issues in my model: Reparamterization does not seem to work Modeling	3	594	March 23, 2021
Non-centered parameterization with additional constraints Modeling fitting-issues , specification	3	686	June 17, 2020
"Pairwise" alternative to multivariate normal isn't behaving for the hierarchical case; help? Modeling techniques , specification	13	785	June 10, 2021

A general approach to reparameterization

Related Topics