Non-centering for mean field variational inference

maedoc · March 7, 2018, 2:38pm

I have a naive question on algorithms & centering: I’ve understood why non-centering is useful for HMC, but I am curious if the same rationale necessarily holds for mean field ADVI, which as far as I understand will simply ignore correlations among parameters. It seems obvious that the ELBO would be lower compared to an equivalent non-centered model, but does that imply that the maximization of ELBO suffers?

avehtari · March 7, 2018, 6:15pm

The same rationale helps. With non-centering the posterior of the parameters is closer to independent normal.

and if the posterior is close to independent normal it works just fine.

See Figure 5 and discussion in “Yes, but Did It Work?: Evaluating Variational Inference” [1802.02538] Yes, but Did It Work?: Evaluating Variational Inference

maedoc · March 8, 2018, 9:21am

Thanks for the reference & comments. I will search arxiv next time before posting here 😅(an idea for a Discourse plug in perhaps)

Figure 5, lower left figure, it seems the combination of centered & non-centered cover more of the parameter space covered by NUTS, than either alone. If I understand correctly, this is irrelevant, because the non-centered variant is closer to the true posterior?

avehtari · March 8, 2018, 1:02pm

That figure is bit overcrowded, but yes what matters which one is closer and with PSIS we can further correct when estimating various expectations.

Topic		Replies	Views
Centered vs non-centered model, different loglikelihoods Modeling specification	6	793	May 15, 2024
Correlated 2D Gaussian breaks ADVI Modeling fitting-issues	23	3522	July 12, 2018
Is the posterior from ADVI always normal? Algorithms	6	160	April 7, 2025
Centered vs NCP Question -> Mixed parameterisation & Correlations Modeling specification	2	537	June 13, 2022
Adaptation and non-centered parameterisations Developers	10	1245	August 2, 2021

Non-centering for mean field variational inference

Related topics