Non-centering for mean field variational inference

maedoc · March 7, 2018, 2:38pm

I have a naive question on algorithms & centering: I’ve understood why non-centering is useful for HMC, but I am curious if the same rationale necessarily holds for mean field ADVI, which as far as I understand will simply ignore correlations among parameters. It seems obvious that the ELBO would be lower compared to an equivalent non-centered model, but does that imply that the maximization of ELBO suffers?

avehtari · March 7, 2018, 6:15pm

The same rationale helps. With non-centering the posterior of the parameters is closer to independent normal.

and if the posterior is close to independent normal it works just fine.

See Figure 5 and discussion in “Yes, but Did It Work?: Evaluating Variational Inference” [1802.02538] Yes, but Did It Work?: Evaluating Variational Inference

maedoc · March 8, 2018, 9:21am

Thanks for the reference & comments. I will search arxiv next time before posting here 😅(an idea for a Discourse plug in perhaps)

Figure 5, lower left figure, it seems the combination of centered & non-centered cover more of the parameter space covered by NUTS, than either alone. If I understand correctly, this is irrelevant, because the non-centered variant is closer to the true posterior?

avehtari · March 8, 2018, 1:02pm

That figure is bit overcrowded, but yes what matters which one is closer and with PSIS we can further correct when estimating various expectations.

Topic		Replies	Views
Correlated 2D Gaussian breaks ADVI Modeling fitting-issues	23	3467	July 12, 2018
Different runs give me different estimated values Algorithms mcmc , variational-bayes	4	1140	July 3, 2017
Convert correlation matrix to non-centered parameterization? Modeling loo	8	1421	March 10, 2018
Centered vs. non-centered parameterizations Modeling performance	3	4573	January 20, 2019
Centered or non-centered parametrization for random effects Modeling	15	5692	July 15, 2017

Non-centering for mean field variational inference

Related topics