Correlated 2D Gaussian breaks ADVI

anon75146577 · September 28, 2017, 8:03pm

I’m not sure why people are surprised by this. For mean-field Gaussian you’re approximating family is a product of Gaussians on the two axes, which, for example, can’t approximate a narrow Gaussian concentrated around the line y=x.

For the full rank one, I’d expect it to be in the correct place, but the covariance matrix to be too “concentrated”. This is because the KL divergence is an asymmetric measure of “distance” between two probability distributions and in the direction that it is used for VI, it penalises approximations that are too diffuse far more fiercely than approximations that are too concentrated. This leads to a systematic underestimation of variation using VB methods.

tl;dr: VB doesn’t really work, but might get you a central point quickly. Sometimes.

Topic		Replies	Views
New Theoretical analysis for ADVI Algorithms variational-bayes , advi	0	460	May 28, 2023
ADVI: Posteriors Algorithms	2	594	November 27, 2019
Variational inference with a mixture of dense and diagonal normals General specification , variational-bayes	3	802	November 20, 2021
Question on variational inference Modeling techniques	6	1241	June 19, 2018
Variational Bayes versus MAP for prediction Algorithms	5	3398	December 7, 2019

Correlated 2D Gaussian breaks ADVI

Related topics