Adjusting default tol_rel_obj threshold in vb

saudiwin · November 27, 2018, 2:08am

The recent paper by Yao et al. on Stan’s variational inference is pretty clear that lowering the threshold for change in ELBO will improve the fit of the approximated posterior and is more likely to pass the diagnostic checks they propose (see figure 2 in the paper). However, the current default in rstan, option tol_rel_obj, is still set at 0.01. Given the fact that rstan settings for full Bayesian inference are conservative (2000 iterations), it would seem a good practice to change the default as well for vb to use a more conservative standard for ELBO convergence, such as 10^{-4}.

I’m happy to put in a pull request to the rstan Github but I wanted to post on here first to get feedback on the idea.

bgoodri · November 27, 2018, 2:46am

I think the defaults for ADVI need to change for all interfaces.

Bob_Carpenter · November 29, 2018, 2:51am

I’d like to hear from @yuling about what he thinks the defaults should be.

I’d like to make robustness the first goal, with options for more advanced users to tune for more efficiency in specific cases.

yuling · November 29, 2018, 3:15am

Currently, the default setting is when running average or running medium of the objective function (elbo) falls below 1%. In many optimization problems, this is not a horrible condition.

A more conservative stopping rule help, but it is also not clear how conservative the threshold should be.

When k hat is outputted, we can also use k hat itself as a stopping rule.

saudiwin · November 29, 2018, 3:48pm

Ok though K-hat can only be computed on a fitted model, right? It would seem that to “make robustness the first goal” @Bob_Carpenter, the initial run should have a strict threshold that more experienced users can adjust when wanting to do quicker optimization runs.

Also is there a plan to include k-hat calculations as part of the vb function (i.e. doesn’t that involve having the user program generated quantities first)?

Bob_Carpenter · November 30, 2018, 1:19am

I think the previous post from @yuling brought up the relevant question of how strict?

Topic		Replies	Views
Convergence of variational inference General variational-bayes	4	2167	December 5, 2020
In Rstan, for ADVI, is there a way to produce ELBO and eta (step-size)? RStan	5	1752	December 20, 2019
Error in ADVI (rstan) Modeling	1	1354	August 4, 2019
Variational Bayes results seems sensible, but vary - What to change? Modeling variational-bayes	6	962	November 6, 2020
Performance differences between RStan's VB and CmdStan's variational Interfaces cmdstan , rstan , variational-bayes	3	601	May 23, 2021

Adjusting default tol_rel_obj threshold in vb

Related topics