Why does ADVI use stochastic gradient ascent not LBFGS

Thanks that’s very helpful. I must’ve missed this in the algorithm just b/c it calls the same gradient calc as everything else. I’ll have to spend more time with it.