I’m experiencing significant performance differences between CmdStan’s
variational (2.26.1) and RStan’s
vb (2.21.3) - wanted to know if this is expected or not.
I have a relatively large model - ~42K parameters - and I’m using RStan’s VB (mean-field) to get an initial estimate of posterior variances. These are then used to rescale parameters in the model - this really helps with sampling, specifically with avoiding max-tree-depth warnings. A typical
vb run on this model takes around ~20 minutes (default parameters).
I tried to switch to CmdStan’s
variational, but the program hangs after eta adaptation. CPU is on 100%, and memory is filling up quickly, but nothing is printed to the console (I killed it after waiting several hours).
variational does work as expected on the Bernoulli example, so it doesn’t seem like a general problem, but rather something that is related to the model itself and/or its size. This happened both on my laptop (macOS) and on a linux machine.