Comparing implementations of mixed logit Bayesian inference

mike-lawrence · July 9, 2020, 12:38pm

Why not apply the monitor function to the samples from the comparison model?

My understanding is that any MCMC algorithm should be run as multiple chains to ensure there is no influence of the initialization context. Is this provably not a concern with your comparison method?

Which model from the repo is this? I don’t have time to look at it in any detail until the weekend, but two notes:

your model implies a cauchy prior on tau; you’re using the more efficient tan-uniform parameterization, but I believe that recent advice is to still avoid these unless they are strongly motivated by concrete theory. Previously folks used heavy-tailed priors to (I think) attempt to make things more “robust” in some sense by allowing more mass to extreme values. However, it turns out that this causes a posterior geometry that is very difficult to sample, causing Stan to degrade in performance dramatically. (It is possible that your alternate methods appear to sample more efficiently because they are not as sensitive to the true geometry and therefore seeming to perform better while being less accurate. Note I say “possible” because I can’t speak likelihood; just wanted to put this possibility on your radar)
usually find that in hierarchical models there is often a lot of redundant computation in the more obvious approaches to coding them. I can’t confirm this is the case with your data until I discern which dataset/model from the repo you’re talking about, but in the meantime you could check out this demo of what I mean by redundant computation and how to eliminate it. Arguably this might induce an unfair advantage when then comparing against a non-Stan sampler, so you’d probably want to implement the same tricks in whatever language those use for expressing the model too.

Topic		Replies	Views
Excellent paper on Comparing JAGS, NIMBLE, and Stan Publicity	23	7072	May 3, 2024
How to compare different algorithms for Bayesian inference in terms of speed (and subject to effectiveness) General	16	3794	February 17, 2018
New preprint: Investigating the efficiency of marginalising over discrete parameters in Bayesian computations Publicity rstan	9	1265	September 21, 2022
New algorithm: Gradient-based Adaptive Markov Chain Monte Carlo Algorithms mcmc	13	2118	February 16, 2022
Rstan much slower than self-coded MCMC Modeling performance	10	2400	January 14, 2022

Comparing implementations of mixed logit Bayesian inference

Related topics