Residual diagnostics in MCMC - based multilevel regression models

Antyteza · February 11, 2020, 11:12am

I’ve recently embarked on fitting multilevel regression models in the Bayesian framework, using a MCMC algorithm (brms in R actually).

I believe I have understood how to diagnose convergence of the estimation process (trace, geweke plot, autocorrelation, posterior distribution…).

One of the thing that strikes me in the Bayesian framework is that much effort seems to devoted to do those diagnostics, whereas very little appears to be done in terms of checking the residuals of the fitted model.

Long story short: I will probably have to present my model to the “classic” econometrician and he will expect me to discuss the residuals. Of course, there is a problem of defining “residuals” in Bayesian regression. So should I simply calculate the fitted model values (Estimate column from fitted(brm) function), replicate the multilevel model into LM and analyze the differences as typical residuals? Or should I focus on posterior predictive checks / loo validation?

andrewgelman · February 11, 2020, 3:09pm

Hi–to start I recommend taking a look at chapters 6 and 7 of BDA3. Chapter 6 discusses model checking, and chapter 7 discusses predictive model evaluation.

Bob_Carpenter · February 13, 2020, 9:33pm

We recommend our revised R-hat metrics and computing effective sample size. It’s hard to learn much from squinting at autocorrelation plots or traceplots.

It depends on the type of Bayesian. The hardcore subjectivists just believe you put down a subjective prior and your posterior is what it is. So you only have to test you computed it, not that it makes sense, because you’ve assumed it makes sense.

With Stan, we strongly recommend checking not just the residuals, but match to data and match to held out data using posterior predictive checks and cross-validation, respectively (“BDA3” is Gelman et al.'s book Bayesian Data Analysis, which has chapters about these things).

I’m also most of the way done with adding chapters on all this to our user’s guide.

Topic		Replies	Views
Using Pearson residuals to validate models brms	3	2241	September 24, 2020
Linear model assumptions check Modeling	3	2155	April 28, 2020
Is check_hmc_diagnostics the quickest and simplest way for a newbie to verify convergence? Modeling rstan , brms	1	447	June 3, 2023
Residual diagnostics Modeling	3	1674	August 5, 2018
Bad chain diagnostic, but good data recovery General	12	1100	December 14, 2020

Residual diagnostics in MCMC - based multilevel regression models

Related topics