WAIC unreliable?

betanalpha · May 23, 2017, 7:19pm

Keep in mind that PSIS-LOO, WAIC, DIC, AIC, etc are all approximations to a fundamentally-uncalculatable number that exactly quantifies predictive accuracy. And each of those approximations require different assumptions to be reasonably accurate.

A common assumption that’s necessary is that the data are roughly IID so that, for example, looking at a subset of the data gives a reasonable quantification of what a draw from the full data would do. In many models, however, such as the one that Ben mentions this assumption is not valid and all of the predictive performance estimators should be suspect.

One of the great features of PSIS-LOO is that is has the self-diagnostic that can identify the failure of some of these assumptions and that makes it much more useful in practice than many of the other estimators. And in practice if we can’t trust PSIS-LOO then we probably can’t trust any of the others like WAIC.

Topic		Replies	Views
Posterior Predictives look good, but PSIS-LOO and WAIC are bad Modeling rstan , techniques , loo	6	1804	October 24, 2021
Not trustworthy WAIC and loo outcome? Modeling rstan , performance , loo	5	847	October 29, 2022
LOO and WAIC deliver exactly same results? Modeling loo	9	1027	October 4, 2018
High Pareto-k values for the same observations across different models: Can I still use loo to compare these models? Modeling loo	2	647	November 5, 2018
Bad Pareto k diagnostic with good chain diagnostics General	12	2045	April 26, 2021

WAIC unreliable?

Related topics