Presenting small elpd_diffs based on multiple fits of the same models

blokeman · October 8, 2023, 11:25am

I fully agree that it’s nicer to have the utility/loss statistic and p_loo on the same scale. But I’m stuck with the Deviance scale for my present project. Therefore I’d like to know how to calculate mcse_looicdiff. I’m currently using a home-made function for my model comparisons which calculates this mcse_looicdiff as

MCSE_LOOICDIFF = \sqrt{(2*\text{MCSE_LOO}_a)^2 + (2*\text{MCSE_LOO}_b)^2}

and I sure hope that someone more skilled at math/stats can confirm whether this is correct, lest I suffer public humiliation after the study goes to print.

EDIT: I’d also like to know, regardless which scale we use, whether the Central Limit Theorem can reasonably be assumed to apply to the variability of MCSE_DIFF/MCSE_LOOICDIFF.

avehtari · October 9, 2023, 5:21pm

First compute everything in log score scale, and in the end multiply all estimates by -2 and all SE’s by 2.

If you are using loo package and Pareto smoothed importance sampling then
Pareto smoothed importance sampling paper provides the conditions when MCSE of elpd_loo is valid and can be assumed to be accurate. Assuming independent posterior draws are used for the compared models, then taking the difference doesn’t change the conditions.

blokeman · October 9, 2023, 6:07pm

Great, thanks!

Topic		Replies	Views
Interpreting elpd_diff - loo package Modeling loo , interpret-results	47	14634	November 9, 2020
If elpd_diff/se_diff > \|2\|, is this noteworthy? brms techniques , loo , cross-validation	21	4013	April 3, 2021
ELPD clarification General brms	3	440	January 12, 2024
Quick examples of loo() interpretation Modeling loo	11	1803	July 3, 2020
Model comparsion for linear regression using loo and Bayesian R2 Modeling techniques , loo	0	431	October 18, 2022

Presenting small elpd_diffs based on multiple fits of the same models

Related topics