Mixis and E_loo

gregd · January 14, 2026, 5:00pm

thanks Aki

that is what I hoped - very glad to hear it.

thanks

Greg

gregd · January 14, 2026, 5:08pm

hello Luca

yes, the reference is re PSIS estimates, i.e. elpd_psis-loo. I would also like to think that the improvement with MixIS makes this comparison more reliable, or at least plausible. The original reason I tried MixIS at Aki’s suggestion was the comment somewhere on the loo online pages that loo_compare relies on the absence of high pareto-k values. Given the outliers (not many) in my data, that was an obstacle.

anyway both of you are saying it is ok to approach the comparison in the same way, using the MixIS pointwise values for elpd.

thanks

Greg

luchinoprince · January 15, 2026, 11:02am

Hello Greg,

Yes from a theoretical perspective, as Aki suggested, the comparison can be done irrespective of the elpd estimation method, and its more that logical to assume that more reliable elpd estimates lead to more reliable model comparison.

I just tried to stress that which elpd estimate is better depends on a couple of factors, so always trying to warn against thinking that a method can work in all situations :).

Best,

Luca

gregd · January 15, 2026, 12:18pm

hello Luca

I take your point that in a particular situation PSIS may be better. However, the loo FAQ says “…if in the final stages we are comparing models that have similar performance, there are some high k^ values, and we want to be minimize probability of wrong conclusions, it’s best to fix the problems. There is no threshold for how many high k^ values would be acceptable.” With my actual data and model, I could find no way to fix the small number of high k^ values (moment match failed, an issue explored in an earlier post to this forum) and although it is conceivable that a longer run would resolve this, it didn’t seem to and the run time is already long, so MixIS was a better choice even though I can’t be sure that it is really better! The only real alternative I could see would be to delete the outliers and run the comparison with the rest of the data. But I am particularly interested in the outliers!

thanks again to you and Aki for comments, and for MixIS!

Greg

luchinoprince · January 15, 2026, 12:40pm

Hello Greg,

I do think your situation is where MixIS can shine, so did not want to give any push back on your methodology. Moreover, being its’ first authour, I for sure have all interest in MixIS being as wide spread as possible!

Just wanted to give as much as possible a complete and broad answer, that could work also for other situations. Thanks again for sharing this use case,

Best,

Luca

Topic		Replies	Views
Loo_compare in the presence of high pareto-k brms loo	6	498	June 29, 2025
Loo_moment_match failure beta_binomial example brms loo , brms	4	239	June 27, 2024
Diagnostics for Mixture IS leave-one-out cross-validation? Algorithms loo	1	64	July 21, 2025
Alternative to LOO for simulation studies Modeling loo , posterior-predictive , model-comparison	14	1261	November 17, 2020
ArviZ loo_i Interfaces loo , arviz	19	1713	May 11, 2021

Mixis and E_loo

Related topics