Addition of AR(1) process in brms model results in high pareto-k values from loo

martinmodrak · November 22, 2021, 8:01pm

Can’t really delve deep into this due to time constraints, but if you inspect the generated code (via make_stancode), I would expect the AR term to be represented by a set of nuisance parameters, one for each observation representing the autoregressive term, because there is no closed form. If this is the case, those nuisance parameters basically by their definition depend strongly on the individual observation. The pareto k in loo checks roughly (in my limited understanding) whether some parameters depend strongly on a single observation. And those nuisance parameters do (by definition) depend on single observations, so you’ll get high pareto k.

The short answer therfore is that for this type of models (with nuisance parameters) you cannot use loo easily. The longer answer is that you can IMHO try to compute loo anyway by integrating the nuisance parameters out, but that’s not straightforward (see Is LOO valid for models with missing outcome when using a complete case dataset that is a subset of the original data? - #7 by martinmodrak for a similar, but simpler case - I think 1D integration might not be enough in your case).

Best of luck with your model!

Topic		Replies	Views
Model converges but high k pareto values Modeling loo	11	973	November 2, 2022
Loo_compare in the presence of high pareto-k brms loo	4	303	June 25, 2024
Improve model with some observations pareto >0.7 brms loo	1	1164	August 18, 2020
Large Pareto K-values and significant Moran I with CAR structure Modeling fitting-issues , loo , spatial , brms	1	188	May 18, 2024
LOO - uncertainty in pareto-k estimates Modeling loo	2	561	September 14, 2021

Addition of AR(1) process in brms model results in high pareto-k values from loo

Related topics