Pathfinder pareto K

cpfiffer · February 8, 2024, 4:11pm

I’m working with Pathfinder on a model that is pretty gnarly – lots of terribly geometry. We’ve been trying to reparameterize for quite a while now and it’s an ongoing project. It’s about a thousand lines of Stan code, so a bit difficult to debug without a lot of eyes. About 66k parameters, most of which are basically random effects I’m trying to marginalize.

We’ve been trying Pathfinder on it to see what happens, and we’re getting Pareto k-values on the order of 63-75. Significantly above reasonable thresholds of 0.5 or 0.7.

My question – does this tell us anything about the model? Aki has suggested that this indicates misspecification, but does anyone have an intuition about what kind of misspecification that might entail?

Could one cause be that there are significant “ridges” in the true posterior that are too steep to be captured by Hessian approximations?

I know that’s difficult to answer without the model, but any top-of-mind ideas would be appreciated.

avehtari · February 8, 2024, 4:40pm

That link is to discussion of high khats when using PSIS-LOO, and doesn’t apply here.

High khats when using Pathfinder indicate that the target is not well approximated by the normal distributions. That is as likely to happen with well-specficied and mis-specified models.

khat diagnostic is based on diagnosing whether the approximate distribution could be used as importance sampling distribution, and usually distribution of importance sampling ratios gets nasty when the number of dimensions increase (ee, e.g. Section 3.3 in [1507.02646] Pareto Smoothed Importance Sampling, and Challenges and Opportunities in High Dimensional Variational Inference). You have a very large number of dimensions so it is likely that khats are big, even if the Pathfinder would get means and variances close to true means and variances. The updated Birthdays example shows use of Pathfinder with posteriors that cause high khats (largest were >10).

cpfiffer · February 8, 2024, 5:20pm

Excellent, thank you for the overview!

Topic		Replies	Views
Pareto k value for pathfinder Modeling techniques	1	78	April 29, 2025
Using pathfinder to initalize sampling Modeling	11	247	November 19, 2024
Interpret pareto k diagnostic Modeling rstan , fitting-issues , loo	3	1658	August 3, 2023
A quick note what I infer from p_loo and Pareto k values Modeling loo	35	15865	August 21, 2022
Using Pathfinder or other method to set initial values for sampling Modeling	32	1296	July 19, 2024

Pathfinder pareto K

Related topics