On LOO CV for non-factorizable normal models

Elizaveta_Semenova · January 18, 2019, 11:02am

Dear all,

I have a question concerning LOO-CV for non-factorisable normal models, developed in the paper “Leave-one-out cross-validation for non-factorizable normal models” by Bürkner, Gabry, Vehtari. The idea of the paper is that for models, which can be written in the form y \sim N(0, C), there is a straignforward way for the calculation of the point-wise log-predictive density, using the two quantities: \bar{c} = \text{diag}(C^{-1}) and
g = C^{-1}y. Then the point-wise log-predictive density can be found as \log p(y_i \mid y_{-i},\theta) = -\frac{1}{2} \log(2 \pi) + \frac{1}{2} \log \bar{c}_{ii} - \frac{1}{2}\frac{g_i^2}{\bar{c}_{ii}}. Note, that in all the expression above y denotes an actual data vector.

Further in the paper the authors give an example of SAR, modeling spatial data, and notice that the model can be presented as y-W^{-1}\eta \sim N(0, C). Here, as before, y is the actual data, while W^{-1}\eta are the estimates obtained via model fitting. Fair enough, we still evaluate the estimates, be they on the left- or on the right-hand side of “~”, against the real data y.

And now my question comes. I work extensively with the point pattern data and Log-Gaussian Cox process (LGCP) model. It can be understood as a spatial Poisson process with random location-dependent intensity \lambda(s). On the log-scale \lambda with covariates can be written as a non-zero mean GP: log(\lambda) = X’ \beta + N(0, C). I would like to be able to compare such models. The problem is that, unless the model formulation is compromised, there is no explicit data vector y, i.e. both parts in log(\lambda) - X’\beta \sim N(0, C) consist of the estimated quantities. (The model fitting is done via the explicit LGCP likelihood, which uses the entries of \lambda as parameters.)

My suspicion is that due to the absence of the real data vector y, I will only indirectly measure convergence and not the predictive ability of the model, by applying the above method. Are there any supporting or alternative opinions?

avehtari · January 21, 2019, 6:20pm

So you have a finite set of locations?

Can you include the equation and corresponding Stan code? I think I know the answer, but I just want to be certain that we are using the same words.

Topic		Replies	Views
Pointwise log lik for multi-response Poisson model Modeling loo , multivariate-normal	6	92	November 14, 2024
LOO-CV for non-bayesian models (too stupid idea?) General loo	2	512	March 1, 2019
Inquiry on the article: Efficient leave-one-out cross-validation for Bayesian non-factorized normal and Student-t models Modeling loo	2	61	January 20, 2025
Model comparison between independent normals and multivariate normals Modeling	11	159	October 21, 2024
Loo: calculating point-wise log-lik when using data augmentation for censored observations Modeling loo	10	842	June 24, 2021

On LOO CV for non-factorizable normal models

Related topics