New paper on prior distributions in latent variable models

edm · April 19, 2023, 6:38pm

I’d like to share a preprint titled “Opaque prior distributions in Bayesian latent variable models”, which describes situations where prior distributions behave in ways that many users would not expect. This can especially cause problems for reproducibility and for some model assessment metrics.

We are currently working on revising the paper, so we would appreciate comments, criticisms, relevant literature that we missed, etc.

Bob_Carpenter · April 21, 2023, 4:56pm

Thanks for sharing @edm.

I didn’t understand this comment:

We could instead place priors on individual parameters within the covariance matrix, but, when we build a covariance matrix using these parameters, the resulting matrix will sometimes be non-positive definite. We elaborate on these issues below.

The way Stan works, when you declare

parameters {
  cov_matrix[K] Sigma;
}

then Sigma will always be positive definite (up to numerical precision) and uniformly distributed. We use a Cholesky factor under the hood with log transformed diagonals. But by the time you get to the model block, you have a positive-definite Sigma on which you can place an additional prior. You can even add a Wishart prior and then additional prior terms—Stan only needs to know the prior up to a normalizing constant, so it’s OK to do something like this:

parameters {
  cov_matrix[K] Sigma;
}
model {
  Sigma ~ inverse_wishart(...);
  Sigma[1,2] - Sigma[2, 3] ~ normal(0, 0.1);
}

The result prior on Sigma will be proportional to inverse_wishart(Sigma) * normal(Sigma[1,2] - Sigma[2, 3], 0, 1).

edm · April 21, 2023, 6:46pm

Thanks, we had overlooked that cov_matrix already starts you out with a positive definite matrix!

I worry a bit about replacing a hard constraint with a soft constraint. It seems that, if you have enough data, the normal(0, 0.1) prior becomes weaker and you may end up with a posterior that is far from the equality constraint that you intended. Maybe a solution is to use a prior like normal(0, 0.1/sqrt(n)).

Of course, if the posterior is far from 0, it indicates that the equality constraint was not reasonable in the first place. But the traditional psychometric approach is to fit one model with equality constraints, one model without equality constraints, and compare them.

Topic		Replies	Views
Questions on Merkle et al. preprints on Efficient Bayesian SEM in Stan General specification , blavaan	5	897	August 26, 2020
Covariance prior via lkj_corr_cholesky Modeling fitting-issues , covariance	4	536	December 21, 2023
Implicit prior for correlation matrix Modeling	1	1584	October 24, 2019
Inverse Wishart distribution Modeling	1	575	January 30, 2022
Priors for binomial trials in generalized linear models Modeling	3	512	July 6, 2018

New paper on prior distributions in latent variable models

Related topics