Use simulated or observed covariates in prior predictive checks?

dmuck · September 7, 2021, 1:38am

In a typical Bayesian workflow, is it recommended to simulate our covariates before generating the prior predictive distribution? Are there any potential problems with using our observed covariates to check our priors?

martinmodrak · September 12, 2021, 6:31pm

Good point. I currently believe that using the observed covariate values can in fact be beneficial: you don’t have to guess the plausible range of values, etc. The biggest advantages of simulating covariate values are, IMHO:

- it can be done before you collect data
- it can be done once and reused for a class of similar model-dataset combinations

The biggest potential problem I see with using observed values is that you can be tempted to sneak some properties of the observed outcomes into what you’ll consider a good prior. This needs to be resisted and only properties that are defensible without reference to observed outcomes should be used to guide your priors.
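
To make the distinction concrete, here is a minimal sketch of a prior predictive check run once with simulated covariates and once with observed ones. Everything in it is an assumption for illustration and not taken from any particular model: the simple linear regression, the standard-normal priors, the function name `prior_predictive`, and the `x_observed` values, which are only a stand-in for the covariate column of a real dataset. The point is that the function only ever receives covariates, so observed outcomes cannot leak into the check.

```python
import numpy as np


def prior_predictive(x, n_draws=1000, seed=0):
    """Simulate outcomes from the prior alone, given covariate values x.

    Assumed model (illustrative only):
        alpha ~ Normal(0, 1), beta ~ Normal(0, 1), sigma ~ HalfNormal(1)
        y ~ Normal(alpha + beta * x, sigma)
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)

    # One parameter draw from the prior per simulated dataset.
    alpha = rng.normal(0.0, 1.0, size=n_draws)
    beta = rng.normal(0.0, 1.0, size=n_draws)
    sigma = np.abs(rng.normal(0.0, 1.0, size=n_draws))

    # Broadcast to shape (n_draws, len(x)): one row per simulated dataset.
    mu = alpha[:, None] + beta[:, None] * x[None, :]
    return rng.normal(mu, sigma[:, None])


# Option 1: simulated covariates, usable before any data are collected.
# The N(0, 1) choice is a guess at a plausible covariate range.
x_simulated = np.random.default_rng(1).normal(0.0, 1.0, size=50)

# Option 2: observed covariates. These values are a made-up stand-in for
# the covariate column of a real dataset; the outcome column is never read.
x_observed = np.linspace(-2.0, 3.0, 50)

y_sim = prior_predictive(x_simulated)
y_obs = prior_predictive(x_observed)

# Compare the range of outcomes the priors imply under each choice.
for name, y in [("simulated x", y_sim), ("observed x", y_obs)]:
    lo, hi = np.quantile(y, [0.025, 0.975])
    print(f"{name}: 95% of prior predictive outcomes lie in [{lo:.2f}, {hi:.2f}]")
```

If the two intervals differ a lot, the guessed covariate range probably doesn't match the real one. In either case, whether the implied outcome range is sensible should be judged from domain knowledge, not by comparing against the observed outcomes.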

Best of luck with your checks!
