Hello,
I have recently had some doubts about conducting inference via the posterior versus the posterior predictive, stemming from this thread in particular (regarding posterior_predict and the average treatment effect) and also from a warning in rstanarm’s documentation about conducting inference via the posterior_linpred() function vs. the posterior_predict() function.
Consider the scenario where we wish to conduct inference on the population median. For this scenario, let’s say I have assumed an exponential likelihood (and, for simplicity's sake, some arbitrary proper priors). The exponential distribution has a closed-form median, obtained by solving 1 - e^{-\lambda m} = \frac{1}{2} for m: \frac{\ln 2}{\lambda}.
I can see two ways of conducting inference on the population median (both sketched in code below):
- Simply calculate \frac{\ln 2}{\lambda} for every sample of \lambda from \text{Pr}(\lambda \mid D)
OR
- Simulate M replicated datasets from the posterior predictive, and take the sample median of each replicated dataset to get M draws of the median.
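To make this concrete, here is a minimal sketch of both algorithms in R. Everything in it is a made-up illustration: I assume a conjugate Gamma(a0, b0) prior on \lambda (one specific choice of proper prior), so that the posterior is Gamma(a0 + n, b0 + \sum y_i) and can be sampled directly, and the data, hyperparameters, and M are all hypothetical.

```r
# Minimal sketch (hypothetical data, prior, and M):
# with a Gamma(a0, b0) prior on lambda and an exponential likelihood,
# the posterior is Gamma(a0 + n, b0 + sum(y)), so we can sample it directly.
set.seed(1)
y  <- rexp(50, rate = 2)   # made-up observed data
a0 <- 1; b0 <- 1           # made-up prior hyperparameters
M  <- 4000                 # number of posterior draws

lambda <- rgamma(M, shape = a0 + length(y), rate = b0 + sum(y))

# Algorithm 1: push each posterior draw of lambda through the closed form.
median_1 <- log(2) / lambda

# Algorithm 2: simulate one replicated dataset per posterior draw,
# then take each replicated dataset's sample median.
median_2 <- sapply(lambda, function(l) median(rexp(length(y), rate = l)))
```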
Is the warning in rstanarm’s documentation alluding to using algorithm 2) instead of algorithm 1), i.e., encouraging the posterior predictive rather than just the posterior?
Intuitively, I think both algorithms should give approximately the same thing for a large number of samples, though perhaps 1) is more efficient than 2) because 2) relies on a nonparametric sample median rather than the closed form. Additionally, I would imagine that 2) produces wider intervals in general, because simulating from the posterior predictive incorporates additional residual uncertainty on top of the parameter uncertainty (the snippet below compares the interval widths). If we wish to fully incorporate both parameter and residual uncertainty, then I can see why 2) would actually be preferred, if what I am saying is true.
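Continuing the hypothetical sketch above (same assumed setup, reusing the median_1 and median_2 vectors drawn there), the interval widths can be compared directly:

```r
# Compare 95% interval widths from the two algorithms.
width <- function(x) diff(quantile(x, c(0.025, 0.975)))
width(median_1)  # algorithm 1: posterior of the population median
width(median_2)  # algorithm 2: medians of replicated datasets; I'd expect this to be wider
```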
Is one approach preferable to the other, or is one blatantly incorrect? Am I completely on the wrong track here?
Thanks in advance!
EDIT: Looks like I was just confusing the two distributions: \text{Pr}(\text{median} \mid \lambda) vs. \text{Pr}(\text{median} \mid D), though I’d still be interested to hear more thoughts on this to make sure I have it right.