Use Posterior_predict in rstanarm to generate probabilites for each observation in a logistic regression model

dilsherdhillon · February 21, 2019, 10:30pm

Say I fit a logistic regression model

m1<-stan_glm(response~predictor1 +predictor2,family="binomial").

I would like to generate probabilities for each response. Using posterior_predict

preds<-posterior_predict(m1)

I generate a response 1 or 0, for each observation from 4000 draws(the number of posterior draws in my case).
Can I then take a proportion of 1’s in the 4000 draws and use this as my probability measure?

Eventually, I would like to generate a ROC curve and calculate PPV and NPV

bgoodri · February 21, 2019, 11:08pm

In this particular case, it is better to use

mu <- posterior_linpred(m1, transform = TRUE)

to get the posterior distribution of the conditional mean, rather than to get the predictive distribution and average over it. They are equivalent in principle, but with a finite number of draws, the latter can be noisy.

I would use log predictive density to measure predictive accuracy rather than all of that ROC stuff, but whatever you are doing with it, it is better to not use excessively noisy inputs.

dilsherdhillon · February 21, 2019, 11:17pm

Thanks!

I would love to learn more about how to use the log predictive density to measure accuracy. Could you point me to a good reference?

bgoodri · February 22, 2019, 12:26am

dilsherdhillon · February 22, 2019, 3:34pm

Thank you! Very interesting.

I had another question on the posterior_linpred. I get a probability on each observation for each of the draws. If I use the 2.5 percerntile and 97.5 percentile as lower and upper bounds, would these be 95% prediction intervals or 95% credible intervals?

Thanks!!

bgoodri · February 22, 2019, 4:06pm

credible intervals

dilsherdhillon · February 22, 2019, 4:10pm

Thanks!

Topic		Replies	Views
Posterior Predictive Checks After Sampling Modeling	3	810	October 23, 2022
Posterior predictive check for binary outcome in Stan Modeling	2	816	November 10, 2020
Posterior_linpred for brms brms	6	1168	March 5, 2020
Posterior_predict() prediction scale/transformation? rstanarm	2	2016	January 23, 2018
CmdSTAN and posterior prediction Modeling	6	1413	March 21, 2019

Use Posterior_predict in rstanarm to generate probabilites for each observation in a logistic regression model

Related topics