Making predictions from a binomial model: why does newdata require part of the outcome variable to be defined?

di4oi4 · March 30, 2022, 10:29am

I have a model as follows. It models the proportion of successful applications out of the total number of applications.

m = brm(admit|trials(applications) ~ sex, data = data, family = binomial())

When making predictions using “newdata” and tidybayes’s “epred_draws()”, why do I also have to specify “applications” in newdata? Otherwise “epred_draws()” won’t run. Usually newdata only includes the predictors and their levels. And how should I choose a value for “applications”?


newdata = expand_grid(sex = c("female", "male"),
                      dept = "A",
                      applications = 1)

Solomon · March 30, 2022, 1:48pm

If I’m following your model correctly, we might express it as

\begin{align*} \text{admit}_i & \sim \operatorname{Binomial}(n_i = \text{applications}_i, p_i) \\ \operatorname{logit}(p_i) & = \beta_0 + \beta_1 \text{sex}_i. \end{align*}

In such a case, applications isn’t really part of your outcome variable. Rather, applications is part of the likelihood.
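
To make the role of applications concrete, here is a short sketch of the expected value behind an epred-style summary for this model. It assumes, as I understand brms to do for binomial families, that the expected value is reported on the count scale rather than the probability scale; that is worth checking against the brms documentation for your version.

\begin{align*} \operatorname{E}(\text{admit}_i) & = n_i p_i \\ & = \text{applications}_i \cdot \operatorname{logit}^{-1}(\beta_0 + \beta_1 \text{sex}_i). \end{align*}

Under that assumption, setting applications = 1 in newdata gives n_i = 1, so the expected value reduces to p_i, the admission probability. Any other value just rescales the result to the expected number of admissions out of that many applications.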
