Predict Population-level probability of an ordinal probit model

DoRi · December 30, 2020, 4:09pm

I’m trying to predict, from my model estimated in brms, the population-level probability of responding in each of the categories 1, 2, 3, 4, 5 or 6 (the dependent variable). Is this possible? I need these probabilities to draw some ROC curves. I tried to calculate them manually from the cumulative distribution and the model parameters, but I get different results than from the model.

test <- predict(mod, data.frame(predictor1 = 0, predictor2 = 0, discParameter1 = 0, discParameter2 = 0, SubjectID = NA))

Is setting SubjectID = NA the right way to get the population-level probabilities for my model?

Thanks for your reply :)
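For reference, the manual calculation mentioned above can be sketched in base R. This is only a sketch under assumptions: it uses the cumulative parameterisation as I understand brms to define it, P(Y <= k) = Phi(disc * (threshold_k - eta)), the function name `cum_probit_probs` and the threshold values are made up for illustration, and `disc` is expected on its natural scale. If disc is modelled with a log link (the brms default, as far as I know), its linear predictor has to be exponentiated first, which is one common reason manual results differ from the model's.

```r
# Category probabilities for a cumulative probit model (hypothetical helper).
# thresholds: increasing vector of K - 1 cutpoints
# eta:        linear predictor on the latent scale
# disc:       discrimination on its natural scale (exp() of a log-link predictor)
cum_probit_probs <- function(thresholds, eta = 0, disc = 1) {
  # Cumulative probabilities P(Y <= k), padded with 0 and 1 at the ends
  cum <- c(0, pnorm(disc * (thresholds - eta)), 1)
  # Differences of adjacent cumulative probabilities give P(Y = k)
  diff(cum)
}

# Illustrative thresholds only (not taken from any fitted model)
probs <- cum_probit_probs(thresholds = c(-1.5, -0.5, 0, 0.5, 1.5), eta = 0, disc = 1)
```

Note also that plugging posterior means of the parameters into this formula is not the same as averaging the probabilities over posterior draws, because the transformation is nonlinear; applying the function draw by draw and then averaging should come closer to what brms reports.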

martinmodrak · January 7, 2021, 2:42pm

Hi,
the question is a bit hard to answer without seeing the actual model. Assuming SubjectID is the only varying intercept (random effect) in the model, you can choose from at least three basic prediction tasks that could be considered “population”:

1. Predict the population mean without taking the varying intercept into account, i.e. ignoring the between-subject variability. This can be done by setting the re_formula argument to NA.
2. Predict for a hypothetical, previously unseen subject drawn from the same population, i.e. drawing a completely new varying intercept for each prediction from normal(0, sd_SubjectID). This can be done by setting SubjectID = "any_previously_unseen_value" and using allow_new_levels = TRUE together with sample_new_levels = "gaussian".
3. Predict for a single subject randomly chosen from the population, i.e. taking one of the fitted varying intercepts (different for each sample). This can be done using the steps in 2) with sample_new_levels = "uncertainty" (the default) instead; sample_new_levels = "old_levels" is similar but uses the same fitted subject for all samples.
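The options above could look roughly like this in code. This is a sketch, not tested against the original model: it assumes `mod` is a fitted brmsfit with SubjectID as the only grouping factor and `nd` is a data frame with the predictor columns. The wrapper names and the level name "hypothetical_new_subject" are made up. As I read the brms documentation, sample_new_levels accepts "uncertainty" (the default, a fitted level picked anew for each draw), "gaussian" (a fresh draw from the normal implied by the group-level sd) and "old_levels" (one fitted level used for all draws), so it is worth checking the help page for your version.

```r
# 1) Population mean: drop all group-level terms via re_formula = NA.
epred_population <- function(mod, nd) {
  brms::posterior_epred(mod, newdata = nd, re_formula = NA)
}

# 2) and 3) A subject not seen during fitting; how its intercept is
# generated is controlled by sample_new_levels (see lead-in).
epred_new_subject <- function(mod, nd, sample_new_levels = "gaussian") {
  nd$SubjectID <- "hypothetical_new_subject"  # any value not in the data
  brms::posterior_epred(mod, newdata = nd,
                        allow_new_levels = TRUE,
                        sample_new_levels = sample_new_levels)
}

# For ordinal families posterior_epred() returns an array of
# draws x rows of nd x categories; average over draws to get
# one probability per row and response category.
summarise_probs <- function(epred) {
  apply(epred, c(2, 3), mean)
}
```

Usage would be something like `summarise_probs(epred_population(mod, nd))`, giving one row per row of `nd` and one column per response category.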

Which of those makes the most sense for you depends on the actual question you are asking, no general answer here. There are also some more exotic options which I am leaving out for simplicity.

Speaking specifically about ROC curves (which I admit I am not a big fan of - are you sure you need ROC curves?) - wouldn’t it make more sense to make predictions for the full dataset and then calculate a separate ROC curve for each posterior sample? The ensemble of curves then expresses the model’s uncertainty about the ROC curve. Whether this is sensible obviously depends on the question you are asking, so once again I don’t want to force this on you.
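A minimal base-R sketch of the per-sample ROC idea follows. Several things here are assumptions and not from the thread: the ordinal response is dichotomised (e.g. label = 1 if the observed category is at least some cut-off k), the score is the model's probability of that event, and all function names are hypothetical. With brms, a draws x observations score matrix could be built from `posterior_epred(mod)` by summing the category probabilities from k upwards, but that step is left out so the block runs on its own.

```r
# ROC curve points for one posterior draw.
# score: numeric vector, higher means "more likely positive"
# label: 0/1 vector of the same length
roc_one <- function(score, label) {
  thr <- sort(unique(score), decreasing = TRUE)
  tpr <- sapply(thr, function(t) mean(score[label == 1] >= t))
  fpr <- sapply(thr, function(t) mean(score[label == 0] >= t))
  data.frame(threshold = thr, fpr = fpr, tpr = tpr)
}

# Area under the ROC curve via the Mann-Whitney rank formula
# (rank() assigns average ranks to ties).
auc_one <- function(score, label) {
  n_pos <- sum(label == 1)
  n_neg <- sum(label == 0)
  r <- rank(score)
  (sum(r[label == 1]) - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)
}

# scores: matrix with one row per posterior draw, one column per observation.
# Returns one AUC per draw; their spread reflects posterior uncertainty.
auc_per_draw <- function(scores, label) {
  apply(scores, 1, auc_one, label = label)
}
```

Plotting `roc_one()` for each row of the score matrix gives the ensemble of curves, and `quantile(auc_per_draw(scores, label), c(0.025, 0.975))` would give a rough interval for the AUC.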

Best of luck with your model!

DoRi · January 12, 2021, 2:09pm

Thank you very much for your reply! :) This helped a lot, I’m gonna try to use re_formula.
