How to interpretate the results of a cumulative log model in brms (calculating the probabilities of being in a category))

Charlotte003 · August 20, 2024, 4:33pm

Hello,

I’m working with an ordered categorical outcome variable in my model and am trying to interpret the results. Specifically, I’m analyzing how workload (measured as antwoordtekst) affects the employee loyalty index (eNPS), which is categorized into three ordered levels.

Here’s the model I used:

formula ← bf(category ~ antwoordtekst + (1 + antwoordtekst || technische_sleutel), family = cumulative(“logit”))

fit_workload_1 ← brm(formula = formula, data = testset_workload, family = cumulative(“logit”), iter = 1000, chains = 2, warmup = 500, cores = parallel::detectCores())

In the output, I have the following regression coefficients:

Regression Coefficients:
Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
Intercept[1] -6.94 0.17 -7.28 -6.62 1.01 251 437
Intercept[2] -3.50 0.15 -3.80 -3.20 1.01 321 477
antwoordtekst -1.13 0.05 -1.22 -1.04 1.01 295 491

I understand that the Intercept[1] and Intercept[2] represent the thresholds for the categories. Specifically, does Intercept[1] indicate the logit of being in category 2 or higher, and Intercept[2] indicate the logit of being in category 2 or higher? But it is very strange that both of the intercepts are negative. In the data most of the people are in category 3. How do I calculate the probability correctly?

joels · August 20, 2024, 7:23pm

Welcome to the community Charlotte!

The intercepts are thresholds for the logit of being below these levels, rather than above. In addition, the logit for a given observation also includes the effect of antwoordtekst.

The model is returning predictions on the logit scale in the following form:

Below 1st or lowest outcome category: -6.94 - 1.13 * antwoordtekst
Below sum of 1st and 2nd lowest categories: - 3.50 - 1.13 * antwoordtekst

You can turn these into probabilities with the inverse-logit function: 1 / (1 + exp(-x)), where x is one of the linear predictors above (e.g., -6.94 - 1.13 * antwoordtekst). The probability of being in the top category is just one minus the probability of being in the first two categories.

brms has the inv_logit_scaled function to convert from the logit scale to the probability scale, so you can do:

inv_logit_scaled(-3.50 - 1.13 * antwoordtekst)

# Specific example:
inv_logit_scaled(-3.50 - 1.13 * c(0.5, 1, 1.5))
[1] 0.016873391 0.009660523 0.005513647

The fact that the intercepts are relatively negative (along with the negative coefficient for antwoordtekst) indicates that the probability of being in the top outcome category is very high, as you expected from your data.

JLC · August 20, 2024, 11:22pm

The tidybayes package offers the add_epred_draws() function, which will give you per-category probabilities for a cumulative brms model.

Charlotte003 · September 3, 2024, 1:01pm

Thanks that helps a lot.

Topic		Replies	Views
Effect sizes for categorical variable in brm cumulative logit model brms	5	1023	July 20, 2018
Interpreting results from categorical() with brm brms interpret-results	6	4655	April 2, 2024
Interpretation of conditional effects in cumulative models using posterior linpred Modeling interpret-results , brms	0	639	August 30, 2022
Understanding brms model code: Ordered Probit with Ordered Predictor (monotonic effects) Modeling rstan , cmdstanr , brms	1	557	October 3, 2023
Reference levels and model interpretation brms multinomial-response , interpret-results	2	109	February 26, 2025

How to interpretate the results of a cumulative log model in brms (calculating the probabilities of being in a category))

Related topics