Specifying formulates for interaction between categorical variables with the index coding approach in brms

cmagelssen · November 12, 2022, 11:59am

Hi,
I want to use the index coding approach with brms, but I wonder if I have applied and understood it correctly. Most examples illustrate how to apply the case of one categorical factor with multiple levels, but in my case, I have two factors with several levels, and I want to look at the interactions between them. Each participant also has multiple observations at each level of the factors. I have specified three models that I want to compare:

modA <- brm(data = d, 
      family = gaussian,
      performance ~ 0 + course + (0 + course | bib),
      prior = c(prior(normal(0, 0.5), class = b),
                prior(student_t(3, 0, 2.5), class = sigma),
                prior(student_t(3, 0, 2.5), class = sd),
                prior(lkj(2), class=cor)),
                control = list(adapt_delta = 0.95),
                file = "modA_test",
      iter=4000, cores = 4, seed = 1337)

modB <- brm(data = d, 
      family = gaussian,
      performance ~ 0 + course + day + (0 + course + day | bib),
      prior = c(prior(normal(0, 0.5), class = b),
                prior(student_t(3, 0, 2.5), class = sigma),
                prior(student_t(3, 0, 2.5), class = sd),
                prior(lkj(2), class=cor)),
                control = list(adapt_delta = 0.95),
                file = "modB_test",
      iter=4000, cores = 4, seed = 1337)


modC <- brm(data = d, 
      family = gaussian,
      performance ~ 0 + course + day + course:day + (0 + course + day + course:day | bib),
      prior = c(prior(normal(0, 0.5), class = b),
                prior(student_t(3, 0, 2.5), class = sigma),
                prior(student_t(3, 0, 2.5), class = sd),
                prior(lkj(2), class=cor)),
                control = list(adapt_delta = 0.95),
                file = "modC_test",
      iter=4000, cores = 4, seed = 1337)

In modA, I want to examine the difference between the courses, ignoring information about the Day factor.

In modB, I want to explore the estimated change/improvement from day 1 to day 5.

In modC, I want to understand if the changes were different in the three courses.

Given my goal, are the model formulas correctly specified? Or do I have to use a use brms non-linear syntax? In this book, 8 Conditional Manatees | Statistical rethinking with brms, ggplot2, and the tidyverse: Second edition, it seems that it is only necessary to use the non-linear syntax when you have an interaction between a categorical factor and a continuous variable. Please correct me if I am wrong.

Solomon · November 15, 2022, 2:33pm

For the interaction model, try this instead:

performance ~ 0 + course:day + (0 + course:day | bib)

As to the second model, I’m not aware of a way to get brm() to do what you want it to do without the non-linear syntax. Here’s what that could look like for your use case:

bf(performance ~ 0 + c + d 
   c ~ 0 + course + (0 + course |i| bib), 
   d ~ 0 + day + (0 + day |i| bib),
   nl = TRUE)

cmagelssen · November 15, 2022, 3:48pm

Thanks @Solomon. So I don’t need the main effects of course and day included in the interaction model? I was a bit surprised to learn that. The other models seems to work fine. Thanks.

Solomon · November 15, 2022, 3:56pm

To my mind, McElreath’s index approach to interaction models avoids concepts like “main effects.” Rather, his approach simply returns the mean for each group.

cmagelssen · November 15, 2022, 3:58pm

Thank you. It takes some time to consolidate his approach :)

cmagelssen · November 17, 2022, 5:14pm

I guess you also could extend the non-linear syntax to include more complex interaction models, such as:

 bf(performance ~ 0 + a + b * mTime, 
             a ~ 0 + course:day + (0 + course:day |i| bib), 
             b ~ 0 + course:day + (0 + course:day |i| bib),
             nl = TRUE)

mTime is a continuous variable in this case.

Solomon · November 17, 2022, 5:23pm

Maybe. I should confess I’ve only gone so far with the non-linear syntax. Tread with care.

Topic		Replies	Views
Categorical interaction effects while using index notation in {brms} models brms interpret-results	4	393	February 15, 2024
Index approach for categorical predictors in a multi-level model, featuring continuous variables, in brms Modeling brms	7	848	January 8, 2021
Categorical Modeling and priors for interaction terms brms	7	1893	March 26, 2020
Hypothesis testing for interactions in a multivariate model brms techniques	0	180	March 8, 2024
How to code post-hoc comparisons with interactions when using brms and the "hypothesis" function brms brms	4	1006	April 26, 2021

Specifying formulates for interaction between categorical variables with the index coding approach in brms

Related topics