Data Description:
I have data about participants from multiple nations and want to predict each participant’s response variable as a function of participant gender and an aspect of culture measured on the nation-level. Because the response variable is bounded from 0 to 1 and has many zeros (and no ones), I want to use the zero_inflated_beta family.
Question 1:
When fitting such a model, should I include random effects on the zero-inflation part of the model? I would like to know if gender and culture are related to the likelihood of having a response of zero.
Option A (no random effects on z_i):
brm(
formula = bf(
response ~ 1 + gender * culture + (1 + gender | nation),
zi ~ 1 + gender * culture
),
family = zero_inflated_beta,
prior = c(
set_prior("normal(0, 1)", class = "b"),
set_prior("cauchy(0, 2)", class = "sd")
),
data = out
)
Option B (random effects on z_i):
brm(
formula = bf(
response ~ 1 + gender * culture + (1 + gender | nation),
zi ~ 1 + gender * culture + (1 + gender | nation)
),
family = zero_inflated_beta,
prior = c(
set_prior("normal(0, 1)", class = "b"),
set_prior("cauchy(0, 2)", class = "sd")
),
data = out
)
Question 2:
Any other advice or tips for this type of modeling? Thanks in advance.