Assessing the adequacy of the assumed population "random effects" distribution

sourdough · November 15, 2018, 11:11pm

In multilevel models, it is typical to model the coefficients for particular groups (e.g., effects of interventions in particular schools) as drawn from a normal population distribution.

Within the brms environment (or more generally within RStan), what options are available for evaluating the adequacy of this modeling assumption? Are there specific kinds of graphical predictive checks that are particularly useful for this question?

Any advice is greatly appreciated! Thanks.

paul.buerkner · November 16, 2018, 7:43am

You could extract the coefficients with ranef() or coef() and then plot them to see how closely they resemble a normal distribution.

But there is more to it. Even if the normal distribution might not perfectly fit the distribution of the varying effects, it still does it’s job which is to provide shrinkage and thus be less exited about extreme patterns in the data. I believe there are some simulations studies out there (using frequentist models) that show that mean coefficients and other hyperparameters are unlikely to be mis-represented even if the distribution of the varying effects is way different than normal.

sourdough · November 16, 2018, 3:29pm

Interesting, thanks Paul!

I know that in BDA section 17.4, there is a sensitivity analysis reported where the model is re-fit with a t-distribution for the school effects, yielding essentially unchanged school-specific estimates.

Presumably, if the chosen distribution were extremely poor (e.g., the model uses a normal distribution to describe the heterogeneity, but the true distribution is bimodal) you would get sensitivity of the estimates to the prior, and you would also see other problems with the model, right?

For example using the “stat_grouped” method within pp_check, I am guessing that the model-simulated distributions of each person’s mean yrep would poorly match the actual observed y means. Is that correct? Thanks so much!

paul.buerkner · November 16, 2018, 3:42pm

It’s possible, perhaps likely that you will see problems, but that will surely depend on your data and model at hand.

sourdough · November 16, 2018, 3:43pm

makes sense, thank you.

Topic		Replies	Views
Interpretation brms results brms interpret-results	17	6066	November 13, 2018
Colinearity issues arise in multilevel multivariate models (but not in submodels) brms	13	2186	March 15, 2019
BRMS: Can I fit only random effects for a subset of data, and not have this data impact the remaining parameter estimates? brms techniques , specification	9	2014	March 4, 2022
Parameter contrasts in Bayesian linear regression model using brms brms	11	2293	February 26, 2019
Random effects for ordinal multilevel regression brms	2	618	April 17, 2022

Assessing the adequacy of the assumed population "random effects" distribution

Related topics