Posterior random effects distribution

Nic · October 25, 2021, 4:04pm

I’m interested in plotting the density distribution of random effects. A very similar topic (sampling from the random effects distribution) is discussed in some depth here:

github.com/paul-buerkner/brms

Sampling new offsets from a model with random effects

opened 03:45PM - 02 Mar 17 UTC

closed 08:03PM - 27 Mar 17 UTC

tomwallis

feature

Hi Paul, In Bayesian power analysis, one wishes to 1. generate sample dat…asets from a hypothetical or real model fit (for example, a fit to pilot data), 2. fit a new model to each sampled dataset, then 3. assess the proportion of samples in which a goal is achieved (See second edition of Kruschke's book for more detail on the approach). This allows us to assess (e.g.) given a sampling plan of N observations, how often is our goal achieved? The mechanical way to do this, outlined by Kruschke in his book, is to generate new datasets from some number of MCMC samples (which represent jointly-credible model parameters). Each MCMC sample provides one simulated dataset. In the context of a model with one or more random effects, I believe this would mean sampling new random effects offsets from the model for a given number of samples from the random variable. **It would be great if there was a way to do this easily from within `brms`**. As a more concrete example, consider a linear mixed effects model like: `y ~ x + (x | subj)` where `x` might be a covariate with some slope and `subj` is (for example) the index of each subject in an experiment. This is a "varying intercept, varying slopes" model with a random effect of subject on both fixed effects. We also allow the offsets to be correlated. Imagine I fit this model in `brms`, and now I want to make predictions for new data. I can use the `predict` method to generate new `y` values, and I can sample from a new data frame, allowing new levels of `subj`: `prediction_data <- expand.grid(x = seq(-1, 1), subj = paste0("Sim", 1:10))` `preds <- predict(fit, newdata = prediction_data, allow_new_levels = TRUE)` However, in this case the new subjects all receive the same prediction (you can verify this by looking at the `fitted` regression line). Differences in `predict` between subjects are only due to the measurement error of the model. That is, the predict method with new data is giving a prediction for a "new, average subject". If I understand correctly, then doing a Bayesian power analysis appropriately in this setting would require *sampling* new subjects from the model. That is, in the example above, each of the 10 subjects would be given offsets to the fixed effects sampled from a zero-mean two-dimensional Gaussian, taking the estimated marginal variance and correlation structure given by `fit` into account. I can imagine doing this by hand, but it would be lovely to have a general function to do this in `brms` (possibly as an argument to the `predict` method, in which random effects are "sampled" rather than "zeroed"). Or have I misunderstood what the same regression line for all random effects buys me?

Basically, there would be two approaches: (1) draw from a normal distribution with the estimated standard deviation sigma, (2) draw random samples from the empirical bayes estimates of random effects in your data.

However, both of these have downsides: (1) Sampling from the normal distribution does not incorporate any information from the data as to whether random effects may follow a somewhat different distribution. (2) Random samples from EB estimates should be affected by shrinkage and less varied than sigma.

Is there a way to sample from a latent random effects distribution (i.e., not shrinked) that still represents a posterior (i.e., incorporates information from the data on the actual distribution)? This would be ideal for visualizing posterior random effects distributions.

To clarify: Let’s say the “real” random intercept has a bimodal distribution with a variance of 0.5 in the population. The first approach would return a normal distribution with var = 0.5 (i.e., not include the bimodal part) and the existing RE estimates would return a bimodal distribution shrunk towards the average with var < 0.5 (due to shrinkage). Is there a way to visualize estimates of the latent distribution (bimodal, var = 0.5)?

martinmodrak · October 29, 2021, 7:41am

Hi,
I think there might be some confusion about how brms and Stan in general works.

“empirical bayes” is a name of a family of computational methods that are however not used in brms. I guess you meant using the fitted model coefficients?

I don’t think there is a method that would fulfill your requirements directly. The main problem IMHO is that the standard random effects formulation used in brms assumes the distribution of random effects is normal. If that assumption is wrong, your inferences might be problematic. I think that the general pattern you are implicitly using here, i.e. “fit data assuming P, then analyze the posterior as if P is false” is inchorent for most use cases. It is possible that in specific cases, this would yield sensible answers, but I don’t think there are any gurantees it would do so generally.

A better approach would IMHO be to check (e.g. via posterior predictive checks) that the assumption of normality of random effects is not grossly violated. If it is not, you can use the fitted sigma of the random effects as a good summary. However, if the assumption is violated, you probably should be changing your model to accomodate this rather than trying to salvage the situation by analysing the fitted random effects.

Additionally, I don’t think there is any guarantee that if the the “real” distribution is bimodal, that the fitted random effects would also show bimodality or - more broadly - represent the “real” distribution well - that would depend on both how much data you have for the individual random effects (with little data, the hyperprior might smooth out all the “real” structure) and how many levels the random effects have (if you don’t have a lot of levels, the fitted random effects will not provide a lot of information about the “real” distribution).

Does that make sense?

Best of luck with your model!

Topic		Replies	Views
Offset weights in mixed-effects model in brms brms specification , hierarchical-model	2	823	January 9, 2022
Fit random effects for new levels without refitting entire model? brms	4	684	April 11, 2023
How can I simulate a greater dataset (brms/Bayesian analysis) Modeling rstan , brms	2	292	December 8, 2023
BRMS: Can I fit only random effects for a subset of data, and not have this data impact the remaining parameter estimates? brms techniques , specification	9	2021	March 4, 2022
Parameter contrasts in Bayesian linear regression model using brms brms	11	2298	February 26, 2019

Posterior random effects distribution

Related topics