Translating cross-classified and multiple membership model estimated in brms into equation

Pia · May 13, 2021, 11:31am

Hi all, I would like to report two models I have estimated in brms and am hoping someone here can help me with the equation. I have given it a go but don’t know anyone I could ask directly, so I am hoping for the help of the community. I appreciate that this is quite long though…

Cross-classified model with students (L1) nested in the cross-classification of teachers (L2) and classrooms (L2)

Brms model is:

Y ~ 0 + Intercept + V1_cgm + V2_cgm + V3_cwc + V4_cwc + V5_cwc + V6_cwc +V7_cwc + V8_cgm + (0 + Intercept + V1_cgm + V2_cgm + V3_cwc + V4_cwc + V5_cwc + V6_cwc + V7_cwc |p| T1_ID) + (0 + Intercept + V1_cgm + V2_cgm + V3_cwc + V4_cwc + V5_cwc + V6_cwc + V7_cwc |q| C1_ID), family = skew_normal())

Where

V1-V7 are student-level variables (L1), modelled as random effects
V8 is a teacher/classroom level variable (L2)
T1_ID is the teacher unit
C1_ID is the classroom unit
cgm = grand-mean centered
cwc = centered within cluster

This is what I have got:

The formula at Level 1 is

and, at Level 2

Screenshot 2021-05-13 at 12.23.32

2) Cross-classified multiple membership model with students (L1) nested in the cross-classification of teachers (L2) and classrooms (L2), whereby students can belong to multiple teachers and classrooms

Y ~ 0 + Intercept + V1_cgm + V2_cgm + V3_cgm + (0 + Intercept + V1_cgm + V2_cgm |p| mm(T1_ID, T2_ID, T3_ID, T4_ID, weights = cbind(W_T1, W_T2, W_T3, W_T4))) + (0 + Intercept + V1_cgm + V2_cgm |q| mm(C1_ID, C2_ID, C3_ID, C4_ID, weights = cbind(W_C1, W_C2, W_C3, W_C4))), family = skew_normal())

Where
V1 and V2 are student-level variables (L1), modelled as random effects
V3 is a teacher/classroom level variable (L2)
T1_ID is the teacher unit
C1_ID is the classroom unit
cgm = grand-mean centered

And the equation I have got so far is

The formula at Level 1 is

and, at Level 2

where,

If someone takes the time to look through these, many thanks in advance!

PS. So sorry for the formatting - I gave my best

martinmodrak · May 20, 2021, 8:52pm

This looks roughly correct (but I didn’t check all the details), although I personally prefer syntax with “sampling statements” (e.g. as in equtiomatic documentation lme4::lmer Models) to having symbols for residuals, but that’s an aesthetic preference In the formula. I’ll however note that your family is skew_normal, while the formula declares the residual as just normal.

Best of luck with writing!

Pia · May 21, 2021, 11:23am

Thank you very much for having a look at this! And thanks for the link and pointing out the mistake. Is this the correct way of specifying residuals as having a skew-normal distribution?

Thanks

martinmodrak · May 21, 2021, 12:57pm

I don’t think there is a “correct” way, there are many different conventions and the important part is that people can understand what you did (to that effect, many people would IMHO actually prefer just to see the formula of the model as you used in brms than a math formula). I think skew-normal does not have a well-accepted short notation, so I would rather spell it out as \sim \mathrm{SkewNormal}(0, \omega^2, \alpha). I am however not 100% sure if y \sim \mathrm{SkewNormal}(\mu, \omega^2, \alpha) (which is what brms does is euqivalent to

y = \mu + e \\ e \sim \mathrm{SkewNormal}(0, \omega^2, \alpha)

I just don’t understand skew normal well enough.

paul.buerkner · May 21, 2021, 3:03pm

In brms, I am using a parameterization suggested by @Stephen_Martin. Perhaps he as some suggestions how to write it down.

Pia · May 23, 2021, 8:28am

Thank you. Really appreciate your help!

Stephen_Martin · May 24, 2021, 5:53pm

If I recall correctly (it’s been a few years) - I recommended a reparameterized skew normal, in which the ‘location’ parameter is the expected value. If that’s the case, then yes - I think one could say e ~ SN(0, w, a), so long as it’s clear that the ‘0’ is the mean, and not the xi parameter.

Edit: I don’t recall if we further reparameterized so that the ‘scale’ parameter is the sqrt(variance), or if we left the scale to be omega.

paul.buerkner · May 25, 2021, 8:52am

Thanks! I think I remember now that we made the scale parameter the SD as well but kept the skewness parameter of the original parameterization rather than the one typically used for mean-SD parameterization, forming a kind-of hypride between two common parameterizations. I think you had run some experiments with that and found it to be more sampling efficient, but I really don’t know for certain anymore.

Stephen_Martin · May 25, 2021, 7:46pm

I think you’re right, yes. Basically, it’s the definition from the Stan manual and wiki page, but reparameterized to be on mu and sd; but the alpha-skewness-parameter (real on -inf to inf) remains the same. The alternative is parameterizing in terms of the skewness coefficient directly, but it had some weird limit of like, -.998 to .998, and was much less efficient than keeping the parameter on -inf to inf.

Topic		Replies	Views
Help with formula notation for multivariate model in brms vignette brms	1	911	February 20, 2020
Cross-classified multiple membership models with brms brms	9	3202	July 6, 2019
Choosing family and link brms	24	3801	June 25, 2019
Trouble with Mathematical Model brms	2	731	July 29, 2020
Multivariate analysis brms brms	9	774	May 2, 2025

Translating cross-classified and multiple membership model estimated in brms into equation

Related topics