Deterministic and random variables in Stan

aloaberasturi · March 4, 2021, 6:26pm

Hello everybody!
First of all, I apologize if I am asking something obvious, I have very little experience working with Stan and in general with Bayesian statistics.
I want to create a model of peer assessment. Given a set of grades by a peer X to a group of students, and given a set of grades by a different peer Y to that same group of students, I am computing in the transformed data block the mean and variance of the resulting differences. In other words: if X has assessed v,w and z with grades G^{x}_{v}, G^{x}_{w}, G^{x}_{z}; and Y has assessed these same students with grades G^{y}_{v}, G^{y}_{w}, G^{y}_{z}, I am computing the mean \mu_{xy} = \frac{\sum_{i}G^{x}_{i} - G^{y}_{i}}{|\{G^{x}\}\cap\{G^{y}\}|}, where the denominator denotes the number of peers assessed both by X and Y. Similarly, the variance is computed as \sigma_{xy} = \frac{\sum_{i}(G^{x}_{i} - G^{y}_{i} - \mu_{xy})^{2}}{|\{G^{x}\}\cap\{G^{y}\}|}.

I intend now to use \sigma_{xy} and \mu_{xy} as the parameters of a normal distribution such that b \sim \mathcal{N}(\mu_{xy}, \sigma_{xy}), where b is a new variable.
Seeing this as a PGM, we would have the set of grades G^{x}_{v}, G^{x}_{w}, G^{x}_{z}, G^{y}_{v}, G^{y}_{w}, G^{y}_{z} as observed nodes, all of them parents of data-derived \mu_{xy} and \sigma_{xy}, which are likewise parents of an unobserved child b. As you may notice, given the deterministic definitions of \mu_{xy} and \sigma_{xy}, their existence depends on |\{G^{x}\}\cap\{G^{y}\}| \neq 0. Does Stan support such a model? Could I define b as a parameter?

n97 · March 8, 2021, 5:17pm

Hello!

No worries - so if I’m understanding you correctly you’d like to create a PGM that looks a little like this:
(the set of grades) → data-derived mean and variance → your distribution b
I’m unsure if Stan support graphical models so hopefully someone can step in here, but yes you can define b as a distribution and since you have a fixed mean and variance (based on the context you’ve told us!) it should be fairly straightforward to sample from. :)
The one consideration I might have about your latter question, is if you have any reason to believe your set of grades follow a particular distribution - that might affect how you go about sampling from your distribution b.

stevebronder · March 8, 2021, 9:30pm

Bit out of my depth but is this related to multi membership models? If so the brms package has multimember model support so may be helpful

aloaberasturi · March 19, 2021, 12:21pm

First of all, thank you both very much! I finally was able to make the model work! Regarding @n97 's question about the distribution followed by the grades, yes, the data that I gathered shows what seems to be something similar to a gaussian centred around 7 on a scale from 0 to 10.
I was not thinking from the perspective of multi-membership terms, @stevebronder , but that link did turn out to be useful for another project I’m in, so thank you very much for the hint!

Topic		Replies	Views
How to model mixed variables Gaussian, Ranking binary and categorical in stan all together to generate a joint distribution of them? Modeling	2	452	October 14, 2023
Specifying priors for a matrix Modeling	2	2064	June 6, 2019
Priors for binomial trials in generalized linear models Modeling	3	512	July 6, 2018
Mixed models with deterministic and random variables Algorithms optimization , variational-bayes	7	2258	August 1, 2017
Modelling test scores Modeling	5	801	November 5, 2019

Deterministic and random variables in Stan

Related topics