Auto-grouping option in brms' gp()

ermeel · September 28, 2019, 1:48pm

I wonder whether the brms Gaussian Process (GP) support through gp actually supports hierarchical GPs when setting gr=T.

What I mean by hierarchical GP:

Let \vec{M} and \vec{0} be m-dimensional.

So we have

\vec{M} \sim \mathcal{GP}(\vec{0}, \mathbf{K}_1)

where \mathbf{K}_1 is m\times m dimensional. This is what I would call the population mean GP. Now for each of the N individuals we have that the observation \vec{y}_i (also m-dimensional) of individual i is i.i.d. from the following GP:

\vec{y}_i \sim \mathcal{GP}(\vec{M}, \mathbf{K}_2)

where \mathbf{K}_2 is another m\times m kernel. Now suppose each individual has observations for the same m-dimensional vector of time points, couldn’t one then do in brms

fit <- brms(y ~ gp(time,gr=T) + gp(time,gr=F), data=df)

and the term gp(time,gr=T) would correspond to the population GP with kernel \mathbf{K}_1 and gp(time) to the individual GP with kernel \mathbf{K}_2.

paul.buerkner · September 28, 2019, 3:03pm

I think this would indeed imply a hierachical GP in your formulation. You could try running the model, but I would expect it to have serious convergence issues to the the GPs fighting over the same variance. For this to work reliably we may need to joint priors over the GPs, which are currently not implemented. See also https://github.com/paul-buerkner/brms/issues/412

ermeel · September 28, 2019, 4:51pm

Thanks Paul. In order to get closer to an understanding of what such a joint prior would need to look like, could you help me collecting the properties it needs to fulfil more precisely (e.g. which pathologies it needs to avoid)?

I understand that in the above additive 2-level setting one can “freely” add a ~~constant~~ varying vector \vec{d} to each second-level GP realisation (that is to each \vec{y}_i) and add the negative of it to the first-level GP realisation \vec{M}.
…

paul.buerkner · September 28, 2019, 4:56pm

Think of it as a multilevel model. If you had an overall intercept and an individual intercept per group without any hard or soft-centering (the latter via priors) we will run into linear dependence and non-idenfitied models. The same will presumably happen here. I am not sure about the best solution. Likely people have worked on this (@avehtari?). If not, this is actually a relevant research topic I would say.

avehtari · September 29, 2019, 8:56am

One option is decov priors used in rstanarm
See also Bayesian functional ANOVA modeling using Gaussian process prior distributions

ermeel · September 29, 2019, 1:46pm

Thank you!

That is something like that for the two marginal standard deviations corresponding to the two exponentiated square kernels?

\\ ...
data {
 \\...
 vector<lower=0>[2] concentration;
}
parameters {
  \\...
  real<lower=0> tau;
  simplex[2] pi;
}
transformed parameters {
  \\...
  real<lower=0> alpha1;
  real<lower=0> alpha2;
  \\...
  alpha1=pi[1]*tau;
  alpha2=pi[2]*tau;
}
model {
    \\...
    pi ~ dirichlet(concentration);
    tau ~ gamma(1,1);
}

Topic		Replies	Views
Fitting a GP model with subject-specific GP terms in brms brms gaussian-process	4	1522	May 21, 2021
Brms: one shared gp for temporally correlated errors in each of 250+ timeseries brms gaussian-process	18	2205	August 3, 2018
Auto-grouping of latent variable in Gaussian process brms gaussian-process	4	770	May 5, 2020
Evaluating a brms model with a Gaussian Process brms techniques , ecology , gaussian-process , visualisation	5	1434	June 14, 2023
Hilbert space Gaussian process for multiple time series Modeling techniques , specification , gaussian-process	10	341	November 14, 2024

Auto-grouping option in brms' gp()

Related topics