Modeling time dynamics with GP

nerpa · April 2, 2020, 5:00pm

Dear forum,
I am still quite new to Stan and I am trying to model time dynamics in longitudinal data using Gaussian Process. In my data, I have N subjects that were sampled T times and for each time point I have P measures. For now I don’t have any missing data (and I know this will be a whole separate issue). I looked at Stan documentation (10.1), but still couldn’t figure out how to specify the model.
Any point to the right direction will be greatly appreciated.
Thank you!

arya · April 2, 2020, 5:53pm

Are you looking for a full Stan file of a GP model to use as a template? Did you check out section 10.3 of the user guide? Looks like section 10.3 of the user guide has a couple of those.

nerpa · April 2, 2020, 5:58pm

I have seen it, thanks! (I have spent many hours on this page) But I am having trouble relating my data to the examples…

mike-lawrence · April 2, 2020, 6:00pm

So what you have is what I call a hierarchical GP scenario where you can do a GP for each of your N subjects and likely want to model said GPs as deviations from a mean-across-subjects GP. Before looking at a full hierarchical model, you should first get a feel for what it would be like to do a GP for a single subject. I have a demo here for that. Then, for the hierarchical case, take a look here.

Note that in those examples I deviate from the Stan User Guide’s recommendation to parameterize the GP in terms of lengthscale, instead using inverse-lengthscale (a.k.a. “volatility”) but it should be straightfoward to re-parameterize if you prefer lengthscale. I should probably do this myself at some point as I understand that more expert folks than I have worked out what priors one needs for lengthscale to make GPs behave well.

nerpa · April 2, 2020, 6:05pm

Thanks! In your gp_regression.stan code, would it be right to say that rows_z_unique could represent time?

mike-lawrence · April 2, 2020, 6:11pm

It’s best if you use the “download zip” button, unpack on your computer, open the gp_regression_example.R file and step through that, as the comments there will explain how the model is being set up. It’s actually structured to accommodate more complicated designs than you’ve described so far where there is some set of conditions in which each subject is measured, and that’s what’s the z matrix business is all about. For your case, you’d just have an intercept-only contrast matrix, generated as:

z = model.matrix(
	data = dat
	, object = ~ 1
)

mike-lawrence · April 2, 2020, 6:11pm

In both models, x would correspond to time in your data.

nerpa · April 2, 2020, 6:13pm

Ty! It’s going to be a long day of code digging. Thanks again.

mike-lawrence · April 2, 2020, 6:20pm

Oh, and you mentioned missing data in your original post. Note that the way I have things set up, it automatically accommodates missing data as it estimates a latent noiseless GP that is then sampled with Gaussian noise; when you’re missing data for a timepoint, you still get updating on it’s latent value thanks to it’s surrounding timepoints.

nerpa · April 2, 2020, 6:24pm

Cool, that’s super helpful! Do you have a paper describing your model that I can refer to?

mike-lawrence · April 2, 2020, 6:40pm

I developed this approach myself while advising my friend, who wrote up his work in a Masters thesis here, and it looks like they also published here, though that seems to be just a conference abstract.

alexpghayes · April 7, 2020, 3:59pm

You might also find https://jtimonen.github.io/lgpr-usage/index.html useful if you don’t want to code things up in Stan yourself

mike-lawrence · April 29, 2020, 5:55pm

Cool, I meant to look at lgpr when it first came out but never found the time. It does hierarchical scenarios? (a mean function with multiple individual deviation functions?)

Topic		Replies	Views
Missing data in GP Modeling specification	8	519	April 9, 2020
Applied Gaussian Processes in Stan, Part 1. A Case Study Modeling	12	2702	November 21, 2019
Time series and Gaussian process Modeling gaussian-process	62	4572	December 20, 2019
Using 2D Gaussian process predictions within model Modeling gaussian-process	11	2436	February 18, 2022
Gaussian Process out-of-sample predictive distribution Modeling gaussian-process	21	2153	December 6, 2024

Modeling time dynamics with GP

Related topics