Hi there,
I’m new to coding time series models with Stan. I’ve got below R code to generate the data and a Stan model that fits the data. I think it’s all ok, but it would be great if someone who’s worked with time series models before could have a look.
I’m not quite sure whether i’ve done the priors correctly.
R code to generate the data:
Each outcome is drawn from a normal distribution of mean my.mus for that trial (with sd=my.sd).
These means also drift over time, following a normal distribution (with noise my.k).
my.k=19
my.sd=1
my.mu1=50
ntr=20
my.outcomes = array(NA,dim=ntr)
my.mus = array(NA,dim=ntr)
my.mus[1] = my.mu1
for (itr in 1:ntr){
my.outcomes[itr] = round(rnorm(1,my.mus[itr],my.sd));
if (itr<ntr){
my.mus[itr+1] = rnorm(1,my.mus[itr],my.k);
}
}
From looking at the Stan manual, this is not quite an autoregressive model I think as the outcome on the next trial does not depend on the outcome of the last trial, but instead there is a relationship over time between the (unobserved) means
Here is the Stan code:
data {
int<lower=0> ntr;
int outcomes[ntr];
}
parameters {
real mag_mean[ntr];
real<lower=0> mag_sd;
real<lower=0> k;
}
model {
// Priors
mag_mean[1] ~ normal(50,20); // priors for other trials are given in loop (?)
mag_sd ~ normal(0,5);
k ~ normal(0,5);
// Model
for (itr in 1:(ntr)){
outcomes[itr] ~ normal(mag_mean[itr],mag_sd);
if (itr<ntr){
mag_mean[itr+1] ~ normal(mag_mean[itr],k);
}
}
}
Are there multiple observations at each time point? If not, then your priors for the variance parameters are going to be doing a lot of work to render the model identifiable. Have you considered a Gaussian Process model?
Thanks for the quick reply.
I’m looking at the Gaussian Process model chapter section 10.1 in current manual. It seems that these models have y and x? But my model only has observations y, nothing else is observed. Am I misunderstanding these models? (Well possible as I did not understand the text well at all given my lack of knowledge about this)
Why dont you write it as Kalman Filter? This Handook could help you alot. And with the gaussian_dlm_obs_lpdf function is really ease to write your model. I think ctsem package could do it for you. But the right person to ask is @Charles_Driver.
In the SUG, x is analogous to time. There’s a worked-through example of a GP regression (with inference on the latent means, which I suspect is what you’re after) here.
You should also take a look at the lgpr for a possibly more user-friendly interface to GPs.