Hi,
May I ask that could Stan use splines in the modeling process (eg. use spline functions of x to model continuous y)? If so, do we have some examples? For example, how should we assign priors to spline function?
Thx!
Hi,
May I ask that could Stan use splines in the modeling process (eg. use spline functions of x to model continuous y)? If so, do we have some examples? For example, how should we assign priors to spline function?
Thx!
Yup, for sure. Here’s a tutorial on using splines in brms
. That tutorial doesn’t go into detail on the default priors used, but remember that when doing splines, the thing you’re estimating is the weight (or contribution) of each spline as they are summed to produce the final wiggly curve. Rarely does one have much prior info on the contribution of a single given spline, so a priors assigning each weight centered on zero and ranging to cover the variability observed in the data is probably a good default (ex normal(0,1)
if the data are normalized to mean=0 and sd=1).
If you want to play with splines in Stan directly, it’s pretty easy. Here’s code for a simple model where it expects you to compute the splines elsewhere and pass them in as data:
data{
int n ;
int num_splines ;
vector[n] y ;
matrix[n,num_splines] x ;
}
parameters{
real<lower=0> noise ;
real intercept ;
row_vector[num_splines] weights ;
}
transformed parameters{
vector[n] f ;
for(i in 1:n){
f[i] = intercept + sum( weights .* x[i,] ) ;
}
}
model{
noise ~ std_normal() ;
intercept ~ std_normal() ;
weights ~ std_normal() ;
y ~ normal(f,noise) ;
}
Thanks so much!
If I would like to use B-splines in the model, it is possible for me to use it for prediction? (For example, I am modeling y on continuous x ranging within [0,100], is it possible for the spline model to predict the value of y when x falls into the range [100,200]?
I know very little about splines in Stan (or in general), but Gavin Simpson has a very good blog post about extrapolating beyond the range of x to make predictions that might be useful - https://fromthebottomoftheheap.net/2020/06/03/extrapolating-with-gams/
GAMs are great, but I would be extremely cautious about using smooths to predict anything beyond the range of the data. Re: the link, I wouldn’t bury the lede:
However, in none of the fits do we get behaviour that get close to fitting the test observations beyond the training of x in the training data, even when using a Gaussian process that supposedly matches at least the general form of the true function.
I echo what others have noted here whereby you have to be really careful when extrapolating from any model . For smoothing models specfically, different choices for the spline basis will yield very different extrapolation behaviour. Some will gradually revert to the mean of the observed data, others will continue any trends/periodicities that are observed in the data, etc.
Once you’ve chosen a basis, it’s easy to get the predictions, just extend the basis matrix:
data{
int n_obs ;
int n_tot ; \\observed plus predicted
int num_splines ;
vector[n_obs] y ;
matrix[n_tot,num_splines] x ;
}
parameters{
real<lower=0> noise ;
real intercept ;
row_vector[num_splines] weights ;
}
transformed parameters{
vector[n_tot] f ;
for(i in 1:n_tot){
f[i] = intercept + sum( weights .* x[i,] ) ;
}
}
model{
noise ~ std_normal() ;
intercept ~ std_normal() ;
weights ~ std_normal() ;
y ~ normal(f[1:n_obs],noise) ;
}
Thx! So do you mean that generally extrapolating using splines is not recommended (eg. predicting future trend)
Since splines can behave very unpredictably outside the range of the data, you really need to be careful.
As Suvorov said, “The bullet is a mad thing. Only the bayonet knows what it is about.”
No, there’s no outright proscription against extrapolation, you just need to engage extrapolation very mindfully, including knowing the expected extrapolation behaviour of your tools. To achieve that knowledge, it can be very useful to play with simulated data where you can establish the ground truth and observe how well different models recover that truth.
I will say that some tools, for example polynomial smooths, have known terrible extrapolation performance, so with those there is indeed proscription against extrapolation.