Usage of tau in B-splines model

Loni92 · August 1, 2023, 9:53am

Hi all,

I have what’s probably a rather basic question. I was reading this article on B-splines regression
https://mc-stan.org/users/documentation/case-studies/splines_in_stan.html

which contains the following model:

data { 
  int num_data; 
  int num_basis; 
  vector[num_data] Y; 
  vector[num_data] X; 
  matrix[num_basis, num_data] B; 
} 
 
parameters { 
  row_vector[num_basis] a_raw; 
  real a0; 
  real<lower=0> sigma; 
  real<lower=0> tau; 
} 
 
transformed parameters { 
  row_vector[num_basis] a; 
  vector[num_data] Y_hat; 
  a = a_raw*tau;  
  Y_hat = a0*X + to_vector(a*B); 
} 
 
model { 
  a_raw ~ normal(0, 1); 
  tau ~ cauchy(0, 1); 
  sigma ~ cauchy(0, 1); 
  Y ~ normal(Y_hat, sigma); 
}

I am wondering about the tau parameter, is it equivalent to 1/\sigma^2 here and if so, why not use sigma? Or is this some naming convention that is just not obvious to me?

Thanks in advance!

dankehila · August 1, 2023, 5:56pm

While you are correct that the Greek letter \tau is also used as the precision coefficient instead of standard deviation in some models, especially models based on the old JAGS parametrization, in this case \tau is the variance of the spline coefficients (like slopes for each basis) and is parametrized like a standard deviation term and not like a precision term.

To get better intuition, you can technically parametrize using a directly rather than a_raw and tau: you would just have

parameters{
 row_vector[num_basis] a;
...
}
transformed parameters{
...
  vector[num_data] Y_hat; 
  Y_hat = a0*X + to_vector(a*B); 
}
model {
...
a ~ normal(0,tau);
}

But I believe using a_raw and tau may be a more efficient way to code the model, though I would check for myself that they both give the same result for simple simulated or real datasets. The a_raw, tau looks to me like what’s called a non-centered parametrization, which can have advantages, for example allowing you to use the computationally-efficient std_normal() prior for a_raw, and it is possible too that a_raw scales better for more complex models with multiple features.

js592 · August 1, 2023, 6:07pm

Tau in the B-splines model is part of the random walk prior, so I think it might be parameterized that way in the earlier examples in the case study to connect to the latter smoothing. It ensures that the coefficients of consecutive spline functions are relatively close to each other, enforcing smoothness.

a[i] = a[i-1] + a_raw[i]*tau;

Loni92 · August 2, 2023, 7:34am

Ok, thanks for the explanation!

Loni92 · August 2, 2023, 7:36am

thanks, will have a look at the earlier examples

Topic		Replies	Views
Splines in stan case study bug? General bug	2	847	October 24, 2017
Priors for splines brms prior-choice , priors , splines	2	2278	February 25, 2021
Common hyperparameter for splines by categories brms specification	7	797	January 21, 2021
Random walk prior Modeling specification	19	7014	September 4, 2020
Mathemathical notation for a brms model with a smooth term brms notation	8	1140	July 25, 2023

Usage of tau in B-splines model

Related topics