diagSPD for Matérn GP kernels other than 3/2

Peter.Stewart · October 22, 2024, 10:53am

Hi everyone,

I’ve been fitting some Gaussian Process models using the Hilbert space approximation, adapting code from the tutorials by @avehtari here and here. I would like to fit a model which uses a Matérn 1/2 kernel, but I am not sure how to adapt the diagSPD_Matern32 function to the other Matérn kernels - any assistance with this would be greatly appreciated.

Here is the function in question:

vector diagSPD_Matern32(real alpha, real rho, real L, int M) {
    return 2*alpha * (sqrt(3)/rho)^1.5 * inv((sqrt(3)/rho)^2 + ((pi()/2/L) * linspaced_vector(M, 1, M))^2);
   }

Thanks!

js592 · October 22, 2024, 5:36pm

See discussion here: Practical Hilbert space approximate Bayesian Gaussian processes for probabilistic programming - #5 by js592

potash · October 22, 2024, 6:45pm

FYI this was recently implemented in brms (you may need to install from github if it hasn’t made it into a release yet). You could use a formula like

y ~ gp(x, cov="exponential", k=10)

where k is the number of basis functions. Depending on your needs you could either use the brms model directly or use make_stancode() and copy the relevant functions to your own Stan model.

sbfnk · October 23, 2024, 10:38am

Here’s what we think it is (any review / error spotting welcome, see related PR):

github.com

epiforecasts/EpiNow2/blob/f8228d05d650e4b507dc3519205530de37f2dafc/inst/stan/functions/gaussian_process.stan#L33


      
          
          /**
            * Spectral density for 1/2 Matern (Ornstein-Uhlenbeck) kernel
            *
            * @param alpha Scaling parameter
            * @param rho Length scale parameter
            * @param L Length of the interval
            * @param M Number of basis functions
            * @return A vector of spectral densities
            */
          vector diagSPD_Matern12(real alpha, real rho, real L, int M) {
            vector[M] indices = linspaced_vector(M, 1, M);
            real factor = 2;
            vector[M] denom = rho * ((1 / rho)^2 + (pi() / 2 / L) * indices);
            return alpha * sqrt(factor * inv(denom));
          }
          
          /**
            * Spectral density for 3/2 Matern kernel
            *
            * @param alpha Scaling parameter

Peter.Stewart · October 23, 2024, 3:17pm

That’s great, thanks!

js592 · October 23, 2024, 6:09pm

Do you have a link to the maths vingette?

My implementation looks like:

vector diagSPD_exp(real alpha, real rho, real L, int m) {
    vector[m] lambda = (linspaced_vector(m, 1, m)*pi()/(2*L))^2;
    vector[m] diag = 2*alpha*rho/(1+rho^2*sqrt(lambda)^2);
    diag = diag.^(0.5);
    
    return diag;
}

testing both with alpha = 1, rho = 0.5, L = 1, m = 8 I got:

[0.847367,0.748398,0.677581,0.623686,0.580895,0.545854,0.516474,0.491379] (yours)
vs
[0.786439,0.537029,0.390683,0.303314,0.246773,0.207584,0.178955,0.157177] (mine)

I’m actively using this code in my work so I want to make sure I didn’t make any mistakes in my derivation :)

Edit: I have narrowed down the difference to how the \lambda^2 term is coded. E.g. modifying:

vector diagSPD_Matern12(real alpha, real rho, real L, int M) {
  vector[M] indices = linspaced_vector(M, 1, M);
  real factor = 2;
  vector[M] denom = rho * ((1 / rho)^2 + (pi() / 2 / L)^2 * indices^2);
  return alpha * sqrt(factor * inv(denom));
}

gives identical
[0.786439,0.537029,0.390683,0.303314,0.246773,0.207584,0.178955,0.157177]
(or vice vesa if you change sqrt(lambda)^2 to sqrt(lambda) in the other function)

That being said, I need to dig a bit deeper to see which is correct.

js592 · October 23, 2024, 10:12pm

So, there are a couple of things going on here. The first component in the approximation is the spectral density, which I derived as:
s(\omega) =\frac{2 \rho \alpha}{1+\rho^2 \omega^2} = \frac{2\alpha}{ \frac{1}{\rho}+\rho\omega^2} =\frac{2\alpha}{ \rho((\frac{1}{\rho})^2+\omega^2)}.

The second component is this spectral density evaluated at the square root of the eigenvalues:
s(\sqrt{\lambda}_m) where \lambda_m = (\frac{m \pi}{2L})^2
So, I think the following implementation is correct:
s(\sqrt{\lambda}_m) = \frac{2\alpha}{ \rho((\frac{1}{\rho})^2+(\frac{m \pi}{2L})^2)}

Then, this gets an additional square root when we are using the non-centered linear representation.

sbfnk · October 24, 2024, 11:06am

Yes, I think you’re right (and your maths matches our derivation in Gaussian Process implementation details • EpiNow2) - thanks!

js592 · October 24, 2024, 8:12pm

Happy to help.

andre.pfeuffer · October 25, 2024, 11:40am

I tested the Matern 5/2 kernel some program and it got more fitting difficulties than the Matern 3/2 kernel (longer runtime). The results however are plausible.

Further I suggest to write the code slightly more homogeneous:

vector diagSPD_Matern12(real alpha, real rho, real L, int M) {
...
  return alpha * sqrt(factor * inv(denom));
}

vector diagSPD_Matern32(real alpha, real rho, real L, int M) {
..
  return factor * inv(denom);
}
vector diagSPD_Matern52(real alpha, real rho, real L, int M) {
...
  return alpha * sqrt(factor * inv(denom));
}

I just got confused because we have two times sqrt(inv(denom)) and one time inv(denom).
Nevertheless I wanted to check the math, but couldn’t find the reference. Do you have a link?
Thank you for the work, btw!

sbfnk · October 29, 2024, 2:02pm

This one? [2004.11408] Practical Hilbert space approximate Bayesian Gaussian processes for probabilistic programming

Topic		Replies	Views
Applied Gaussian Processes in Stan, Part 1. A Case Study Modeling	12	2707	November 21, 2019
Practical Hilbert space approximate Bayesian Gaussian processes for probabilistic programming Publicity	20	2320	December 1, 2024
Formulating a hierarchical multivariate Gaussian process using the Hilbert space approximation Modeling cmdstanr , gaussian-process	8	229	February 7, 2025
Hilbert space Gaussian process for multiple time series Modeling techniques , specification , gaussian-process	10	311	November 14, 2024
Making function more efficient? General	4	98	December 4, 2024

diagSPD for Matérn GP kernels other than 3/2

Related topics