Non-centered parameterization of the multivariate Student-t distribution

In the multivariate normal case, we have

mu + L * z

where mu is the mean vector, L is the Cholesky factor of the covariance matrix, and z is a vector of independent standard normal variables. My question is how to correctly incorporate the degrees-of-freedom parameter nu into the above formula to create multivariate-t random variables. I remember having seen this somewhere already, but I can't find it right now…

https://groups.google.com/d/msg/stan-users/nk03n8hX4Us/-pTN5IGjCQAJ

One may optimize it by using the inv_chi_square distribution instead, which turns the division into a multiplication (see the sketch after the re-posted code below).


Thanks, that’s what I was looking for! :-)

Hello @paul.buerkner @andre.pfeuffer,

The parameterization mu + L * z you mentioned for the multivariate normal is valid when the covariance matrix is known (defined in the data block).

Is there any difference when the covariance matrix is unknown (defined in the parameters block)?

Thanks a lot.

It’s still valid.

I think it’s a bit more complicated in my case. I’ll create a new topic.

Could somebody re-post the solution at the end of this thread?

The link to the Google group seems to no longer be valid.

data {
  int<lower=1> p;
  vector[p] mu;
  cholesky_factor_cov[p,p] L;
  real<lower=0> nu;
}
parameters {
  vector[p] z;
  real<lower=0> u;
}
transformed parameters {
  vector[p] x;
  x = mu + sqrt(nu / u) * (L * z); // distributed multi_student_t
}
model {
  target += normal_lpdf(z | 0, 1);
  target += chi_square_lpdf(u | nu);
}
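
For reference, here is a minimal sketch of the inv_chi_square optimization mentioned earlier in the thread: if u ~ inv_chi_square(nu), then 1/u ~ chi_square(nu), so the division inside the square root becomes a multiplication. This is my reading of that suggestion, not tested code.

data {
  int<lower=1> p;
  vector[p] mu;
  cholesky_factor_cov[p, p] L;
  real<lower=0> nu;
}
parameters {
  vector[p] z;
  real<lower=0> u;
}
transformed parameters {
  vector[p] x;
  // sqrt(nu * u) with u ~ inv_chi_square(nu) matches sqrt(nu / u) with u ~ chi_square(nu)
  x = mu + sqrt(nu * u) * (L * z);
}
model {
  target += normal_lpdf(z | 0, 1);
  target += inv_chi_square_lpdf(u | nu);
}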

To consider nu as a parameter rather than data, you can also use this function (derivation below):

real multivariate_t_scale_lpdf(vector x, real nu) {
  // log density of the scale factor y = sqrt(nu / u) with u ~ chi_square(nu),
  // applied elementwise to the vector x
  int N = size(x);
  vector[N] xx = nu ./ square(x);
  real lp = 0.0;
  for (n in 1:N) {
    lp += log(2) + log(nu) + chi_square_lpdf(xx[n] | nu) - 3 * log(x[n]);
  }
  return lp;
}
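
Here is a rough sketch of how this function might be dropped into the non-centered parameterization above, with nu as a parameter and one scale factor per multivariate-t draw. The names M and tau and the gamma prior on nu are my own illustrative choices, not something from the thread, and this is untested.

functions {
  real multivariate_t_scale_lpdf(vector x, real nu) {
    int N = size(x);
    vector[N] xx = nu ./ square(x);
    real lp = 0.0;
    for (n in 1:N) {
      lp += log(2) + log(nu) + chi_square_lpdf(xx[n] | nu) - 3 * log(x[n]);
    }
    return lp;
  }
}
data {
  int<lower=1> p;
  int<lower=1> M;              // number of multivariate-t draws
  vector[p] mu;
  cholesky_factor_cov[p, p] L;
}
parameters {
  real<lower=0> nu;
  array[M] vector[p] z;
  vector<lower=0>[M] tau;      // one scale factor per draw
}
transformed parameters {
  array[M] vector[p] x;
  for (m in 1:M) {
    x[m] = mu + tau[m] * (L * z[m]);  // each x[m] ~ multi_student_t(nu, mu, L * L')
  }
}
model {
  for (m in 1:M) {
    target += normal_lpdf(z[m] | 0, 1);
  }
  target += multivariate_t_scale_lpdf(tau | nu);
  target += gamma_lpdf(nu | 2, 0.1);  // example prior on nu, purely illustrative
}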

From some tinkering, I found that the quantile function for the outcome of interest, the scale factor y, is

y = \sqrt{\frac{\nu}{\text{Q-Chi-Square}(1-p, \nu)}}

library(tidyverse)

# simulate outcome w/df = 7
nu <- 7
raw <- sqrt(nu/rchisq(1e4, nu))

# quantiles for checking simulated results against empirical
sim <- 
  tibble(x = seq(from = 0, to = 1, length.out = 100),
         q = quantile(raw, probs = seq(from = 0, to = 1, length.out = 100)))

# check the quantile function
tibble(x = seq(from = 0, to = 1, length.out = 1000)) %>%
  mutate(q = sqrt(nu/qchisq(1 - x, nu))) %>%
  ggplot(aes(x = x,
             y = q)) + 
  geom_line() +
  geom_point(data = sim,
             color = "royalblue") +
  theme_minimal()

Created on 2024-05-06 with reprex v2.1.0

Some rearrangement to get p on its own:

\begin{align*} \text{Q-Chi-Square}(1-p, \nu) &= \frac{\nu}{y^2} \end{align*}

Setting a=\frac{\nu}{y^2} and recalling that the quantile and cumulative distribution functions are inverses:

\begin{align*} \text{CDF-Chi-Square}(a, \nu) &= 1-p \\ p &= 1 - \text{CDF-Chi-Square}(a, \nu) \end{align*}

The pdf is the derivative of the cdf with respect to y, so by the chain rule you get the following:

\begin{align*} d &= -\text{PDF-Chi-Square}(a, \nu) \times a' \\ d &= -\text{PDF-Chi-Square}\left(\frac{\nu}{y^2}, \nu \right) \times \frac{-2\nu}{y^3} \\ d &= 2 \times \text{PDF-Chi-Square}\left(\frac{\nu}{y^2}, \nu \right) \times \frac{\nu}{y^3} \end{align*}
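
For what it's worth, taking the log of that density recovers exactly the terms in the multivariate_t_scale_lpdf function above:

\begin{align*} \log d &= \log 2 + \log \nu + \log \text{PDF-Chi-Square}\left(\frac{\nu}{y^2}, \nu \right) - 3 \log y \end{align*}

which corresponds to log(2) + log(nu) + chi_square_lpdf(xx[n] | nu) - 3*log(x[n]).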

The derived density function checks out against the simulated density:

library(tidyverse)

# simulate outcome w/df = 7
nu <- 7
raw <- sqrt(nu/rchisq(1e4, nu))

tibble(x = seq(from = 0, to = 6, length.out = 1000)) %>%
  mutate(d = dchisq(nu/(x^2), nu) * (2*nu)/(x^3)) %>%
  ggplot(aes(x = x)) + 
  geom_line(aes(y = d)) +
  geom_density(data = tibble(x = raw),
               color = "royalblue") +
  theme_minimal()

Created on 2024-05-06 with reprex v2.1.0