Efficiency improvement over inverse cdf sampling for simulated multivariate normal draws

shoshievass · June 28, 2018, 6:14pm

As part of a larger model, @yjin and I are trying to use simulated random draws from a multivariate normal distribution. The reason for this is to account for individual heterogeneity across observations without trying to identify the individual errors themselves (which appears infeasible in our context).

The crux of the idea is to draw NS [independent uniform] random draws once and then transform them into multivariate draws inside the model, given the mean and correlation parameters (which we are trying to estimate), as in with a copula (or ‘inverse cdf sampling’) method. Sample Stan code is below. The current implementation is fairly slow, however, and so we’re wondering if there’s an efficiency gain to be made somewhere. @Bob_Carpenter or @bgoodri - perhaps you’d know?

data{
  int I;
  int L;
}

transformed data{

  matrix[I,L] unif_1;
  matrix[I,L] unif_2;

  for(i in 1:I){
    for(l in 1:L){
      unif_1[i,l] = uniform_rng(0, 1);
      unif_2[i,l] = uniform_rng(0, 1);
    }
  }
}

parameters{
  cholesky_factor_corr[2] Sigma_corr_chol;
}

model{
  matrix[I, 2] epsilons;

  Sigma_corr_chol ~ lkj_corr_cholesky(1);

     for(l in 1:L){
      epsilons = append_col(inv_Phi(unif_1[,l]), inv_Phi(unif_2[,l])) * Sigma_corr_chol; //this needs L>10K - slice sampling?
     }

     #...[stuff with epsilons down the line]
}

bgoodri · June 28, 2018, 6:26pm

append_col does a copy that you might be able to avoid if epsilons is an array of row_vectors.

betanalpha · July 2, 2018, 8:37am

Why not just use the non-centered generation of Gaussian variates? Taking \vec{z} \sim \mathcal{N}(0, 1) with \vec{x} = \vec{\mu} + L \cdot \vec{z} where \Sigma = L \cdot L^{T} implies \vec{x} \sim \mathcal{N} (\vec{\mu}, \Sigma).

shoshievass · July 2, 2018, 11:17am

Yes, of course!! Thanks, @betanalpha!!

Topic		Replies	Views
Multivariate normal CDF General multivariate-normal	4	1878	April 7, 2021
A general approach to reparameterization General	2	1158	October 2, 2017
Random draws of data vector Modeling	4	414	March 10, 2023
Gaussian copula for discrete marginals Modeling	32	6472	April 13, 2022
Efficiently sample a collection of multi-normal variables with varying sigma (covariance) matrix Modeling	2	958	May 24, 2018

Efficiency improvement over inverse cdf sampling for simulated multivariate normal draws

Related topics