Averaging generated covariance matrices

Ludwig · August 13, 2021, 10:58am

Hello all, I am running a code in which one of the parameters is a covariance matrix, Stan gives the generated samples in the form of an array with dimensions [n,p,q], where n number of generated observations and p=q is the dimension of the covariance matrix. Using the extract function

out_par<-extract(out)
covm<-out_par$Sigma

Now I need to obtain the average covariance matrix, as the posterior estimate of the Sigma, I am not sure how to do it, I could average every single cell of the matrix, but I don’t think that’s efficient. thanks in advance.

mike-lawrence · August 13, 2021, 11:41am

To what purpose? If as an intermediate step in the computation of another quantity, best to actually compute said quantity in each sample from the posterior rather than trying to average the posterior then computing the quantity.

Ludwig · August 13, 2021, 11:45am

The purpose/interest is covariance matrix itself, since I will use it to compute the correlation matrix, which is simple matrix operation of the covariance matrix.

mike-lawrence · August 13, 2021, 11:48am

Right, so you should compute the correlation matrix in each sample of the posterior, giving you n correlation matrices that you can then summarize however you’d like (ex. looking at the posterior for a given cell).

Ludwig · August 13, 2021, 12:01pm

Hi Mike, isn’t easier to just compute the correlation matrix afterwards, instead of adding an extra line to the code? The code is simulating a covariance matrix \Sigma=(\sigma_{ij}) (i,j=1,\cdots,p) which the prior is a Wishart, if I have 2000 mcmc steps (excluding the burn-in’s) I will have 2000 matrices, I just want an average of them, then I can compute the correlation matrix C=D^{-1/2}\Sigma D^{-1/2}, where D=diag(\Sigma) and D^{1/2}=(1/\sigma_{ii}).

mike-lawrence · August 13, 2021, 12:12pm

Given parameter \theta and transform f(\theta), it is only under select circumstances that f(E(\theta)) (function applied to the mean posterior) equals E(f(\theta)) (mean of the function as applied to each sample of the posterior, the thing we’re typically interested in for inference). Instead of keeping track of when those circumstances hold (I’m not very math-y), I think it’s a good default to always compute the latter rather than the former.

Ludwig · August 13, 2021, 12:47pm

Right, however I still have to average the 2000 generated correlation matrices, I am not sure how to do it? cheers.

mike-lawrence · August 13, 2021, 1:06pm

Once you have the correlation matrices, what you do next again depends on what you want to ultimately achieve. If you are interested in a given element of the matrix, you can look at it’s full posterior or summaries (mean, quantiles, etc) thereof. If you want to use that correlation matrix as an input to another computation (ex. get the eigenvalues for factor analysis, etc), then you perform that computation in each sample from the posterior, yielding one or more new quantities that you now have a full/proper posterior for.

Ludwig · August 17, 2021, 4:35pm

Hi, You’re right we should include the correlation matrix in the model, but I am not sure how to tell that to Stan, the model is like

 model {
 mu ~ normal(0,100);
  for(i in 1:N){
   X[i,] ~ multi_normal(mu,Sigma);
  }
    Sigma ~ inv_wishart(k,R);
 }

k and R are given.

The correlation matrix C can be obtained by C=D^{-1/2}\Sigma D^{-1/2}, where D=diag(\Sigma) and D^{1/2}=(1/\sigma_{ii}). But I am not sure how to code it, Stan does not seem to have a function to extract the diagonal of a matrix. I would appreciate any help. thanks.

Charles_Driver · August 17, 2021, 5:02pm

Here is a function I coded years ago that I believe works. Don’t ask me what I was thinking re the naming.

 matrix cov2cor(matrix V){ 
   int p = rows(V);
   vector[p] Is = sqrt(1.0 ./ diagonal(V));
   matrix[p,p] Ism;
   matrix[p,p] r;
   for(i in 1:p) Ism[i,] = Is';
    r =  Ism .* V .* Ism';
   for(i in 1:p) r[i,i]=1;
   return r;
 }

spinkney · August 18, 2021, 10:34pm

Same thing but faster

 matrix cov2cor(matrix V) {
   int p = rows(V);
   vector[p] Is = inv_sqrt(diagonal(V));
   
   return quad_form_diag(V, Is);
 }

Ludwig · August 19, 2021, 10:42am

Thanks you all, it worked fine.

Topic		Replies	Views
Estimating good mass matrices Developers	7	807	October 29, 2020
Generated MVN not what I expect General	11	690	December 10, 2018
Efficient Coding of Large-Dimensional Generated Quantities (Export only means?) Algorithms rstan , posterior-predictive	5	660	July 22, 2020
Computing autocorrelation/autocovariance in Stan Modeling	7	957	November 17, 2020
Extract Variational Posterior estimates Modeling	12	1942	May 21, 2020

Averaging generated covariance matrices

Related topics