Ppca

emiruz · January 25, 2020, 7:05am

Does anyone have some plain code for PPCA implemented in Stan that I could use for a more complicated hierarchical problem? I’m aware of this code, but something with less fancy footwork would be preferable because my problem is complicated already.

emiruz · January 25, 2020, 7:21am

Looks like page 34 of the ADVI paper has some super simple Stan code for PPCA:

Although I’ll give this one a go too:

github.com

pourzanj/TfRotationPca/blob/master/Stan/ppca_optimized.stan

functions {
  
  real mirror_atan2(real theta) {
    
    real eps;
    
    // upper plane
    if(theta > 0.0) {
      eps = theta - pi()/2;
      if(eps <= 0.0)
        return(theta);
      else
        return(-pi()/2 + eps);
    }
    // lower plane
    else {
      eps = theta + pi()/2;
      if(eps >= 0.0)
        return(theta);
      else

This file has been truncated.

bertschi · January 25, 2020, 10:19am

We have code for PPCA, either plain or reparameterized via Householder transformations to remove the unidentifiable rotation of the latent space:
Link.
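
For readers who want the rotation written out: in the standard PPCA marginal likelihood, W enters only through the product W Wᵀ. A short sketch, where R stands for any Q×Q orthogonal matrix (R is notation introduced here, not something taken from the linked code):

```latex
(W R)(W R)^\top = W R R^\top W^\top = W W^\top
\qquad \text{for any } R \text{ with } R R^\top = I_Q
```

So W and WR give the same likelihood, and the data alone cannot tell them apart. A reparameterization such as the Householder one mentioned above is one way to remove that freedom.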

emiruz · January 25, 2020, 1:03pm

Thank you! I actually found this already. Do you think it’d be simple enough for me to extend your code to MPPCA?

emiruz · January 25, 2020, 2:38pm

This variant seems nice, although I understand it won’t be identifiable due to rotational symmetry. It has a massive advantage over the code in the ADVI paper (bottom of this post): it doesn’t have more parameters than data points. The ADVI version does, which makes it near impossible to generate a posterior sample for a moderately large dataset (e.g. 200k rows) without lots of RAM.
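
To spell out why this variant needs no per-row latent variables: under the usual PPCA assumptions the latent vector can be integrated out analytically. In the sketch below, z_n and ε_n are notation for the latent vector and the noise, which do not appear in the Stan code; the other symbols follow the model below.

```latex
\begin{aligned}
y_n &= W z_n + \mu + \varepsilon_n,
  \qquad z_n \sim \mathcal{N}(0, I_Q),
  \quad \varepsilon_n \sim \mathcal{N}(0, \sigma_{\text{noise}}^2 I_D) \\
\mathbb{E}[y_n] &= \mu \\
\operatorname{Cov}[y_n] &= W \operatorname{Cov}[z_n] W^\top + \sigma_{\text{noise}}^2 I_D
  = W W^\top + \sigma_{\text{noise}}^2 I_D \\
y_n &\sim \mathcal{N}\!\left(\mu,\; W W^\top + \sigma_{\text{noise}}^2 I_D\right)
\end{aligned}
```

The model below builds this covariance as K (plus a tiny jitter on the diagonal) and passes its Cholesky factor L to multi_normal_cholesky, so nothing in the parameters block grows with N.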

It’s also pretty straightforward to add a dimension to mu, sigma_noise and W in order to create a mixture of PPCAs although it remains to be seen how well it’ll work.

data{
    int<lower=0> N;
    int<lower=1> D;
    int<lower=1> Q;
    vector[D] Y[N];
}
transformed data{
}
parameters {
    matrix[D, Q] W;
    real<lower=0> sigma_noise;
    vector[D] mu;
}
transformed parameters {
    cholesky_factor_cov[D] L;
    {
        //matrix[D, D] K = W*W'; // tcrossprod(matrix x)
        matrix[D, D] K = tcrossprod(W);
        for (d in 1:D)
            K[d,d] += square(sigma_noise) + 1e-14;
        L = cholesky_decompose(K);
    }
}
model{
    mu ~ normal(0, 10);
    sigma_noise ~ normal(0,1);
    to_vector(W) ~ normal(0,1);
    
    Y ~ multi_normal_cholesky(mu, L);
}
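
The mixture extension mentioned above could look roughly like the following. This is an untested sketch, not a worked-out MPPCA implementation: the component count C, the mixing weights theta, and the local vector lp are names introduced here for illustration, and the priors are simply copied from the single-component model. It keeps the array syntax used in this thread; recent Stan releases expect the `array[N] vector[D] Y;` form instead.

```stan
data {
    int<lower=1> N;
    int<lower=1> D;
    int<lower=1> Q;
    int<lower=1> C;                 // number of mixture components (assumed name)
    vector[D] Y[N];
}
parameters {
    simplex[C] theta;               // mixing weights (assumed name)
    vector[D] mu[C];                // one mean per component
    matrix[D, Q] W[C];              // one loading matrix per component
    vector<lower=0>[C] sigma_noise; // one noise scale per component
}
transformed parameters {
    cholesky_factor_cov[D] L[C];
    for (c in 1:C) {
        // marginal covariance of component c: W W' + sigma^2 I (plus jitter)
        matrix[D, D] K = tcrossprod(W[c]);
        for (d in 1:D)
            K[d, d] += square(sigma_noise[c]) + 1e-14;
        L[c] = cholesky_decompose(K);
    }
}
model {
    for (c in 1:C) {
        mu[c] ~ normal(0, 10);
        to_vector(W[c]) ~ normal(0, 1);
    }
    sigma_noise ~ normal(0, 1);

    // marginalize the discrete component label for each row
    for (n in 1:N) {
        vector[C] lp = log(theta);
        for (c in 1:C)
            lp[c] += multi_normal_cholesky_lpdf(Y[n] | mu[c], L[c]);
        target += log_sum_exp(lp);
    }
}
```

Besides the rotational symmetry within each component, a mixture written this way is also invariant to relabeling the components, so label switching is an additional thing to watch for. The loop evaluates N × C densities per gradient step, which may be slow for 200k rows.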

PPCA from the ADVI paper:

data {
  int <lower=0> N;
  int <lower=0> D;
  int <lower=0> M;
  vector[D] x[N];
}
parameters {
  matrix[M,N] z;
  matrix[D,M] w;
  real <lower=0> sigma;
  vector<lower=0>[M] alpha;
}
model {
  to_vector(z)~normal(0,1);
  for(d in 1:D) w[d]~normal(0,sigma*alpha);
  sigma~lognormal(0,1);
  alpha~inv_gamma(1,1);
  for(n in 1:N) x[n]~normal(w*col(z,n),sigma);
}
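
A rough parameter count makes the RAM concern concrete. The counts below are read off the two parameters blocks in this post; N = 200,000 is the dataset size mentioned above, while D = 10 and M = Q = 3 are hypothetical values picked only for the arithmetic.

```latex
\begin{aligned}
\text{ADVI version } (z, w, \sigma, \alpha):\quad
  & MN + DM + 1 + M = 3 \cdot 200{,}000 + 30 + 1 + 3 = 600{,}034 \\
\text{marginalized version } (W, \sigma_{\text{noise}}, \mu):\quad
  & DQ + 1 + D = 30 + 1 + 10 = 41
\end{aligned}
```

Under these assumed sizes, 1,000 posterior draws of the ADVI version stored as 8-byte doubles would take about 600,034 × 1,000 × 8 bytes ≈ 4.8 GB, almost all of it for z.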
