New Transform for Orthonormal Matrices in Stan

bgoodri · April 6, 2018, 9:48pm

Also, P has a determinant of 1 by construction, so this excludes the orthogonal matrices that have a determinant of -1. If you include those, then you have to deal with the 2^p possible reflections.

Bonnevie · April 8, 2018, 10:12pm

Hm, did you flip a sign somewhere? Shouldn’t it be

|I+S|^{\kappa-p+1}

with \kappa having positive sign?

edit: also odd you get difficult geometry for \kappa=0 since that should be the uniform case, no?

bgoodri · April 8, 2018, 10:31pm

I was trying to do it the way we would do it in Stan if this were implemented as a type, based on the equation before equation (11) in the paper that is a PDF for the skew-symmetric matrix. Since it is in the denominator, the whole exponent gets negated when you take the logarithm.

I think the reason it is weird when kappa = 0 is because there is a multivariate t underlying it with only p degrees of freedom.

Bonnevie · April 8, 2018, 10:45pm

Ah, right, I was looking at the density over the orthogonal matrices, but I agree that the density over s is a more suitable target. But annoying that it is weak at sampling from the uniform distribution in particular. Does the student-t relationship indicate that it is sampling from a degenerate space, or is it only near-degenerate?

bgoodri · April 8, 2018, 11:00pm

I don’t think it is degenerate; it is just heavy tailed. It would probably be okay if the data were even slightly cooperative.

Bonnevie · May 4, 2018, 8:13am

Got around to trying it in a factor model - seems to mix well enough, even with \kappa=0. I do get quite low BFMI, but I suspect this is due to rotational invariance inherent to factor analysis models.

edit:There is also the issue of only needing the first few vectors of the full rotation matrix - this likely introduces additional ridge geometries.

edit: I was wondering whether we could just sample orthogonal vectors sequentially. Sampling one direction x_1, we can construct the orthogonal projection matrix \bar{P}_1 and apply it to the next sampled direction \bar{P}_1\tilde{x}_2. As above, there is the issue that the effective distribution over \tilde{x}_2 will be degenerate, as the part parallel to x_1 will be cancelled out. Any way to reparameterize a degenerate Gaussian?

Bob_Carpenter · May 8, 2018, 6:33pm

You can just do it directly in terms of the semi-definite covariance matrix (though we don’t support that in multi-normal directly, you can code it yourself) if you know you will only be feeding it things meeting the constraints. But ideally, you’d work out the marginals on the free dimensions and then make the rest transformed parameters.

Topic		Replies	Views
Ppca Modeling	4	702	January 25, 2020
"Pairwise" alternative to multivariate normal isn't behaving for the hierarchical case; help? Modeling techniques , specification	13	791	June 10, 2021
Newly Specified Bayesian Hierarchical Model with High Runtime Modeling techniques , specification	17	560	October 12, 2023
Potential bug with (cmd)stan(py)'s optimisation process running a probabilistic PCA General fitting-issues	1	384	March 26, 2023
Integer parameters Modeling	4	2467	July 3, 2017

New Transform for Orthonormal Matrices in Stan

Related Topics