What is the point of the multi_normal_cholesky parametrization?

Bob_Carpenter · January 14, 2025, 4:33pm

Under all conditions. By starting with a pre-factored covariance matrix, the evaluation is quadratic in dimension rather than cubic. It saves on both the determinant calculation (because Cholesky is triangular, it’s only diagonal) and the quadratic form that involves the inverse of the covariance matrix. The autodiff is similarly sped up because it follows the basic evaluation.

There’s even more advantage within Stan because the way we parameterize a dense covariance matrix is using a Cholesky factor under the hood (N choose 2 unconstrained elements below the diagonal, and N diagonal elements which must be positive, so they’re log transformed). So it saves a lot of work in just creating a well-formed covariance matrix.

We should probably hint that it’s both more numerically stable and more efficient. We go into that fairly early in the User’s Guide in the regression chapter.

Topic		Replies	Views
Cholesky decomposition/speed/divergencies Modeling	11	2938	August 19, 2017
Multi_normal_cholesky is slower than hand coded lpdf Developers	6	828	April 11, 2022
About computation speed difference between multi_normal_lpdf and lpdf matrix operation General specification	2	546	July 30, 2020
Multivariate normal covariance matrix Modeling specification , covariance	5	1087	December 8, 2020
Multi normal pdf with low rank covariance matrix Developers features , math	4	960	February 13, 2020

What is the point of the multi_normal_cholesky parametrization?

Related topics