Gradients for prim/mat distributions

andrjohns · October 4, 2017, 4:13am

How are the gradients/derivatives for the multivariate distributions (prim/mat/prob) calculated in Stan Math?

I can see that the univariate distributions (prim/scal/prob) use operands_and_partials, and the functions have gradients defined in fwd and rev, but I can’t see how it’s done for the mat distributions.

Is there any doc/wiki that I should be looking at?

bbbales2 · October 4, 2017, 2:21pm

If the corresponding functions aren’t in rev or fwd, it probably means the functions are just being autodiffed directly.

If they take advantage of big matrix operations and such (so that the bulk of the internal work goes through operations with custom derivatives), they should be pretty efficient.

andrjohns · October 5, 2017, 12:18pm

Ah that makes sense, thanks!

Bob_Carpenter · October 7, 2017, 3:08am

That’s right.

multi_normal could be made much more efficient with fully analytic derivatives. In particular, the quadratic form with the inverse, (y - mu)' / Sigma * (y - mu), is where analytic derivatives would help. It is at least vectorized so that Sigma is Cholesky factored only once.
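
To make "fully analytic derivatives" concrete, here is a minimal sketch in plain C++. It is not the Stan Math implementation and does not use its API; every name in it (cholesky, solve_lower, solve_lower_transposed, multi_normal_lp_and_grad, LpAndGrad) is hypothetical. It assumes Sigma is symmetric positive definite and relies on the standard identity that the gradient of the log density with respect to mu is inverse(Sigma) * (y - mu), with the gradient with respect to y being its negative. With Sigma = L * L', both the quadratic form and that gradient come out of two triangular solves, so no explicit inverse is formed.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

using Vec = std::vector<double>;
using Mat = std::vector<Vec>;  // dense, row-major

// Lower-triangular Cholesky factor L with Sigma = L * L'.
// Assumes Sigma is symmetric positive definite (not checked here).
Mat cholesky(const Mat& Sigma) {
  const std::size_t k = Sigma.size();
  Mat L(k, Vec(k, 0.0));
  for (std::size_t i = 0; i < k; ++i) {
    for (std::size_t j = 0; j <= i; ++j) {
      double s = Sigma[i][j];
      for (std::size_t m = 0; m < j; ++m) s -= L[i][m] * L[j][m];
      L[i][j] = (i == j) ? std::sqrt(s) : s / L[j][j];
    }
  }
  return L;
}

// Forward substitution: solve L * z = b.
Vec solve_lower(const Mat& L, const Vec& b) {
  const std::size_t k = b.size();
  Vec z(k);
  for (std::size_t i = 0; i < k; ++i) {
    double s = b[i];
    for (std::size_t m = 0; m < i; ++m) s -= L[i][m] * z[m];
    z[i] = s / L[i][i];
  }
  return z;
}

// Back substitution: solve L' * a = z.
Vec solve_lower_transposed(const Mat& L, const Vec& z) {
  const std::size_t k = z.size();
  Vec a(k);
  for (std::size_t i = k; i-- > 0;) {
    double s = z[i];
    for (std::size_t m = i + 1; m < k; ++m) s -= L[m][i] * a[m];
    a[i] = s / L[i][i];
  }
  return a;
}

struct LpAndGrad {
  double lp;    // log density of y under Normal(mu, L * L')
  Vec grad_mu;  // d lp / d mu = inverse(Sigma) * (y - mu); d lp / d y is the negative
};

// Takes the Cholesky factor L so one factorization can be reused across many y.
LpAndGrad multi_normal_lp_and_grad(const Vec& y, const Vec& mu, const Mat& L) {
  assert(y.size() == mu.size() && y.size() == L.size());
  const std::size_t k = y.size();
  const double log_two_pi = std::log(2.0 * std::acos(-1.0));

  Vec diff(k);
  for (std::size_t i = 0; i < k; ++i) diff[i] = y[i] - mu[i];

  // z = inverse(L) * (y - mu), so the quadratic form is z' * z.
  const Vec z = solve_lower(L, diff);
  double quad = 0.0;
  double log_det = 0.0;  // log|Sigma| = 2 * sum(log(diag(L)))
  for (std::size_t i = 0; i < k; ++i) {
    quad += z[i] * z[i];
    log_det += 2.0 * std::log(L[i][i]);
  }

  LpAndGrad out;
  out.lp = -0.5 * (static_cast<double>(k) * log_two_pi + log_det + quad);
  // inverse(L') * z = inverse(Sigma) * (y - mu), which is the gradient w.r.t. mu.
  out.grad_mu = solve_lower_transposed(L, z);
  return out;
}
```

Usage would be to call cholesky(Sigma) once and then call multi_normal_lp_and_grad(y, mu, L) for each observation, which mirrors the "factored only once" point above. The sketch only covers the gradient with respect to mu (and, by sign flip, y); the gradient with respect to Sigma is left out.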
