Vari class, operands_and_partials, or adj_jac_apply

charlesm93 · January 17, 2020, 8:24pm

I’m implementing a new lpdf function for HMMs, with discrete latent states, which uses the adjoint method @betanalpha and I have been working on. The signature is

template <T_omega, T_Gamma, T_rho>
hmm_marginal_lpdf(const Eigen::Matrix<T_omega, -1, -1>& log_omegas,
                  const Eigen::Matrix<T_Gamma, -1, -1>& Gamma,
                  const Eigen::Matrix<T_rho, -1, 1>& rho,
                  int n_states) {...}

We derived an adjoint method to get sensitivities for log_omega, Gamma, and rho. Currently, I wrote a custom vari class.

benefit: I can calculate the derivatives when constructing the vari object, and then only store these derivatives (as opposed to the matrices required to construct these derivatives) before calling chain(). So the forward pass only stores the log density and the derivatives.

drawback: I don’t need sensitivities for all three, and I’m not quite sure how adequately template the vari class / I don’t want to write 8 vari classes.

Browsing the forum there seems to be two other options: operands_and_partials and Adj_jac_apply. The first one gives me the requisite templating, but forces me to do all the calculations during the chain call (though it seems like there should be a way around this). The second method seems fine but only for rev mode.

Do I have a reasonable assessment of the situation?

charlesm93 · January 17, 2020, 9:29pm

After further inspection, all three methods can achieve what I’m after.

Note operands_and_partials doesn’t give you control on when the derivatives, but computes them during the forward pass. This is fine, since this is what I intend to do.

Bob_Carpenter · January 23, 2020, 8:42pm

The n_states argument’s redundant given that you have Gamma and rho.

When there’s a reduction to a single scalar output, the Jacobian is the same size as the number of parameters in the input. So operands_and_partials should be efficient enough.

Topic		Replies	Views
Vectorization in HMM model specification Modeling specification	2	186	February 26, 2024
Adding gradients - operands and partials vs. fwd/rev Developers	4	807	November 29, 2018
Built-in HMM design Developers hmm	7	1131	July 30, 2019
Hidden markov modelling the derivative of a latent function Modeling specification	10	863	April 22, 2021
Marginalizing over latent discrete parameters Modeling	20	1281	November 11, 2021

Vari class, operands_and_partials, or adj_jac_apply

Related topics