Reverse mode SDE idea

stevebronder · January 10, 2020, 2:40pm

(This is mostly for Stan math devs)

@James_Savage sent me the tweet thread below

https://twitter.com/DavidDuvenaud/status/1215347970159382534?s=19

Var doesnt have any templates so we would need a new type, but could we just store the gaussian noise in a var and keep doing something similar to what we do now?

stevebronder · January 10, 2020, 5:23pm

When we move up to C++17 adding this would be a lot easier because class templates are automatically deduced by the constructor so

template <typename Arith, typename NoiseArith = double>
class var {
using Scalar = Arith;
using Noise = NoiseArith;

// stuff here

explicit var(Arith x) {
// ...
}

explicit var(Arith x, Noise x) {
// ...
}

};
var<double> a(10.0); // fine!
var a(10.0); // fine!
var<double, float> a(10.0 , -1.96); // also fine!
var a(10.0 , -1.96); // also fine!

Gives us full backwards compatability!

Bob_Carpenter · January 23, 2020, 8:54pm

That looks really interestring. I’ve been reading through how autograd works in Dougal Maclaurin’s thesis.

I think this stuff is now getting implemented in JAX. Here’s their Autodiff cookbook

Also, this is very relevant for the kinds of econometric equilibrium models we want to fit.

stevebronder · January 23, 2020, 9:40pm

Neat! I’ll have to give that a read-through

The main bummer is the C++17 wait for this, but once R bumps up we can look at how Python can handle 17 and go from there

betanalpha · January 24, 2020, 5:19pm

I’m not sure what the need for augmenting vars is. The SDE method is basically an adjoint ODE method with implicit marginalization over the diffusion, and it can be implemented in a similar was to our current ODE solvers. On the forward pass the Brownian bridge realization reconstructor is set up and stored, and then on the backwards pass the Jacobian-vector product is computed using a reverse-time SDE solve and the fixed Brownian bridge realization.

avehtari · January 24, 2020, 6:53pm

Oh, it needs to be Jim to get your attention (cf. Adjoint sensitivity method for stochastic differential equations)

I met David Duvenaud before Xmas and he said he would be happy to help to get this in Stan.

although it seems Mike already knows what to do.

stevebronder · January 24, 2020, 8:42pm

LOL sorry about that

betanalpha · January 25, 2020, 2:09am

Coincidentally @charlesm93, @vianeylb, and I have been working through old adjoint methods (and a shiny new, albeit somewhat obscure one). The first step is to expose CVODES adjoint method which will require storing enough of the checkpointing and interpolation during the forward solve to support the reverse solve. Charles has expressed interest in working this out with me.

Once that’s been demonstrated we can then think about this SDE method. The real novelty of the adjoint method is that it defines a well-posed derivative of the SDE solution, allowing us to have SDEs at all let alone with efficient Jacobian-vector products.

Bob_Carpenter · January 28, 2020, 1:34am

For some reason I missed your previous post on this, too.

We want to solve SDEs for the Sloan project for equilibrium in econometric models, so it’d be great if we could get this into Stan.

He was super patient in explaining to me how what I was calling “lazy chaining” related to the adjoint-Jacobian formulation of reverse mode. @bbbales2 then figured out how to code the adjoint-Jacobian products in Stan neatly to encapsulate all stack fiddling.

Topic		Replies	Views
ODE refactor now has a path forward Developers maintenance	45	2023	October 20, 2016
Mixed mode OperandsAndPartials Developers	3	781	May 9, 2017
Fvars Developers	7	1410	May 9, 2017
Metadata methods on the model class for output var type and size information Developers	5	405	July 17, 2019
Help write an autodiff handbook Publicity	19	1920	June 8, 2020

Reverse mode SDE idea

Related topics