Time Gradients in the ODE solver

betanalpha · September 15, 2017, 10:57pm

In preparation for a course I came across a model that required that the points where the ODE is evaluated be dependent on parameters, necessitating their gradients. Currently we require that the ODE time points be data, but it’s straightforward to promote them to parameters.

The corresponding Jacobian terms are simply

dz_{i} (t) / dt |{t = T} = dz{i}/dt(t = T, theta, z(t = T)).

In other words, the derivative of the states with respect to a given time point is just ODE RHS evaluated at that time point.

Operationally this would require

promoting the time input from vector<double> to vector<T>
using meta programs to grab the value of the times where they are currently used in the code
passing the vector of ODE RHS values at each time point (n_times * n_states values) into decouple_states so that they can be included in the precomputed_gradients call

Does this all sounds good? Charles and I would like to get this in within the next week so that we can use it as a demonstration in the aforementioned course so prompt comments are appreciated.

wds15 · September 16, 2017, 3:49pm

Could well work… but shouldn’t it be possible to work around it with what we have by multiplying the ODE with

dI/dt

on both sides? So you would have to multiply the ODE RHS with the (total) derivative of you function I and then you are done. Maybe this is nonsense.

However, what certainly worries me is how the ODE integrator reacts to this. Now the time differential changes through that function. If you make it part of the ODE RHS, then all should be fine, but if you put it like that then I am not sure if the integrator is able to figure out good step sizes. Of course, you can try.

… and last: It’s going to be a hell to implement this for all the 9 cases we then end up having …

betanalpha · September 16, 2017, 10:45pm

No, you cannot multiply by a Jacobian as the transformation would also involve evaluating the states at a nonlocal time.

But more importantly there are no additional states that need to be integrated or anything like that. The additional sensitivities for variations in t_{i} are literally just the evaluation of the ODE system at z(t_{i}). This includes the initial time.

Bob_Carpenter · September 17, 2017, 10:27pm

I’ll defer to the ODE experts about whether it’s worthwhile.

The basic code design sounds good.

billg · September 21, 2017, 1:56pm

This is definitely worthwhile. With pharmacometrics applications we regularly encounter situations where some of the times at which we evaluate the ODE solution are parameters.

Also the proposed approach is more attractive than one we were thinking of implementing in Torsten. It would use a change of variable like one I believe Sebastian proposed in the past, e.g., u = (t - t_i) / (t_{i+1} - t_i) where t_i and t_{i+1} are the lower and upper bounds of an integration interval, i.e., times at which ODE solutions are desired. The integration intervals are then fixed at (0, 1) and the t_i’s have to be passed to the ODE as part of the parameter vector. Not pretty…

yizhang · January 15, 2018, 5:10pm

Hi @betanalpha, is this feature still under development? I was checking the integrate_ode_rk45’s signature in develop branch and notice that “ts” is still data.

betanalpha · January 15, 2018, 7:10pm

Yes, due to other priorities an implementation is not on the immediate horizon.

Bob_Carpenter · January 16, 2018, 8:50pm

No reason we can’t generalize if you know how to calculate the derivatives and are willing to crack open the C++ code.

I don’t think anyone else has immediate plans to work on time derivatives, so I don’t think it’ll need to be coordinated with other work on the ODE solvers.

yizhang · January 16, 2018, 8:52pm

Yeah I’m looking into it. But before that I’ll need fulfill request on Torsten and just use the change-of-variable approach @billg mentioned.

betanalpha · January 17, 2018, 1:49am

Again, the math here is trivial. The challenge is adding more gradient components to the ODE code which is becoming increasingly more convoluted.

Topic		Replies	Views
Observation time in ODE as parameters Developers	3	522	May 16, 2018
Different chains, different time Modeling	11	856	March 2, 2022
Speed of forward mode vs reverse mode Developers	12	1158	October 22, 2018
Adjoint sensitivities Algorithms	25	4534	July 23, 2019
Best coding practices and examples for inference conditional on multidimensional ODE observations? Modeling	12	1207	December 23, 2021

Time Gradients in the ODE solver

Related Topics