
Enable adjoint method #3

Merged
merged 5 commits into titu1994:master on Feb 29, 2020

Conversation

@eozd (Contributor) commented on Feb 29, 2020

Fixes #2

This PR proposes a way to make the adjoint method work with TensorFlow's tf.custom_gradient interface. The main changes are in tfdiffeq/adjoint.py and can be summarized as follows:

  1. Don't pass the ODE parameters to the OdeintAdjointMethod function. Instead, we get these parameters from the variables keyword argument of the grad function.
  2. tf.custom_gradient requires the grad function to return two sets of gradients as a pair (see the sketch after this list). These are:
    i. the gradients with respect to the inputs of OdeintAdjointMethod, which are x0 and t in our case;
    ii. the gradients with respect to the parameters, which are the tf.Variable objects stored in our ODE object.
  3. To prevent picking up all the tf.Variable objects created inside the adams method, we mark them as non-trainable. However, there still seems to be an issue with adams (see the caveats below).
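
For context, point 2 above is TensorFlow's documented contract for tf.custom_gradient when the decorated forward pass touches tf.Variable objects: the grad function must accept a variables keyword argument and return a pair (gradients of the inputs, gradients of the variables). Below is a minimal sketch of that pattern, with a toy forward computation standing in for the ODE solve and purely illustrative names; it is not the actual tfdiffeq code.

```python
import tensorflow as tf

# Illustrative stand-in for the ODE's trainable parameters.
w = tf.Variable(2.0, name="ode_param")

@tf.custom_gradient
def odeint_adjoint_method(x0, t):
    # Toy forward computation standing in for the ODE solve.
    y = w * x0 * t

    def grad(dy, variables=None):
        # First element: gradients w.r.t. the explicit inputs (x0 and t).
        grad_inputs = [dy * w * t, dy * w * x0]
        # Second element: gradients w.r.t. the captured tf.Variable objects,
        # returned in the same order as `variables`.
        grad_vars = [dy * x0 * t]
        return grad_inputs, grad_vars

    return y, grad

x0 = tf.constant(3.0)
t = tf.constant(0.5)
with tf.GradientTape() as tape:
    tape.watch([x0, t])
    y = odeint_adjoint_method(x0, t)

# Gradients for the inputs and the captured variable all flow through `grad`.
print(tape.gradient(y, [x0, t, w]))
```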

Caveats: I wasn't able to make the method work with the adams method (therefore the adams-adjoint test is not enabled either). The problem is that the elements of the tuple returned from the augmented_dynamics function have different shapes, and this causes problems at adams.py:138.

eozd added 5 commits February 29, 2020 10:42
The tensorflow custom_gradient decorator requires the grad function to return the gradients as a pair. The first element should contain the gradients of all the inputs passed to the function (in our case, x0 and t). The second element must contain the gradients for the model parameters, which are the tf.Variable objects stored in the ODE object. We don't use these parameters in the function interface; instead, TensorFlow passes all the trainable parameters related to our method in the variables keyword argument.

The adams method crashes for an unknown reason; maybe this can be fixed in a later revision.
@titu1994 (Owner)

This solution is ingenious! I completely missed that I can recover the parameters from variables. Thank you so very much for your help with this.

As for the Adams-Bashforth implementation, there seem to be certain issues with the current version, which I am closely following in the PyTorch discussions.

As the dopri tests pass, I will be glad to merge this PR upon your go-ahead.

@eozd (Contributor, Author) commented on Feb 29, 2020

If the idea looks good to you, then by all means please go ahead. By the way, I would also like to thank you for the original implementation. As I will be working with tfdiffeq in the immediate future, I will make sure to post any issues I find with the changes I introduced.

@titu1994 titu1994 merged commit f0b4550 into titu1994:master Feb 29, 2020
@titu1994 (Owner)

Merged! I advise wrapping the callable portion of the ODE function, call(u, t), inside a tf.function block to see some noticeable speedups; a sketch follows below. There are some performance bottlenecks I'd like to look into, and hopefully I'll implement the universal ordinary differential equations paper in the future, if I ever get around to parsing the Julia codebase.
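
A minimal sketch of that suggestion, assuming a user-defined ODE module with a call(u, t) method; the class name, matrix, and values are illustrative and not part of tfdiffeq.

```python
import tensorflow as tf

class LinearODE(tf.Module):
    """Illustrative ODE right-hand side du/dt = u @ A."""

    def __init__(self):
        super().__init__()
        self.A = tf.Variable([[-0.1, 2.0], [-2.0, -0.1]], dtype=tf.float32)

    @tf.function  # compiles the callable portion into a graph for repeated solver calls
    def call(self, u, t):
        # t is unused in this autonomous example but kept to match the
        # call(u, t) interface mentioned above.
        return tf.matmul(u, self.A)

ode = LinearODE()
u0 = tf.constant([[1.0, 0.0]], dtype=tf.float32)
# The first call traces the graph; subsequent calls reuse the compiled version.
print(ode.call(u0, tf.constant(0.0)))
```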
