Over/underflow in autodiff and hand calculated derivatives

spinkney · September 19, 2021, 12:55pm

When I hear about adding derivatives to stan-math it’s typically for speed. Is it also the case that implementing derivatives for functions can be more stable than autodiff?

betanalpha · September 20, 2021, 8:09pm

Yes. See for example the “GLM” density functions like poisson_log and bernoulli_logit. These functions compose a probability mass function with the inverse link function which allow for various cancellation of large intermediate terms that can be risk floating point overflow and underflow both in the evaluation of the function and its gradients.

This isn’t so much particular to autodiff, however, as it is to implementing function compositions directly in general.

spinkney · September 23, 2021, 11:02am

Thanks!

I found this blog post - introduction to automatic differentiation - from 2013 that says the same thing as you in more words.

Disclaimer: I have not actually personally seen any literature doing numerical error analysis of programs produced by automatic differentiation. The technique does avoid the gross problem of catastrophic cancellation when subtracting f(x) from f(x+dx), but may do poorly in more subtle situations. Consider, for example, summing a set of numbers of varying magnitudes. Generally, the answer is most accurate if one adds the small numbers together first, before adding the result to the big ones, rather than adding small numbers to large haphazardly. But what if the sizes of the perturbations in one’s computation did not correlate with the sizes of the primals? Then AD of a program that does the right thing with the primals will do the wrong thing with the perturbations.

It would be better, in this case, to define the summation function as a primitive (for AD’s purposes). The derivative is also summation, but with the freedom to add the perturbations to each other in a different order. Doing this scalably, however, is an outstanding issue with AD.

Topic		Replies	Views
Paper on autodiff for implicit function Publicity	0	634	December 30, 2021
Best way to generate a sequence of derivatives Developers	4	1077	February 7, 2018
Review of automatic differentiation and its efficient implementation Publicity	0	515	November 14, 2018
Differentiating conditionals and while loops Algorithms	13	4643	August 10, 2018
Numerical derivates Modeling	1	272	November 10, 2020

Over/underflow in autodiff and hand calculated derivatives

Related topics