Log of modified_bessel_first_kind

bgoodri · June 8, 2017, 6:10pm

As I was saying this morning, I have been working on the logarithm of one of the Bessel functions and have some questions about the implementation. Currently our Bessel functions can only be evaluated at integer orders even though Boost can calculate them with any order, but Boost does not implement log-versions. Here I am only talking about the (logarithm of the) modified Bessel function of the first kind.

The immediate reason for doing so is that line 72 of

https://github.com/stan-dev/math/blob/develop/stan/math/prim/scal/prob/von_mises_lpdf.hpp#L72

overflows for kappa > 710. That can be fixed by using log_sum_exp logic, although for large kappa it requires summing many terms. For sufficiently large kappa it is much cheaper to use an asymptotic expansion, which Boost does for kappa > 100 and very small orders. Tiny values of kappa are not really a problem numerically, although there is an asymptotic expansion for them.

The first question is what to do about the derivatives. The derivative with respect to kappa involves the same Bessel function but at order -1. It is somewhat inefficient to evaluate the Bessel function separately for one order and then an adjacent order, when both can be calculated simultaneously. But that is not something that we typically do under stan/math/prim/scal/fun. Is it okay to return a std::pair<double,double>? It would make perfect sense for the version under stan/math/rev/scal/fun to calculate the value and the derivative simultaneously (although that is not what is being done now), but von_mises_lpdf() does not call the rev version.

A second reason for reimplementing this is that we cannot currently do the Von Mises-Fisher generalization

because you have to evaluate the (log of) the Bessel function at order p / 2 - 1 and that would currently only work for even p (the dimensionality of the hypersphere). My general implementation works for any real order greater than or equal to -1, although there are specializations that could be implemented for half-integer orders that only involve summation of a finite number of terms.

I am not sure if it is worth the effort.

The last question is whether this Bessel function comes up in any other statistical contexts? In other words, do we only need to get this right for the von Mises (- Fisher) distribution? The modified Bessel function of the second kind is more common in statistics and is defined in terms of the modified Bessel function of the first kind

but there are different computational issues there that would require a separate function.

Bob_Carpenter · June 12, 2017, 6:44am

Returning pairs is OK or you can just pass in mutable references for the results.

Cutting over to approximations at some threshold should also be OK.

I don’t know how much the higher dimensional generalizations come up or whether the Bessel’s used anywhere else.

ppernot · March 28, 2018, 9:03am

Any follow-up ? I would be interested by using real orders in thus function…

bgoodri · March 28, 2018, 12:56pm

It got implemented in Stan Math but is not exposed in the Stan language yet.

ppernot · March 28, 2018, 1:51pm

OK, thanks!
I guess I will be patient ;-)

Bob_Carpenter · March 30, 2018, 8:08pm

@bgoodri, is there an issue in stan-dev/stan to add it with a signature? New functions are trivial to add to Stan—all we need is the doc for the manual and the function signature in stan/lang/function_signatures.h.

bgoodri · March 30, 2018, 8:09pm

I can’t remember. Likely not. I can do it.

Bob_Carpenter · March 30, 2018, 8:16pm

I see. Here’s the GitHub issue. I’ll assign myself and up-priortize it so I get to it.

@bgoodri: I see stan/math/prim/scal/fun/log_modified_bessel_first_kind.hpp. Can the Stan signature be:

real log_modified_bessel_first_kind(int v, real z);

We’ll be adding a generic data qualifier as soon as the refactor gets moved, so we could have a qualfiied real variable data real v that would have to resolve to data (i.e., something that doesn’t need a derivative).

bgoodri · March 30, 2018, 8:19pm

If we are going to expose it, then I should implement the rev version.

Bob_Carpenter · March 30, 2018, 8:24pm

We can do the two things independently. It’s still better as is than having the user write it, I suspect.

bgoodri · March 30, 2018, 8:29pm

OK, although we should doc that the log_ version requires the first argument to be greater than -1; otherwise we would be taking the log of something that might not be positive.

bgoodri · April 1, 2018, 2:47am

What is the best way to implement the derivatives of log_modified_bessel_first_kind? We know what the numerators are:

but those should get calculated simultaneously with I_v\left(z\right) because they share basically all of the computation. Is the best thing we can do to make a copy of the prim version, stick into the rev/ directory and insert things like

if (!is_constant_struct<T1>::value) {
  // calculate digamma, etc.
} 

if (!is_constant_struct<T2>::value) {
  // pretend v is v + 1 and calculate part of log_bessel_first_kind(v + 1, z) 
}

syclik · April 1, 2018, 3:13am

That sort of code can live in prim. We do things with shared computations all over the probability libraries and all the univariate ones only live in prim.

Hopefully that helps. Want me to send a specific example?

Bob_Carpenter · April 1, 2018, 5:41pm

You can use MathJax now, e.g., $\int e^x\,\mathrm{d}x$ becomes \int e^x\,\mathrm{d}x.

bgoodri · April 2, 2018, 5:12pm

So do it with operands_and_partials?

Sergey_Malashenko · November 1, 2019, 11:24am

Good day! Could you put descriptions of your approximating formulas?

nikunj410 · February 22, 2024, 12:30am

I am working on a custom pdf that uses {\displaystyle {}_{1}F_{1}(a,2a,x)} that can be represented in terms of Bessel Functions as follows

{\displaystyle {}_{1}F_{1}(a,2a,x)=e^{x/2}\,{}_{0}F_{1}\left(;a+{\tfrac {1}{2}};{\tfrac {x^{2}}{16}}\right)=e^{x/2}\left({\tfrac {x}{4}}\right)^{1/2-a}\Gamma \left(a+{\tfrac {1}{2}}\right)I_{a-1/2}\left({\tfrac {x}{2}}\right).}

The current Stan implementation of the Bessel function does not accept non-integer orders.
Ideally, I would have used the {\displaystyle {}_{1}F_{1}} directly, but the function is currently not available in stan either.

Is there a possibility of having this function for users in the near future?

maxbiostat · February 22, 2024, 9:43pm

@andrjohns

Bob_Carpenter · February 22, 2024, 10:38pm

I think the problem may have been calculating derivatives with respect to non-integer orders. There are a couple of relevant issues with commentary and potential workarounds:

Signature of [Modified] Bessel Functions [of the Second Kind] · Issue #23 · stan-dev/math · GitHub
Allow real `v` for `modified_bessel_second_kind` · Issue #2795 · stan-dev/math · GitHub

martinmodrak · February 23, 2024, 7:44am

For completenes, I’ll also add that I spent some effort on implementing the log of modified bessel of the second time and getting the derivatives right (especially w.r.t. order) proved to be beyond my skill:

github.com/stan-dev/math

Implemented log(BesselK) for fractional orders

stan-dev:develop ← martinmodrak:feature/issue-1112-continuous-besselk

opened 04:46PM - 18 Feb 19 UTC

martinmodrak

+880 -14

## Summary Implemented logarithm of the modified Bessel function of the secon…d kind (Bessel K) for fractional orders, supporting differentiation with respect to both variables. Log(BesselK) is useful for computing the lpdf of some distributions, such as [Generalized inverse Gaussian](https://en.wikipedia.org/wiki/Generalized_inverse_Gaussian_distribution) or [SICHEL]( https://www.rdocumentation.org/packages/gamlss.dist/versions/5.1-1/topics/SICHEL) The computation is based on https://github.com/stan-dev/stan/wiki/Stan-Development-Meeting-Agenda/0ca4e1be9f7fc800658bfbd97331e800a4f50011 (but modified to allow for stan::math::var arguments). Thanks to @bgoodri for providing it, I would not have been able to find this formula on my own. The code snippet linked above is in turn based on Equation 26 of Rothwell: Computation of the logarithm of Bessel functions of complex argument and fractional order https://scholar.google.com/scholar?cluster=2908870453394922596&hl=en&as_sdt=5,33&sciodt=0,33 Both derivatives are computed by auto-diffing over the 1d integrator. It should be possible to get an explicit formula for the derivative with respect to `v` similarly to the way it is computed in `modified_bessel_second_kind`. The name of the main function (`log_modified_bessel_second_kind_frac`) is provisional. The function is now in `rev/arr/fun`. This is IMHO weird, but I include `rev/arr/functor/integrate_1d` so the linter complained when the function was in `rev/scal/` In addition, I tried to improve the error messages from `integrate_1d` and its documentation, as following the current documentation led me astray. This could be moved to a separate pull request if desired. ## Tests A grid of values of the function and its gradients was computed in Mathematica (code is part of the comments in the test file). The relative error between Mathematica and this implementation is <1e-7 for all values and gradients tested. ## Side Effects The error messages of `integrate_1d` have been modified to include the error threshold. ## Checklist - [x] Math issue #1112 - [x] Copyright holder: Institute of Microbiology of the Czech Academy of Sciences The copyright holder is typically you or your assignee, such as a university or company. By submitting this pull request, the copyright holder is agreeing to the license the submitted work under the following licenses: - Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause) - Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/) - [ ] the basic tests are passing - unit tests pass (to run, use: `./runTests.py test/unit`) - header checks pass, (`make test-headers`) - docs build, (`make doxygen`) - code passes the built in [C++ standards](https://github.com/stan-dev/stan/wiki/Code-Quality) checks (`make cpplint`) - [x] the code is written in idiomatic C++ and changes are documented in the doxygen - [x] the new changes are tested

Topic		Replies	Views
Skellam distribution: overflow in modified_bessel_first_kind General	6	2356	October 6, 2017
BesselK with the order (v) as real (parameter) Modeling techniques	8	1287	August 13, 2019
Hoping for some guidance / help with implementing custom log likelihood and gradient for research project (details below) General	23	2257	October 18, 2021
Potential for the "Exponentially scaled modified Bessel function of the first kind" Developers gaussian-process	0	532	August 9, 2021
Problem implementing Zipf (power law) distribution -- external C++ RStan	52	3447	March 28, 2020

Log of modified_bessel_first_kind

Related topics