Questing regarding implementing a new function using existing Stan functions

mathDR · August 31, 2022, 2:37pm

Hello. I am working on adding a new function to the stan-math library and had a question regarding usage of existing templated Stan functions.

If we are suppling gradients/hessians for our function, is it better to “unroll” the Stan function when writing them out?

For example, assume our Stan function is created (in the functions block) as:

  row_vector get_holiday_lift(
    vector h_skew,
    vector h_shape,
    vector h_scale,
    vector h_loc,
    vector intensity,
    data matrix d_peak,
    data matrix hol_mask
    )
  {
    int num_holidays = dims(d_peak)[1];
    int num_dates = dims(d_peak)[2];

    row_vector[num_dates] z;
    row_vector[num_dates] tdd = zeros_row_vector(num_dates);

    for (h in 1:num_holidays) {
      z = (d_peak[h, :] - h_loc[h]) ./ h_scale[h];
      tdd += (2.0 * intensity[h] * exp(-abs(z) .^ h_shape[h]) .*
        inv_logit(h_skew[h] * z)
        ) .* hol_mask[h,:];
    }
    return tdd;
  }

Then when generating the function (and hessians) in stan/math/prim/fun I could use the inv_logit that is supplied with Stan, or I could “unroll” it as it is defined, e.g. use the 1 / 1+ exp(-x) formulation.

I guess my question is: if supplying my own derivatives, is it most proficient to get down to std functions?

Also, I still see a LOT of documentation regarding “Adding a new function to the stan-math library”. Is there a “blessed” set of docs to go by?

Bob_Carpenter · September 20, 2022, 9:18pm

Sorry this got missed. We’re used to discussing developer issues more through GitHub issues than on the forums, despite suggesting that people comment on the forums.

Sorry this is such a mess. To be honest, I’ve given up on writing new Stan functions for just this reason.

@stevebronder wrote out new doc according to how we do things now (which has been evolving for the last 10 years), but I don’t think it’s merged yet. If that gets merged, it’ll be the gold standard doc. The best advice I can give you at this point is find a function that’s like the one you are going to write and copy its style. Maybe @stevebronder or @rok_cesnovar have better suggestions.

To answer your more concrete questions.

D

Delegate to our existing functions wherever possible, as we’ve tried to optimize them for efficiency and arithmetic stability. You definitely don’t want to implement inv_logit the way you suggested. Check out our library implementation, which branches on either side of zero and uses a Taylor approximation for small inputs so that 1 + exp(a) doesn’t round to 1:

github.com

stan-dev/math/blob/develop/stan/math/prim/fun/inv_logit.hpp

#ifndef STAN_MATH_PRIM_FUN_INV_LOGIT_HPP
#define STAN_MATH_PRIM_FUN_INV_LOGIT_HPP

#include <stan/math/prim/meta.hpp>
#include <stan/math/prim/fun/constants.hpp>
#include <stan/math/prim/fun/exp.hpp>
#include <stan/math/prim/fun/inv.hpp>
#include <stan/math/prim/functor/apply_scalar_unary.hpp>
#include <cmath>

namespace stan {
namespace math {

/**
 * Returns the inverse logit function applied to the argument.
 *
 * The inverse logit function is defined by
 *
 * \f$\mbox{logit}^{-1}(x) = \frac{1}{1 + \exp(-x)}\f$.
 *

This file has been truncated. show original

If you template the primitive implementation, it can do double duty for forward-mode autodiff. Or you can write custom forward-mode, but either way, you don’t need to implement Hessians manually. We compute second-order derivatives, including Hessian matrices, by nesting reverse mode in forward mode.

Assuming you mean efficient then no. It’s most efficient to reduce things to lazy Eigen expression templates. After that, call our built ins rather than writing your own, unless you can make the combination more stable with algebraic reductions.

mathDR · September 21, 2022, 4:54pm

Okay this is all super helpful @Bob_Carpenter !

Followup question about the “adding to stan-math”: if I call my function as external c++, will I get all of the benefits of autodiff that you are alluding to? So to restate: if I have an external function that I want to “speed up” and I write it in templated c++, instead of adding it to stan-math, if I add it via “external c++”, is that kinda the same thing (in terms of efficiency)?

Bob_Carpenter · November 9, 2022, 9:51pm

Sorry I missed the follow up question. If you write a function in the Stan language, it gets transpired to templated C++ and run with autodiff. The way to speed things up with C++ is to provide analytical gradients for functions that are more efficient (in memory or time or both) and/or more numerically stable than autodiff.

mathDR · November 10, 2022, 4:49pm

@Bob_Carpenter is there a way to see the transpired function? (So I can copy the template when putting in the analytical gradients?)

For example, I would write my function in Stan (using the functions block) then view the transpired function to see what format I needed to use for the C++ template?

Can one do this?

WardBrian · November 10, 2022, 4:56pm

Yes, you could use something like https://rok-cesnovar.github.io/stanc3js-demo/index.html (or normal cmdstan) to inspect the generated code.

Note that the automatically generated code does not always match what we’d like a hand-written function to look like. One notable change is the pstream__ argument which is used for output from all user-defined functions, and the use of a rethrow_located function to generate nicer runtime errors.

mathDR · November 10, 2022, 5:02pm

Okay great! Yeah this is where my experience breaks down in writing templated C++.

I know there are some examples of user-defined templated functions here, but NOTE says that they are a “bit outdated”. Is there a suite of examples like this somewhere else?

Also, adding an example of vector inputs and vector outputs (and others – matrix inputs, matrix outputs, etc.) would be great!

WardBrian · November 10, 2022, 5:10pm

The Getting Started page on that section of the documentation has a modern example

Topic		Replies	Views
Custom Function with Known Gradients Interfaces	4	1245	December 12, 2017
Making arbitrary C++ functions available to Stan Modeling	14	1985	June 12, 2018
Gradient after transformation (math library) General stan-math	2	1431	November 22, 2017
Link Symbolicc++ function to Stan interface Modeling	9	495	September 12, 2021
Trouble adding a function of a vector with known gradients Modeling	2	364	April 17, 2019

Questing regarding implementing a new function using existing Stan functions

Related topics