A gradient block?

James_Savage · April 5, 2018, 2:27am

Some of the models I fit quite regularly have fairly straightforward analytical gradients but are very expensive to evaluate via autodiff. I know it’s in theory possible to hack together my own version of Stan with these analytical gradients defined on the back end, but I’d probably mess that up. Something like this would be extremely handy:

data {
  ...
}
parameters {
  real par1;
  real par2;
}
gradients {
  par1 = some expression;
  // gradient wrt par2 to be left to autodiff
}
model {
  ...
}

This isn’t much of a question, but would something like this be possible in theory?

aaronjg · April 5, 2018, 2:50am

Can you wrap it into its own function and then code the gradients by hand? You don’t need to recompile Stan to do this:
https://cran.r-project.org/web/packages/rstan/vignettes/external.html

I’ve done this for some parts of a model that have easy analytical gradients and are evaluated in the inner loop of the likelihood function.

andre.pfeuffer · April 5, 2018, 3:13am

Can you provide an example?
I would be nice to have an example prototype about how to extend Stan in C++.

aaronjg · April 5, 2018, 3:27am

Sure.

I needed to calculate \frac x {1 - x} or \exp(\mbox{logit}(x)) in a hot path, but could not find it in the Stan math library. It has derivative \frac 1 {(1-x)^2}

So I added this code to a separate .hpp

and included it as described in the tutorial.

inline double exp_logit(double u) {
  return u / (1 - u);
}

inline var exp_logit(const var& u) {
  const double uv = u.val();
  const double m_u = (1-uv);
  const double m_u_sq = m_u * m_u;
  
  
  return var(new precomp_v_vari(exp_logit(u.val()),
                                u.vi_,
                                1 / m_u_sq));
}
struct exp_logit_fun {
  template <typename T>
  static inline T fun(const T& x) {
    return exp_logit(x);
  }
};

template <typename T>
inline typename apply_scalar_unary<exp_logit_fun, T>::return_t
exp_logit(const T& x) {
  return apply_scalar_unary<exp_logit_fun, T>::apply(x);
}

template <typename T0__>
Eigen::Matrix<typename boost::math::tools::promote_args<T0__>::type, 
Eigen::Dynamic,1>
exp_logit(const Eigen::Matrix<T0__, Eigen::Dynamic,1>& x, std::ostream* 
pstream__) {
 return exp_logit(x);
}

struct exp_logit_functor__ {
template <typename T0__>
    Eigen::Matrix<typename boost::math::tools::promote_args<T0__>::type, 
Eigen::Dynamic,1>
  operator()(const Eigen::Matrix<T0__, Eigen::Dynamic,1>& x, std::ostream* 
pstream__) const {
return exp_logit(x, pstream__);
}
};

This is pretty much all the Stan C++ code I’ve ever written, so by no means take this to be best practices!

avehtari · April 5, 2018, 3:42pm

User defined gradients are in Stan Road Map https://github.com/stan-dev/stan/wiki/Stan-Road-Map,
but while waiting for them, what @aaronjg did doesn’t seem that complicated (given that the process of including additional .hpp is not too complicated for you)

sakrejda · April 5, 2018, 3:57pm

I’d trust 0% of users to keep the gradients block properly updated as they evolve their model but I can see the attraction.

andre.pfeuffer · April 5, 2018, 4:24pm

User’s autodiff could be cross-checked by Stan’s version at initial phase.

avehtari · April 5, 2018, 5:33pm

And for random draws after sampling stops. This is also in the plan.

Bob_Carpenter · April 13, 2018, 8:02pm

We have to trust our users. There’s so many ways to screw up a Stan program that we can’t really provide much protection.

I’d trust a fair bit more than 0% because some users are careful with software and test as they go.

And as others have noted, we wouldn’t do this without finite diff or autodiff tests.

What we’re likely to add is a way to define a function with gradients, not just define gradients w.r.t. the log density for parameters.

Then users will only define gradients w.r.t. constrained parameters, so we’ll need to chain the transforms and Jacobians for any constrained parameters.

James_Savage · April 14, 2018, 9:52pm

@Bob_Carpenter That sounds even better than a gradients block. Exciting!

bnicenboim · February 17, 2021, 7:30pm

Hi Bob,
I just found this from almost three years ago. Will this still happen? (soon? eventually?)

bbbales2 · February 17, 2021, 7:47pm

It’s something we talk about and this idea is reasonably popular but there’s no work on it as of yet :D.

bnicenboim · February 17, 2021, 8:05pm

I really look forward to be able to define something like a _grad function for my custom _lpdf/lpmf function. And not have to deal with c++.
Is there an open issue about this that I can track?
I can’t code something like this, but I volunteer to help with the testing and documentation :)

bbbales2 · February 19, 2021, 2:02pm

I don’t think there is, but if you ever get curious about the status just ask again. Hopefully it’ll be a yes in the future sometime :P

Johannes_Hendriks · August 4, 2021, 2:23am

Hey, is there a way to include external c++ code using the new pystan 3 interface?

ahartikainen · August 4, 2021, 5:37am

Currently, no.

Johannes_Hendriks · August 4, 2021, 10:45pm

Is there a plan to add support for this?

ahartikainen · August 5, 2021, 4:50am

I don’t think Stan has any ‘official’ way to do this, so answer is probably no.

Cc @ariddell @bbbales2

bbbales2 · August 6, 2021, 10:49pm

Yeah I don’t know any way to do this other than the rstan stuff above or modifying sources and rebuilding manually.

Johannes_Hendriks · August 11, 2021, 1:14am

The research project I am working on will require custom gradients. Is there a guide or some instructions on how to modify source and recompile for use with PyStan 3? Thanks in advance for any answers

Topic		Replies	Views
Bypassing numerical differentiation with this simple hack? Modeling	6	568	November 4, 2020
Automatic differentiation with stan math Modeling	12	4303	November 24, 2017
Jacobian/Gradient Function Developers rstan	3	577	September 20, 2018
Modifying external C++ function example to return analytical gradient Developers	1	361	June 10, 2021
Beginner, Forward model and calling external function General	7	1671	April 2, 2019

A gradient block?

Related Topics