Obtain gradient of every target+=

Charles_Driver · March 6, 2020, 9:40am

I just implemented a function in R to compute subject-wise gradient contributions from my Stan models, which is useful for misspecification checks, but it would also be wonderful to get observation-wise gradients. I can’t see an easy way to do this outside of Stan because of the dependencies in the structure. Has anyone thought in this direction before? It would be a neat feature, I think…

wds15 · March 7, 2020, 12:15pm

Maybe code the model such that the Stan model can be configured to only compute the log lik for a specific data item?

Is there a ref for what you are doing?
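
The idea of letting the model evaluate the log likelihood for a single data item can be sketched outside Stan. The following is only a minimal illustration in plain Python, assuming independent normal observations and a finite-difference gradient; `log_lik`, its `item` argument, and `fd_gradient` are hypothetical names, not part of Stan or any Stan interface.

```python
import math

def normal_lpdf(y, mu, sigma):
    """Log density of a normal distribution evaluated at y."""
    return -0.5 * math.log(2 * math.pi) - math.log(sigma) - 0.5 * ((y - mu) / sigma) ** 2

def log_lik(theta, data, item=None):
    """Log likelihood of all rows, or of row `item` only when it is given."""
    mu, log_sigma = theta
    sigma = math.exp(log_sigma)
    rows = data if item is None else [data[item]]
    return sum(normal_lpdf(y, mu, sigma) for y in rows)

def fd_gradient(f, theta, eps=1e-6):
    """Central finite-difference gradient of f at theta."""
    grad = []
    for i in range(len(theta)):
        up = list(theta)
        up[i] += eps
        down = list(theta)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad

data = [0.3, -1.2, 2.1, 0.8]   # made-up observations
theta = [0.5, 0.0]             # (mu, log sigma), arbitrary evaluation point

# One gradient per data item, obtained by "configuring" the model per item.
per_item = [
    fd_gradient(lambda t, i=i: log_lik(t, data, item=i), theta)
    for i in range(len(data))
]
full = fd_gradient(lambda t: log_lik(t, data), theta)
```

With independent rows the per-item gradients add up to the full gradient, which is what makes this approach workable there. It costs one gradient evaluation per item, hence the inefficiency.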

Charles_Driver · March 7, 2020, 12:22pm

Hmm, true, that’s a possibility. Inefficient, but perhaps not too bad. I think there are lots of approaches that use the ‘score’ (gradient contribution) for various checks. Individual parameter contributions is one small body of work I have some connection to: https://www.tandfonline.com/doi/full/10.1080/10705511.2019.1667240?af=R

edm · March 13, 2020, 2:42am

That is an interesting idea. Regarding subject-wise vs observation-wise, I was involved in a paper where we considered this in the context of linear mixed models (LMM). Not sure whether it is helpful, but see Section 3.1 here:

https://www.jstatsoft.org/article/view/v087c01

Charles_Driver · February 16, 2024, 9:22am

Four years on and I’m back to thinking about this. It really would be helpful for fast post-hoc misspecification checks and model enhancements. Has anything changed in the Stan backend that might make this easier to obtain? When there are dependencies in the dataset, obtaining the observation-wise gradient contributions at present actually seems to require that I compute the likelihood of the ‘almost full’ dataset (all rows except row n) for every row n, and subtract this from the full likelihood. I don’t think treating the rows individually works. Here’s one use case, though I’m more interested in time series.
https://psyarxiv.com/jw8xb/
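
The leave-one-row-out difference described above can be sketched as follows. This is a toy illustration only, assuming a stationary Gaussian AR(1) process observed at integer times (so that a dropped row simply becomes a longer gap) and a finite-difference gradient; none of the function names come from Stan.

```python
import math

def normal_lpdf(y, mean, var):
    """Log density of a normal distribution with the given mean and variance."""
    return -0.5 * (math.log(2 * math.pi * var) + (y - mean) ** 2 / var)

def ar1_log_lik(theta, times, values):
    """Stationary AR(1) log likelihood; gaps between integer times are allowed."""
    phi = math.tanh(theta[0])         # keeps |phi| < 1
    sigma2 = math.exp(2 * theta[1])   # innovation variance
    stat_var = sigma2 / (1 - phi ** 2)
    ll = normal_lpdf(values[0], 0.0, stat_var)
    for k in range(1, len(times)):
        lag = times[k] - times[k - 1]
        mean = phi ** lag * values[k - 1]
        var = stat_var * (1 - phi ** (2 * lag))
        ll += normal_lpdf(values[k], mean, var)
    return ll

def fd_gradient(f, theta, eps=1e-6):
    """Central finite-difference gradient of f at theta."""
    grad = []
    for i in range(len(theta)):
        up = list(theta)
        up[i] += eps
        down = list(theta)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad

def loo_gradient_contributions(theta, times, values):
    """For every row n: gradient of the full data minus gradient without row n."""
    full = fd_gradient(lambda t: ar1_log_lik(t, times, values), theta)
    contributions = []
    for n in range(len(times)):
        t_rest = times[:n] + times[n + 1:]
        v_rest = values[:n] + values[n + 1:]
        rest = fd_gradient(lambda t: ar1_log_lik(t, t_rest, v_rest), theta)
        contributions.append([f - r for f, r in zip(full, rest)])
    return contributions

times = [0, 1, 2, 3, 4]                # made-up series
values = [0.2, 0.5, -0.1, 0.9, 0.4]
theta = [0.5, 0.0]                     # (atanh phi, log sigma)
contributions = loo_gradient_contributions(theta, times, values)
```

For the last row the difference reduces to the gradient of its conditional density given the previous row. For interior rows, dropping the row also changes the term of the following row, which is the reason the rows cannot simply be evaluated one at a time.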

edm · February 19, 2024, 7:12pm

Not sure, but maybe the bridgestan package could help?

Charles_Driver · February 20, 2024, 10:34am

Doesn’t look like it. Thinking about it, I guess it would have to be an internal Stan method that would store and output the array of gradients, incremented for each target+= call. Would be interested to hear from e.g. @Bob_Carpenter how hard such a thing might be to implement… ‘easy’ might motivate me to look at C++ again ;)
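
To make the idea of one stored gradient per increment concrete, here is a toy sketch in plain Python. It has nothing to do with Stan’s actual C++ internals: `TargetRecorder` is a hypothetical stand-in for the target accumulator, the model is a made-up first-order autoregression, and the gradients come from finite differences rather than autodiff.

```python
import math

def fd_gradient(f, theta, eps=1e-6):
    """Central finite-difference gradient of f at theta."""
    grad = []
    for i in range(len(theta)):
        up = list(theta)
        up[i] += eps
        down = list(theta)
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad

def normal_lpdf(y, mu, sigma):
    """Log density of a normal distribution evaluated at y."""
    return -0.5 * math.log(2 * math.pi) - math.log(sigma) - 0.5 * ((y - mu) / sigma) ** 2

class TargetRecorder:
    """Toy accumulator that keeps every increment as its own function of theta."""

    def __init__(self):
        self.terms = []

    def add(self, term):
        """Plays the role of one `target +=` statement."""
        self.terms.append(term)

    def total(self, theta):
        return sum(term(theta) for term in self.terms)

    def gradients_per_term(self, theta, eps=1e-6):
        """One gradient vector per recorded increment."""
        return [fd_gradient(term, theta, eps) for term in self.terms]

y = [0.2, 0.5, -0.1, 0.9]   # made-up series
target = TargetRecorder()
for k in range(1, len(y)):
    # Each increment is the conditional density of y[k] given y[k - 1].
    target.add(lambda t, k=k: normal_lpdf(y[k], t[0] * y[k - 1], math.exp(t[1])))

theta = [0.6, -0.2]          # (phi, log sigma)
per_increment = target.gradients_per_term(theta)
total_gradient = fd_gradient(target.total, theta)
```

Here the per-increment gradients sum to the gradient of the total, so the array is a decomposition of the usual gradient by increment. Whether something comparable could be recorded cheaply inside Stan is exactly the open question.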
