Testing a 7-parameter function with known partials

Dear all,
I’m working with @Franzi to get the 7-parameter DDM LPDF function ready for a pull request. We wondered how we should go about testing it and would appreciate some advice:

  • Distribution tests: The current framework seems to be limited to 5 parameters. Should we try to extend it, or just test the function with the current functionality, using multiple combinations of 5 parameters while keeping the other 2 fixed?
  • Second and third derivatives: Currently the function only supports first derivatives, i.e. arguments passed as double or fvar<double> (our derivatives don’t work with the var type yet, similar to the wiki example). The distribution/autodiff tests pass, for example, fvar<fvar<double>> to the function; should we adjust the function to make this possible? (See the sketch after this list.)
  • In addition, we have some unit tests to check the basic functionality.
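To illustrate the second point, here is a minimal sketch (not our actual code) of what an fvar<fvar<double>> instantiation computes. The nested tangents mean a single evaluation yields the value, the first, and the second derivative; the squaring function here is just a stand-in for the lpdf:

#include <stan/math.hpp>

using stan::math::fvar;

// fvar<fvar<double>> nests tangents: seeding both inner tangents with 1
// makes y.d_.d_ the second derivative of the computed function.
double second_derivative_of_square(double x) {
  fvar<fvar<double>> x_ffd(fvar<double>(x, 1.0), fvar<double>(1.0, 0.0));
  fvar<fvar<double>> y = x_ffd * x_ffd;  // stand-in for the lpdf call
  return y.d_.d_;                        // equals 2 for f(x) = x^2
}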

The autodiff testing framework also requires higher-order derivatives. Do we have to use these tests if we can instead check the partials against correct values from other sources?

Best,
Valentin

A 7-parameter function is going to be challenging to test.

I’m also worried the full set of signatures might wind up being too large, and that it will tax our language’s type inference because it blows up the combinatorics of the types. If each argument can be double or var, you already have 2^7 = 128 combinations of arguments to test. If each can also be an array, vector, or row vector, with each container either primitive or autodiff, that’s 8 choices per argument and 8^7 > 10^6 combinations. This might be too much for our language type inference. @WardBrian, any idea what will happen with this?

Although we prefer fully defined functions, it’s OK to write functions that only have reverse-mode autodiff.

The current autodiff test framework is set up to run all of the higher-order tests as well as reverse mode. It would be helpful to modify the whole thing to let you specify which derivatives are being tested. That’d be a big, but super helpful, change to our testing framework.

The usual way to make higher-order autodiff work is to write a templated primitive version. To do that, each argument needs to be templated separately, and all the primitives used need to be differentiable at the appropriate order.
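As a minimal sketch of that pattern (the function name and density here are made up; stan::return_type_t promotes to the appropriate scalar type):

#include <stan/math.hpp>

// Each argument gets its own template parameter so any mix of double, var,
// and fvar instantiations works; every operation used must itself be
// differentiable at the order being tested.
template <typename T_y, typename T_mu, typename T_sigma>
stan::return_type_t<T_y, T_mu, T_sigma> toy_lpdf(const T_y& y, const T_mu& mu,
                                                 const T_sigma& sigma) {
  auto z = (y - mu) / sigma;
  return -0.5 * z * z - stan::math::log(sigma);
}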

You don’t need to use the testing framework if you have known derivatives, but I don’t know how to check all the combinations of input types otherwise.

Do you have any suggestions, @valmapra?

We store every defined signature in memory, so this would be bad (though we don’t differentiate between double/var, so we probably wouldn’t get up to 8^7, but 4^7 is still 16k)

A not insignificant portion of our testing in the compiler is testing the combinatorics of a few distributions with ~5 arguments and a lot of overloads, so it’s a valid concern.

I should note that it probably wouldn’t be too horrible. Even assuming a probably-overkill allocation of 128 bytes per signature, we’re only talking about an additional 2 MB if each of the 7 arguments had 4 valid types (4^7 = 16384 signatures × 128 bytes ≈ 2 MB). The signatures live in a hash map, so this also wouldn’t really impact compile times of models that don’t use the function.

So the real question is how many overloads there would be for each argument.

Thanks a lot for your suggestions. The terminology in my last post was a bit unclear: it’s 7 parameters plus the data, so we end up with 8 arguments.

Currently they are all vectorized to make the function more convenient to use. If we leave iterating over the arguments to the end user, we could remove vectorization for the 7 parameters if absolutely necessary. @WardBrian: with 8 arguments, the output of stanc --dump-stan-math-signatures | grep -A 2 wiener_full_lpdf is about 8 MB, as you estimated. Would this be too much?

@Bob_Carpenter: Regarding specifying which derivatives are being tested: I had a look at test/prob/generate_tests.cpp. The create_files function is called 6 times in main, with an index parameter selecting the kind of test (var, ffv, varmat, fd, fv, ffd). If we allowed passing in which indices should be included, maybe it wouldn’t be that big a change (I could have missed something, though). One possibility would be to introduce an optional line (e.g. // Derivatives: var fv), similar to how the “Arguments” line is currently processed. In create_files we could then check (see the sketch after this list):

  • Is this line present? No → continue as before.
  • Yes → is the string corresponding to the current index present (e.g. index 1 corresponds to var, so if index == 1, check whether “var” is there)? No → return without creating tests.
  • Yes → continue as before, creating tests.
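A rough sketch of that check, assuming the optional // Derivatives: line has already been tokenized (the names here are made up for illustration, not the actual generate_tests.cpp API):

#include <algorithm>
#include <string>
#include <vector>

// derivative_tokens: words found on an optional "// Derivatives: var fv"
// line; empty means the line is absent and all test kinds are generated.
bool should_generate(const std::vector<std::string>& derivative_tokens,
                     int index) {
  // Hypothetical 1-based mapping, so index 1 corresponds to "var".
  static const std::vector<std::string> kinds
      = {"", "var", "ffv", "varmat", "fd", "fv", "ffd"};
  if (derivative_tokens.empty())
    return true;  // no Derivatives line: continue as before
  return std::find(derivative_tokens.begin(), derivative_tokens.end(),
                   kinds[index]) != derivative_tokens.end();
}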

In our case, the huge number of parameters could get in the way, as the framework doesn’t allow for that many parameters, and it would probably take a long time even if it did.
As our subfunctions only deal with doubles, the way we handle the templated arguments is very straightforward. For an argument const T_y& y we do the following (all our arguments behave like y; checks omitted):

using T_y_ref = ref_type_t<T_y>;        // evaluate expression types once
T_y_ref y_ref = y;
scalar_seq_view<T_y_ref> y_vec(y_ref);  // uniform indexing for scalars and containers
for (size_t i = 0; i < N; i++) {
    const double y_val = y_vec.val(i);  // underlying double value, whatever T_y is
    ...
}

at which point we always have a double in y_val to work with. The derivatives are constructed using operands_and_partials, as sketched below. Therefore I think the risk of something going wrong with the types is quite low (assuming that only var, double, and the corresponding vectorized types are passed). Do you think we could skip the testing framework and rely on hand-written unit tests (say, scalar and vector with types double and var for each argument, using scalar doubles for all the other arguments, which would give a manageable 4 × 8 = 32 cases)?
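For reference, the overall pattern looks roughly like this, with a placeholder density standing in for ours (a one-argument sketch, not our actual function):

#include <stan/math.hpp>

// Sketch of the operands_and_partials pattern: compute with doubles,
// accumulate the log density, and store analytically known partials.
template <typename T_y>
stan::return_type_t<T_y> sketch_lpdf(const T_y& y) {
  using T_y_ref = stan::ref_type_t<T_y>;
  T_y_ref y_ref = y;
  stan::scalar_seq_view<T_y_ref> y_vec(y_ref);
  size_t N = stan::math::size(y);
  stan::math::operands_and_partials<T_y_ref> ops_partials(y_ref);
  double logp = 0.0;
  for (size_t i = 0; i < N; i++) {
    const double y_val = y_vec.val(i);
    logp += -0.5 * y_val * y_val;  // placeholder log-density term
    if (!stan::is_constant_all<T_y>::value)
      ops_partials.edge1_.partials_[i] += -y_val;  // placeholder known partial
  }
  return ops_partials.build(logp);
}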

8 MB of text output doesn’t necessarily mean that’s how much memory the compiler will use for it. Can you pipe the grep into wc -l instead, to get a rough count of the signatures involved?

Edit: Based on some rough numbers I just collected, we currently store ~30k signatures in ~7 MB.

That’s exactly 4^8 = 65536 signatures, which would then be about 15 MB by your numbers.

Yeah, I think that won’t work. I’m having a hard time pinning down exactly what the memory usage of our current list is (I’m getting numbers between 7 and 17 MB depending on which method I use), but increasing it by ~200% is probably a no-go.

Ok, thanks for pointing that out; then we’ll reduce this. For us there are 2 ways that would make sense (though we are not sure yet which one is better): one with 1 vectorized argument and one with 5, resulting in 512 or 8192 signatures. Would the latter already be small enough, or do we not even have to consider it?
Edit: Sorry, I messed up the calculation, using a factor of 2 instead of 1 for the scalar arguments. The right numbers are 4 (= 4 × 1^7) and 1024 (= 4^5 × 1^3).

8192 is still 4x larger than the current most-overloaded function, but it’s within the realm where I don’t think it’s an immediate dealbreaker. At that point it’s more a question of what the demand/use case for this function is and whether that justifies the extra testing and memory usage.

Ok, we’ll probably go with allowing vectorized input only for the data, not the parameters, so we’ll end up with only 4 signatures (see my edit above). Thanks for your help in sorting this out :D

@Bob_Carpenter: The signature is only specified in stanc3, so since the C++ function in stan-math is templated, it will still be able to take vectors even if one can no longer pass them through the Stan language. Is this ok, or should we actively reject vector input for these parameters?
Do you have an opinion on the tests? Would it be acceptable to just go with unit tests (see above)?

Yes, we only need unit tests.

But we want unit tests for the C++ functions, not just what gets exposed in Stan. So if you’re not going to test vector inputs, you shouldn’t accept them.

I’d hate for you to have to write this function sub-optimally because of testing. Is there really no way to test all of the arguments, even one at a time? You can exploit things you know about the implementation to formulate tests (it just makes them brittle to refactoring).
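For instance, a one-argument-at-a-time unit test could look like this (a hedged sketch: the argument order of wiener_full_lpdf and the value 0.123456 are placeholders, and the reference partial would come from an independent source):

#include <stan/math.hpp>
#include <gtest/gtest.h>

// Make a single argument a var and fix the rest as scalar doubles, then
// compare the adjoint against an externally computed partial.
TEST(mathWienerFull, partial_wrt_y) {
  using stan::math::var;
  var y = 1.5;  // the argument under test
  var lp = stan::math::wiener_full_lpdf(y, 0.5, 1.0, 2.0, 0.3, 0.1, 0.2, 0.05);
  lp.grad();                             // reverse pass fills the adjoints
  EXPECT_NEAR(0.123456, y.adj(), 1e-8);  // known-good partial (placeholder)
}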
