What is the idiomatic way to check for logical constraints in input data?

jachymb · January 8, 2025, 8:34am

Suppose I want to have something like this in data:

int n;  
array[n] int a;
int s = sum(a);
vector[s] x;
vector[s] y;

But this is not possible, because assignment statements aren’t possible within data.

I had in idea to instead do something like:

data {
  int n;
  array[n] int a;
  int s;  // redundant input
  vector[s] x;
  vector[s] y;
}
transformed data {
 if (s != sum(a)) {fatal_error("does not add up!");}
}

another option would be

data {
  int n;
  array[n] int a;
  vector[sum(a)] x;
  vector[sum(a)] y;
}
transformed data {
  int s = sum(a);
}

but both feel a little clunky and would also be inefficient if instead of the sum was an expensive computation or I have to repeat the pattern many times over.

Would either of the options be considered more idiomatic/preferred for a good reason? Or is there a better way I don’t know of? Or maybe does an optimizer in the compiler recognize it’s a pure function and actually computes it only once?

Bob_Carpenter · January 8, 2025, 10:03pm

Your second approach is the preferred approach because it can’t fail by inconsistency of s.

I agree it’s clunky. If the function gets complicated, then you need to write it as a user-defined function.

The data and transformed data blocks are only executed once as the data is read in. They don’t require autodiff, so it’s all C++ primitives, which are super-duper fast. The I/O to bring data in from memory will be slower than summing an in-cache vector, so you won’t even be able to measure the slowdown here without a very careful instrumentation effort. It might take an extra microsecond or two if those vectors are 10K long.

jachymb · January 8, 2025, 10:11pm

OK, thanks for the comment! :)

Topic		Replies	Views
Bounds on a specific column of a 2D array Modeling techniques	22	1792	July 29, 2017
Assign values to an array or vector, and specifying fixed vectors/matrices Modeling	2	4074	June 15, 2017
Reassignment if vs if-else? Modeling specification	5	369	October 19, 2022
Efficient way to do int - int[]? Modeling	2	477	December 14, 2021
How to declare and use integer matrices Modeling matrix	3	1575	June 29, 2020

What is the idiomatic way to check for logical constraints in input data?

Related topics