Eigen mixed type binary operations

Bob_Carpenter · February 8, 2018, 8:59pm

@bgoodri sent this and I thought I’d share:

See Eigen: Eigen::ScalarBinaryOpTraits< ScalarA, ScalarB, BinaryOp > Class Template Reference but this actually already works in Stan somehow (I think via NumTraits); see this function which does not have to promote doubles to vars https://github.com/stan-dev/rstanarm/blob/master/inst/include/csr_matrix_times_vector2.hpp

I’d have thought the csr_matrix_times_vector2 would have analytic gradients and need that distinction anyway. Or does it only do analytic for double matrix and var vector?

bgoodri · February 8, 2018, 10:55pm

double CSR matrix times var vector was slower than double CSR matrix times double vector with precomputed gradients, so I commented out that specialization in the .hpp file.

But the point is that now we don’t have to overpromote and we don’t absolutely have to have analytic matrix calculus.

bgoodri · February 8, 2018, 11:08pm

slower = faster

Bob_Carpenter · February 9, 2018, 6:49pm

You can edit old posts.

Bob_Carpenter · February 9, 2018, 7:08pm

I don’t even see a specialization in rev, only the top-level implementation. I can see from looking at the code why it’d be hard to get any speedups as it delegates each output to a dot-product calculation, which is pretty optimal as written. The only saving would be in avoiding some intermediate copies, but those aren’t so bad compared to all the other ad-hoc indexing going on (very hard to cache). Really there’s nothing else to be gained other than avoiding some big matrix-sized copies and allocations which can be avoided. It could perhaps be optimized by pulling all the chain calculations into a single node, but that won’t be that big a savings and it’s very complicated.

The most obvious speedup is to avoid this pattern:

 Eigen::Matrix<result_t, Eigen::Dynamic, 1> b_sub(idx);
    b_sub.setZero();

If result_t is var, then you get idx number of allocations on the autodiff stack which are quickly replaced by idx copies of zero. What you really want to do is this:

 auto b_sub = rep_vector(result_t(0), idx);

Or you could spell the whole result type, which doesn’t change.

Topic		Replies	Views
Stan SIMD & Performance Algorithms	23	4390	January 23, 2020
Stan-math: Eigen support General stan-math	13	1812	July 4, 2017
Are matrix operations parallelized? Modeling	3	788	February 27, 2020
Dot_product vs vectorization Developers	6	2172	April 17, 2018
OpenCL symmetric eigendecomposition Developers math	3	431	May 10, 2021

Eigen mixed type binary operations

Related topics