Matrix product

Given a row_vector of size M and a matrix of size M×N, I'm computing the product as row_vector * matrix.

I'm wondering whether it would be faster or slower to instead define a MATRIX of size N×M and a vector of size M, and compute the product as MATRIX * vector.

In my problem, M is small (e.g., M = 10) but N may be large (e.g., N = 10000). I'd very much appreciate advice on which way of doing the matrix product will be faster.

This is far from my expertise, but since there are no replies so far I’ll give it a try; if I get it wrong enough maybe I’ll get yelled at and you’ll get a better answer.

I believe the efficiency of this in Stan will follow that of the C++ libraries it uses: Eigen (and maybe also Boost). I'm not sure whether the row- or column-major storage of matrices will make a difference to the speed of this operation in practice (especially since there are usually more costly operations at every iteration).

In practice, instead of wondering, you could just try both options and run some trial runs to see if you notice any difference. I'm guessing it won't really make that much of a difference.
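Not Stan itself, but a quick NumPy sketch (with the sizes from the question) of the two orderings; the two products are transposes of each other, so before timing anything you can first confirm they agree. Wrapping each line in `timeit` would then give the benchmark:

```python
import numpy as np

rng = np.random.default_rng(0)
M, N = 10, 10_000                   # small M, large N, as in the question

v = rng.standard_normal(M)          # the length-M (row) vector
A = rng.standard_normal((M, N))     # M x N matrix
B = A.T.copy()                      # N x M matrix for the other ordering

# row_vector * matrix  ->  length-N result
r1 = v @ A
# MATRIX * vector      ->  length-N result
r2 = B @ v

# Same numbers either way; only the memory-access pattern differs.
assert np.allclose(r1, r2)
```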


Eigen Matrices are column major in Stan.


so row_vector * matrix would be faster?

For row_vector * matrix, the row_vector is multiplied by each column of the matrix. Since Eigen matrices are column major, I'd assume that's more performant. On the flip side, matrix * vector multiplies each row of the matrix by the vector, so it's non-contiguous access. But Eigen might have tricks for this, so you may not end up with a difference either way. It's always better just to benchmark these things.
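A rough picture of why the layout matters, sketched in NumPy by forcing Fortran (column-major) order, which is the storage Eigen uses in Stan. Each column is then contiguous in memory, so row_vector * matrix is N dot products against contiguous columns:

```python
import numpy as np

M, N = 10, 5
A = np.asfortranarray(np.arange(M * N, dtype=float).reshape(M, N))

# In column-major (Fortran/Eigen) storage, stepping down a column moves by
# one element (8 bytes for float64); stepping across a row jumps M elements.
assert A.strides == (8, 8 * M)

# row_vector * matrix = N dot products, each against one contiguous column.
v = np.ones(M)
out = np.array([v @ A[:, j] for j in range(N)])
assert np.allclose(out, v @ A)
```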


The adjoint of c = Ab is \bar{b} = A^T \bar{c}, so the gradient pass multiplies by the transpose and is efficient if you right-multiply; whichever orientation is contiguous in the forward pass is strided in the reverse pass. So who even knows
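A numerical sketch of that adjoint rule: for c = Ab and a downstream gradient \bar{c}, the reverse-mode gradient with respect to b is A^T \bar{c}, checked here against finite differences (sizes are made up for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 3))
b = rng.standard_normal(3)
c_bar = rng.standard_normal(4)      # adjoint flowing into c = A b

# reverse-mode rule: b_bar = A^T c_bar  (a multiply by the transpose)
b_bar = A.T @ c_bar

# finite-difference check of d(c_bar . A b) / db, one basis vector at a time
eps = 1e-6
fd = np.array([
    (c_bar @ (A @ (b + eps * e)) - c_bar @ (A @ (b - eps * e))) / (2 * eps)
    for e in np.eye(3)
])
assert np.allclose(b_bar, fd, atol=1e-6)
```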


Thank you! Do you mean the code below, with right-multiply, will be faster?

data {
  int<lower=1> K;
  int<lower=1> N;
  array[N] vector[K] x;
  array[N] real y;
}
parameters {
  vector[K] beta;
}
model {
  for (n in 1:N)
    y[n] ~ normal(dot_product(x[n], beta), 1);
}

or this way?

data {
  int<lower=1> N;
  int<lower=1> K;
  matrix[N, K] x;
  vector[N] y;
}
parameters {
  vector[K] beta;
}
model {
  y ~ normal(x * beta, 1);
}

I'm not sure right-multiply is good when N >> K, for example N = 1 million, K = 3.
I'll test both types of matrix multiplication to see the performance.
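For what it's worth, the two models compute the same linear predictor; a NumPy sketch with made-up sizes shows the loop of dot products and the single matrix-vector product agree:

```python
import numpy as np

rng = np.random.default_rng(2)
N, K = 1000, 3                      # N >> K, as in the question
x = rng.standard_normal((N, K))
beta = rng.standard_normal(K)

# loop-of-dot-products form (first model): one dot product per observation
mu_loop = np.array([x[n] @ beta for n in range(N)])
# single matrix-vector product (second model)
mu_vec = x @ beta

assert np.allclose(mu_loop, mu_vec)
```

In Stan the vectorized `y ~ normal(x * beta, 1)` form is generally recommended regardless of layout, since it builds one matrix-multiply node in the autodiff graph instead of N separate dot-product nodes.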