I just sped up a model by a factor of 10 by replacing a matrix-vector product with a loop that only processes the non-zero elements of the matrix. The matrix was a design matrix for the random-effects part of the model (so the matrix was data for Stan). I assume rstanarm is in the same situation.
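To make the trick concrete, here is a minimal sketch (plain Python, not the actual model code) of the difference: a dense matrix-vector product spends a multiply-add on every entry, zeros included, while a loop over the stored non-zeros (coordinate/COO triplets) skips all the zero-times-whatever work. All names here are hypothetical.

```python
def dense_matvec(A, x):
    """Dense product: O(rows * cols) multiply-adds, zeros included."""
    return [sum(A[i][j] * x[j] for j in range(len(x)))
            for i in range(len(A))]

def sparse_matvec(rows, cols, vals, x, n_rows):
    """COO product: one multiply-add per stored non-zero."""
    y = [0.0] * n_rows
    for i, j, v in zip(rows, cols, vals):
        y[i] += v * x[j]
    return y

# A 3x4 design matrix with only three non-zeros.
A = [[1.0, 0.0, 0.0, 0.0],
     [0.0, 0.0, 2.0, 0.0],
     [0.0, 3.0, 0.0, 0.0]]
rows, cols, vals = [0, 1, 2], [0, 2, 1], [1.0, 2.0, 3.0]
x = [1.0, 2.0, 3.0, 4.0]

assert dense_matvec(A, x) == sparse_matvec(rows, cols, vals, x, 3)
```

In the autodiff setting the saving is even bigger than the flop count suggests, since every `0 * x[j]` in the dense version would otherwise still put a node on the AD stack.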
So this seems to be a major problem, as the AD stack gets inflated by all those zero-times-whatever products.
Do we have a good solution now, or one on the horizon? Looping will do, but matrix expressions are a lot more expressive in the model.
Sorry if this was discussed already and I missed it.
There’s a fix in RStan (@bgoodri did it), and I can write one into math if it’s needed (the derivatives are not hard).
How much of a pain would it be to swap your code to column-major? That layout is more useful for other sparse operations…
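For readers unfamiliar with the distinction, here is an illustrative sketch (hypothetical code, not from the thread) of the column-major layout being suggested: compressed sparse column (CSC) stores the non-zeros column by column, so column-oriented operations walk contiguous memory.

```python
def csc_matvec(vals, row_idx, col_ptr, x, n_rows):
    """y = A @ x with A stored in compressed-sparse-column (CSC) form:
    vals[k] lives in row row_idx[k]; column j owns entries
    vals[col_ptr[j]:col_ptr[j + 1]]."""
    y = [0.0] * n_rows
    for j in range(len(col_ptr) - 1):          # one pass per column
        for k in range(col_ptr[j], col_ptr[j + 1]):
            y[row_idx[k]] += vals[k] * x[j]
    return y

# A 3x4 matrix with three non-zeros, stored by column:
# column 0 holds 1.0 (row 0), column 1 holds 3.0 (row 2),
# column 2 holds 2.0 (row 1), column 3 is empty.
vals, row_idx = [1.0, 3.0, 2.0], [0, 2, 1]
col_ptr = [0, 1, 2, 3, 3]

y = csc_matvec(vals, row_idx, col_ptr, [1.0, 2.0, 3.0, 4.0], 3)
assert y == [1.0, 6.0, 6.0]
```

Row-major (CSR) is the mirror image, with a pointer array per row instead of per column; converting user code between the two mostly means transposing which index array is compressed.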
Thanks @sakrejda for the link.
I recall @bgoodri did come up with some proposals… is that documented somewhere?
Doing something about these sparse matrices would be a huge win. As I said, I got a problem to sample 10x faster, and the matrices were not even that large, just full of zeros.
So yes, if you have stuff for Stan Math, then we should seriously consider bringing it into math.
The original reason the code was made as slow as it is was opposition to handing users a foot-gun, so adding Ben’s version to math would mean re-hashing that debate. Once tuples are in, I think we could build a type on top of them that carries the sparse matrix component vectors and is constructed in transformed data; then we could officially skip all the error checking.
Hmmm… I recall. Have we considered adding an explicit unsafe version? If “unsafe” is made clear in the function name, then one could presumably skip the error checking, no?
Has that idea been discussed? I don’t think it is ideal to have critical speed fixes in rstanarm alone, and tuples are still some way off, as I understand it.
It wasn’t a huge change in speed. It might have actually been slower in doubles, but we gained a bit more speed by relying on the autodiff instead of
Yes. That’s why I always tell people not to do that and why we have a sparse matrix-vector multiply function.
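The sparse multiply mentioned here is Stan’s `csr_matrix_times_vector(m, n, w, v, u, b)`, which takes the matrix as compressed-sparse-row component vectors: `w` the non-zero values, `v` their column indices, and `u` the position in `w` where each row starts. As a rough sketch of its semantics in plain Python (Stan itself uses 1-based indices; this sketch is 0-based):

```python
def csr_matrix_times_vector(m, n, w, v, u, b):
    """y = A @ b for an m x n matrix A given in CSR form:
    row i owns values w[u[i]:u[i + 1]] in columns v[u[i]:u[i + 1]]."""
    y = [0.0] * m
    for i in range(m):
        for k in range(u[i], u[i + 1]):
            y[i] += w[k] * b[v[k]]
    return y

# 3x4 matrix with non-zeros 1.0 at (0,0), 2.0 at (1,2), 3.0 at (2,1).
w, v, u = [1.0, 2.0, 3.0], [0, 2, 1], [0, 1, 2, 3]
b = [1.0, 2.0, 3.0, 4.0]
assert csr_matrix_times_vector(3, 4, w, v, u, b) == [1.0, 6.0, 6.0]
```

These component vectors (`w`, `v`, `u`) are exactly what the tuple-based sparse type discussed above would bundle together and build once in transformed data.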
Getting sparse matrices is backed up behind tuples, functions, and ragged arrays. So I doubt it’s going to happen in 2018. We could reprioritize if someone has a workable sparse matrix/vector proposal—we are currently stuck without a design.
When you say proposal what exactly do you mean? Is there an example for, say, ragged arrays (or MPI or ODEs or something else recent)?
Ragged arrays: https://github.com/stan-dev/stan/wiki/Ragged-array-spec
It’s not particularly coherent because I just pasted a new proposal in front of the old one (everything before “What is a ragged array?”). You’ll also see that what we’ll have to do to declare these things in general will be horrendous.
This seems like a fine state of things, because between tuples and ragged arrays you get many of the tools you need to implement sparse matrices anyway…
I took this opportunity to flesh out the
to match our current thinking. I could use some feedback on the actual syntax for tuple types and expressions.
On the other hand, the
is very much a work in progress.