MPI Design Discussion

seantalts · September 14, 2017, 6:58pm

Okay, I think I understand a lot more now than when I started. Within this map_rect API, here’s what I might like to see in the design:

low-level independent functions:

flatten a vector<vector<T>>
unflatten a vector<vector<T>>
Send data to all nodes (maybe this is just broadcast) (not cached)
cached scatterv
maybe even cached scatterv for a doubly-nested vector (cached)

map_rect_mpi would look something like:

broadcast shared params & data
map across thetas
var-related memory management (not sure where this goes exactly)

map looks like:

reduce(std::vector.push_back composed with F, …)

Reduce looks like:

distributes data to map over with cached scatterv
Applies reduction op
gathers results with gatherv
Finish reductions by applying reduction op to gathered data on root node

reduce functor for map_rect might involve separate entry and exit “context manager” style fixtures as independent functions or classes:

start nested & un-nest / cleanup autodiff
perhaps the var memory stuff can go here?

Does this make sense? I’m happy to provide more example code as well if that helps. I was originally starting to do some refactoring on my own as its own example but figured I should put this out here first.

Topic		Replies	Views
Parallel reduce in the Stan language Developers	12	1170	April 11, 2019
Stan SIMD & Performance Algorithms	23	4019	January 23, 2020
Map_rect & data for the case of MPI Developers	3	530	January 23, 2020
Stan++/Stan3 Preliminary Design Developers	97	4493	June 12, 2018
Parameter packs for (de)serialization Developers	20	4267	August 20, 2018

MPI Design Discussion

Related Topics