Multiple calls to map_rect

Kevin_Reuning · November 30, 2018, 12:20am

I have a sort of general question that might be a bit dumb. I am trying to improve the efficiency of a relatively complicated model that can be best described as a MIMIC model that is also jointly estimated with an multinomial logistic model.

I am trying to improve the speed it estimates at by using map_rect but putting a lot of it into the map_rect function requires a lot of manipulation of the datasets (packing them and then unpacking them). I am curious if it is feasible/wise to call map_rect multiple times within a model of it that is a bad idea.

Thanks for any help. I can provide some models if necessary.

bgoodri · November 30, 2018, 12:21am

That’s fine.

stemangiola · May 6, 2019, 7:27am

I would like to expand on this question (@wds15) .

What about the performances of doing (?)

map_rect({
do A;
do B;
})

versus

map_rect({
do A;
})

map_rect({
do B;
})

is it equivalent, or the option (1) is much better than option (2)? Or depends on some condition?

Thanks.

wds15 · May 6, 2019, 7:17pm

I am assuming that A and B have many sub-tasks… then its best is to run a single map_rect, but with a random permutation of all jobs from A&B.

In all honesty, I am only giving you judgements on all of this based on my knowledge of having implemented it. So what map_rect does:

split your N jobs into B blocks of equal size N/B whenever you request to use B CPUs
run all B blocks in parallel as a chunk

That’s about the simplest queueing you can do. It works good if you can assume to have roughly equal work to do per chunk. A random order is essentially assumed.

The other rule of thumb is that parallelization is costly and you should reduce the number of map_rect calls since it adds overhead.

However… you can likely save your efforts here as a much improved version of this should be landing in stan at some point; though it’s a question of when - and here I am hesitant on a prediction.

stemangiola · May 7, 2019, 4:06am

Please! Variable packaging and un-packaging is a nightmare for complex models and very error prone!

Topic		Replies	Views
Practical questions on map_rect usage General performance	4	795	February 26, 2019
Map_rect, multithreaded on parameters only Algorithms	2	690	November 12, 2018
Catastrophic performance drop with map_rect Modeling paralellization	4	536	May 9, 2023
Using map_rect with multiple Gaussian processes (sampling time) Modeling paralellization	6	455	February 26, 2021
Help in improving model Modeling	4	329	July 28, 2020

Multiple calls to map_rect

Related topics