Dynamic threads_per_chain as chains finish

asceles · October 13, 2020, 1:30pm

In some cases, one or more chains are significantly slower than the others: they get stuck during warmup but eventually adapt appropriately. The total wait time is then dominated by those chains. With within-chain parallelization, it would be nice if threads_per_chain adjusted dynamically as the faster chains finish, so that the idle CPUs could speed up the slower chains later in their life cycle.

bbbales2 · October 15, 2020, 12:03pm

You could try running more threads than you have cores, so that when some chains finish the remaining ones just pick up the freed cores. The threading library is supposed to keep this scheduling sane, though I don't know whether oversubscribing like this adds a big overhead.
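To make the oversubscription idea concrete, here is a rough sketch of the arithmetic in Python. The helper name `oversubscribed_threads_per_chain` and its `factor` parameter are made up for illustration and are not part of Stan or any interface; the result is simply a number you could pass as threads_per_chain. Whether the extra threads cost much while all chains are still running is untested here.

```python
import math


def oversubscribed_threads_per_chain(n_cores, n_chains, factor=1.0):
    """Suggest a threads_per_chain value that oversubscribes the machine.

    Hypothetical helper, not a Stan API.
    factor = 1.0 splits the cores evenly across chains (no oversubscription).
    factor = n_chains lets every chain use all cores once the others finish.
    """
    if n_cores < 1 or n_chains < 1 or factor <= 0:
        raise ValueError("n_cores, n_chains and factor must be positive")
    # Total threads requested across all chains.
    total_threads = factor * n_cores
    per_chain = math.ceil(total_threads / n_chains)
    # A single chain gains nothing from more threads than there are cores.
    return max(1, min(n_cores, per_chain))


if __name__ == "__main__":
    # 8 cores, 4 chains: even split, 2x oversubscribed, fully oversubscribed.
    for factor in (1, 2, 4):
        print(factor, oversubscribed_threads_per_chain(8, 4, factor))
```

With 8 cores and 4 chains, an even split gives 2 threads per chain, while a factor of 4 requests 8 threads per chain (32 in total), so a single straggling chain could in principle use the whole machine once the other three are done.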
