Help with multi-threading a simple ordinal probit model using reduce_sum

andrjohns · May 24, 2020, 3:10pm

Yeah, that is kinda expected in this case. Since reduce_sum only slices its first argument, theta isn’t getting sliced, so the entire (5000 x 3) matrix gets copied to each thread. This copy time can outweigh the performance benefit of splitting the loop between threads.

At least that’s how I understand things. @wds15, does that sound right?
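
To make the trade-off above concrete, here is a rough back-of-the-envelope sketch in Python. It is not how Stan works internally, just a toy cost model: it assumes the loop splits evenly across threads and that each thread pays one full copy of the unsliced argument. The function names and the timings are hypothetical and chosen only for illustration.

```python
def parallel_time(work_time: float, copy_time: float, n_threads: int) -> float:
    """Toy estimate of wall time when the loop is split evenly across threads.

    Assumption (not measured): each thread pays the full copy cost of the
    shared, unsliced argument once, on top of its share of the loop.
    """
    return work_time / n_threads + copy_time


def speedup(work_time: float, copy_time: float, n_threads: int) -> float:
    """Serial time divided by the estimated parallel time; below 1 means slower."""
    return work_time / parallel_time(work_time, copy_time, n_threads)


if __name__ == "__main__":
    # Hypothetical timings in arbitrary units, not taken from the model in this thread.
    for copy_time in (0.0, 0.25, 1.0):
        print(copy_time, round(speedup(1.0, copy_time, 4), 2))
```

Under this model, once the copy cost is comparable to the total loop time, adding threads no longer helps and the threaded run can end up slower than the serial one, which would be consistent with the slowdown reported here.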
