I haven’t checked the notebook above, but the table doesn’t mention it: in general, I also see the sampling run time of map_rect-based Stan models depend quite a bit on how the data is split into shards (i.e. on the number of shards). There is at least one thread in this forum providing examples of this kind…