I think this is somewhat outdated. IIRC, map_rect
handles within-chain parallelism.
It’s not so much parallelism between chains, since each chain is independent of the others. There’s no communication between the chains.
It’s a little more involved, but not by that much. If you aren’t bothering with map_rect
, then you can tell your Stan implementation to create as many chains as you have cores available. That may involve, say, passing $SLURM_CPUS_ON_NODE
to Stan somehow, which will depend on which implementation you use, e.g. RStan, PyStan, CmdStan, etc.