Chain Parallelization in Stan

I think this is somewhat outdated. IIRC, map_rect handles within-chain parallelism.

It’s not so much parallelism between chains, since each chain is independent of the others. There’s no communication between the chains.

It’s a little more involved, but not by that much. If you aren’t bothering with map_rect, then you can tell your Stan implementation to create as many chains as you have cores available. That may involve, say, passing $SLURM_CPUS_ON_NODE to Stan somehow, which will depend on which implementation you use, e.g. RStan, PyStan, CmdStan, etc.