Looking for an optimization for running multiple Stan models in parallel with OpenMP

You could run each chain as its own call with chains = 1 in parallel, but then you would have to reassemble the chains yourself to do convergence diagnostics, etc.
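To give a rough idea of what "reassembling the chains yourself" involves, here is a minimal sketch in plain Python that takes the draws of one scalar parameter from several independently run chains and computes the classic split R-hat. The function name `split_rhat` and the simulated draws are made up for illustration; in practice the lists would hold draws read back from each chains = 1 run. This is the textbook formula, not necessarily the exact variant your Stan interface reports.

```python
import math
import random
import statistics


def split_rhat(chains):
    """Classic split R-hat for one scalar parameter.

    `chains` is a list of equal-length lists of post-warmup draws,
    one list per independently run chain (hypothetical input format).
    """
    n = len(chains[0]) // 2
    # Split every chain in half so that drift within a chain is also detected.
    halves = []
    for draws in chains:
        halves.append(draws[:n])
        halves.append(draws[n:2 * n])
    # W: average within-chain variance; B: n times the variance of chain means.
    within = statistics.mean([statistics.variance(h) for h in halves])
    between = n * statistics.variance([statistics.mean(h) for h in halves])
    var_plus = (n - 1) / n * within + between / n
    return math.sqrt(var_plus / within)


# Simulated stand-ins for draws collected from four separate chains = 1 runs.
rng = random.Random(0)
# Four chains that all sample the same distribution.
mixed = [[rng.gauss(0.0, 1.0) for _ in range(1000)] for _ in range(4)]
# Four chains stuck around different values (0, 5, 10, 15).
stuck = [[rng.gauss(5.0 * k, 1.0) for _ in range(1000)] for k in range(4)]

rhat_mixed = split_rhat(mixed)
rhat_stuck = split_rhat(stuck)
print(f"R-hat, well mixed: {rhat_mixed:.3f}")
print(f"R-hat, stuck:      {rhat_stuck:.3f}")
```

The well-mixed chains should give a value close to 1, while the stuck chains should give a value far above it. You would have to repeat this for every parameter of interest, which is the bookkeeping you take on when you run the chains separately.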

More generally, if you are getting variance in run time on the order of days, then your Markov chain is not efficient enough for you to do anything reliable with the draws anyway.