Different Outputs in RStan vs. PyStan

extra_compile_args=['-O3', "-march=native", "-mtune=native"]

Are these not already set by default? They could give some easy performance benefits to Pystan users, since the flags enable the maximum level of compiler optimisation and SIMD vectorisation.

(Apologies if explaining what you already know!)