extra_compile_args=['-O3', '-march=native', '-mtune=native']
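Are these not already set by default? They could give PyStan users some easy performance gains: -O3 selects the highest standard optimisation level in GCC and Clang, while -march=native and -mtune=native let the compiler target the instruction set (including SIMD extensions) and tuning of the machine doing the build.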
(Apologies if I am explaining what you already know!)
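
In case it helps with the "already set by default?" question, here is a rough sketch for checking which of these flags the Python build itself already carries. On Unix-like systems, setuptools/distutils typically reuse the compiler flags CPython was built with, so this only covers that baseline and not anything PyStan adds on top. The helper names default_flags and missing_flags are made up for illustration.

```python
import sysconfig


def default_flags():
    """Collect the compiler flags recorded in this Python build.

    The variables can be unset (for example on Windows), in which
    case an empty list is returned.
    """
    flags = []
    for var in ("CFLAGS", "OPT"):
        value = sysconfig.get_config_var(var)
        if value:
            flags.extend(value.split())
    return flags


def missing_flags(wanted, present):
    """Return the wanted flags that do not appear in present, in order."""
    return [flag for flag in wanted if flag not in present]


if __name__ == "__main__":
    wanted = ["-O3", "-march=native", "-mtune=native"]
    # Whatever is printed here would still need to be passed explicitly,
    # e.g. via extra_compile_args, assuming the baseline is all that is used.
    print(missing_flags(wanted, default_flags()))
```

On many Linux builds I would expect -O3 to be present already and the two native flags to be missing, but that depends on how the interpreter was compiled, so it is worth running locally.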