Usually that is waaaayyyy toooo much! With Stan often the 4 chains with 1000&1000 are sufficient.
Otherwise, get a i9 with lots of CPU cache and more CPU cores if possible… then you can benefit from reduce_sum parallelism which is hopefully soon in brms.