This looks like a posteriordb sorta thing: Beta-release Bayesian Posterior Database
Maybe you can hook into it and pull models from there to test for correctness.
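To make "test for correctness" a bit more concrete, here is a rough sketch of the kind of check I have in mind: compare the mean of your sampler's draws for one parameter against reference draws for the same posterior. The function name `means_agree` and the `z_threshold` default are made up for illustration, and this does not use posteriordb's actual API; it only assumes you already have both sets of draws as plain lists of floats. It also treats the draws as independent, which MCMC draws are not, so a real check would want effective sample sizes instead of raw counts.

```python
import math
import statistics


def means_agree(test_draws, reference_draws, z_threshold=4.0):
    """Hypothetical helper: True if the two sample means are within
    z_threshold combined standard errors of each other.

    Assumes independent draws; with autocorrelated MCMC output the raw
    sample size overstates precision, so this is only a rough screen.
    """
    # Standard error of each sample mean
    se_test = statistics.stdev(test_draws) / math.sqrt(len(test_draws))
    se_ref = statistics.stdev(reference_draws) / math.sqrt(len(reference_draws))

    # Standard error of the difference between the two means
    combined_se = math.sqrt(se_test**2 + se_ref**2)

    diff = abs(statistics.fmean(test_draws) - statistics.fmean(reference_draws))
    return diff / combined_se < z_threshold
```

Something like `means_agree(my_draws, ref_draws)` per parameter would flag a sampler that is clearly off, though it says nothing about variances or tails.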
The advantage of this over default Stan would be TensorFlow’s scalability, right? So big-data stuff?