Fantastic! In case you haven’t seen, we’re also working on MPI at the same time so that we can parallelize large likelihoods.
With both GPUs and distributed multi-core, we should be in business for seriously speeding up big models, especially since the two speedups are pretty much independent of one another.