MPI Design Discussion

Ok, I have shown the good performance already a few times of MPI, but to this end mostly with synthetic examples. I just compiled a realistic example

  • Hierarchical ODE based model
  • 1300 subjects
  • real world data set

The running time on a single core takes 2.6 days to finish. I have setup things such that the 10 & 15 core run were on a single machine while the 20, 40 & 80 core run were distributed in blocks of 10 (so 2, 4 and 8 machines) onto the cluster which is networked using infiniband. The key question to me was how well the performance scales and the result is stunning. The 64h on a single core go down to about 1h which is a 62x speedup, but look for yourself.

… and all results match exactly.

5 Likes