Trying to find a better way to speed up a hierarchical logistic model; reduce_sum only helped a little

Oh, I just noticed that that demo doesn’t use rows_dot_product, which I recently learned provides an additional speedup. Lemme edit it…
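
For anyone following along, here is a rough sketch of the kind of pattern where rows_dot_product tends to help in a hierarchical logistic model. It is not the demo itself: every variable name, the priors, and the non-centered parameterization are assumptions made up for illustration. The idea is to build the whole linear predictor in one vectorized call by pairing each row of the design matrix with the coefficient row of that observation's group.

```stan
data {
  int<lower=1> N;                        // observations (hypothetical names throughout)
  int<lower=1> K;                        // predictors
  int<lower=1> J;                        // groups
  matrix[N, K] X;                        // design matrix
  array[N] int<lower=1, upper=J> group;  // group index for each observation
  array[N] int<lower=0, upper=1> y;      // binary outcome
}
parameters {
  vector[K] mu;                          // population-level coefficients
  vector<lower=0>[K] tau;                // between-group scales
  matrix[J, K] z;                        // standardized group deviations
}
transformed parameters {
  // non-centered group-level coefficients, one row per group
  matrix[J, K] beta = rep_matrix(mu', J) + diag_post_multiply(z, tau);
}
model {
  // placeholder priors, assumed for the sketch
  mu ~ std_normal();
  tau ~ std_normal();
  to_vector(z) ~ std_normal();
  // beta[group] is an N x K matrix holding each observation's coefficient row;
  // rows_dot_product returns the N-vector of row-wise dot products with X
  y ~ bernoulli_logit(rows_dot_product(X, beta[group]));
}
```

The array declarations use the newer `array[N] int` syntax, so this assumes a reasonably recent Stan version. Whether it beats a per-observation loop in a given model is something to check by timing both.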