As Ben notes the cost of HMC is proportional to the cost of the gradient calculation times the number of gradient calculations needed to generate a new proposal. The former scales in readily-analyzed ways with the amount of the data but the latter does not. See Chains stuck when use larger dataset, but not smaller for some discussion. Exchangeable Gaussian mixture models are particularly poorly-identified and some of the pathologies can be amplified as more data is introduced.