Speeding up MCMC by running on AWS?

JimBob · February 14, 2022, 11:34am

I’ve recently been running some models in brms that take a prohibitively long time to run (only getting a couple of % through their iterations overnight, for example). I think the reason is simply because these models are highly intensive in terms of memory requirements as e.g., they are ordinal models with repeated measures across participants, and many other multilevel components (random effects and intercepts) to enable poststratification. for thousands of respondents.

My current setup is a 16GB RAM computer and I can see this is getting maxed out when I run these models. In general, can I expect that using a cloud service such as AWS would increase the computational power sufficiently that these very long models would become more practically feasible? What kind of speed increases can I expect, or am I misunderstanding how cloud computing and MCMC sampling works, and a powerful cloud computer won’t actually help?

torkar · February 14, 2022, 12:47pm

I’ve been using AWS before with good results. Adding enough RAM and enough virtual CPUs can speed things up, and then you can use threading also to “chop up” the chains on several CPUs, i.e., see: Running brms models with within-chain parallelization

It’s easy to set up an environment on AWS, but since then I’ve moved to a similar set up offered to employees at Swedish universities so whenever I run bigger jobs I run them in the cloud (it’s neither sane nor cool to run it on a laptop :)

JimBob · February 14, 2022, 2:56pm

Thanks @torkar . Nice that it should be simple to do - and yes I’ve been using within chain parallelization on my laptop, but have had to buy a massive and noisy cooling pad to prevent overheating! Does AWS assign a certain number of cores or how does that work? Is it like, X amount of cores with lots of RAM available in each or something else? Sorry that these are kind of noob questions but I haven’t used AWS before

torkar · February 14, 2022, 2:58pm

You get what you pay for :) So you pick what you want and then you pay for using the resources.

JimBob · February 14, 2022, 3:07pm

Great - sounds like I’ll need to sign up and see what’s on offer then! Thanks again

Max_Mantei · February 14, 2022, 3:26pm

Hey, did you already check out this: Setting Up Rstudio and Cmdstanr on AWS EC2 – The Stan Blog ?

Also, I think you want to go for the Compute optimized instances - Amazon Elastic Compute Cloud with Stan.

JimBob · February 15, 2022, 12:09pm

Great - thanks for these I will check them out. I had not come across them previously

Topic		Replies	Views
Strategies for fitting a large model via AWS EC2 services General fitting-issues	17	1826	July 27, 2022
Choosing the right AWS instance General rstan , performance	2	1481	December 7, 2020
Error with brms (that calls rstan) when using mclapply() on an AWS instance brms fitting-issues	2	754	December 21, 2022
Best CPU for RStan/brms General	14	1707	February 12, 2025
Seamlessly Running Stan Much Faster (using AWS/HPC etc) General	3	1052	May 29, 2021

Speeding up MCMC by running on AWS?

Related topics