Apple's new M1 processors and Stan

paul.buerkner · March 9, 2021, 4:29pm

Hey all,

I just wanted to ask if anybody has experience already with Apple’s new M1 processors (Apple M1 Chip - Apple) and how they get along with Stan?

Max_Mantei · March 9, 2021, 4:38pm

Hey Paul!

No experience, just linking the following:

Cheers,
Max

paul.buerkner · March 9, 2021, 4:43pm

thanks! I should learn to use the search function better :-D

rok_cesnovar · March 9, 2021, 4:49pm

One more small benchmark here: Detect the use of the M1 ARM-based CPU and suggest adding CXX · Issue #365 · stan-dev/cmdstanr · GitHub

In general, ARM processors are super nice to Stan, even more so with parallelization (using reduce_sum). My more recent experience is with Linux ARM, but it is the same story here. On Linux ARM we have seen models run anywhere from 50% faster to 3x faster (with the same number of cores).

I think it's mostly due to better use of caches.

wds15 · March 9, 2021, 7:21pm

It’s funny that the intel tbb makes things fly on arm!

awellis · March 12, 2021, 9:19pm

When can we expect to be able to install Rstan natively on M1 Macs? Will this coincide with the native build of R 4.1?

rok_cesnovar · March 12, 2021, 9:26pm

That is correct.

Until then, the only option for compiling Stan natively in R is CmdStanR.

awellis · March 12, 2021, 9:28pm

ok, thanks.

There’s really no way to get Rstan? I’m running R dev for aarch64, and it works fine so far.

rok_cesnovar · March 12, 2021, 9:35pm

There is an experimental version of R for native use on m1: https://mac.r-project.org/

With that you could build rstan from source natively. But I have no idea how safe that is and if it works for rstan.

Edit: Oh, I guess this is what you meant by R dev on aarch64.

awellis · March 12, 2021, 9:39pm

yes, that’s the R build I’m using. I’ll try installing Rstan

awellis · March 13, 2021, 10:23pm

I’ve installed Rstan, Rstanarm, brms. So far, Stan (via brms) seems to compile and sample about 3x faster on my M1 Macbook Air than on my work laptop (2 year old 13" Macbook Pro, quad-core i7).

mike-lawrence · March 13, 2021, 11:25pm

Cool! How many cores does parallel::detectCores() return for you?

awellis · March 13, 2021, 11:26pm

parallel::detectCores()
[1] 8

mike-lawrence · March 13, 2021, 11:29pm

I wonder how the scheduler handles the heterogeneity of there being 4 performance cores and 4 efficiency cores. Have you tried parallel chains?
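For anyone who wants to check the core split from Python rather than R, here is a minimal sketch. It assumes the macOS sysctl keys `hw.perflevel0.physicalcpu` (performance cores) and `hw.perflevel1.physicalcpu` (efficiency cores) are available, which may not hold on older macOS versions or on Intel Macs. The helper names are made up for illustration, and on other systems the function only reports the logical core count.

```python
import os
import platform
import subprocess
from typing import Dict, Optional


def _sysctl_int(key: str) -> Optional[int]:
    """Return an integer sysctl value, or None if the key cannot be read."""
    try:
        out = subprocess.run(
            ["sysctl", "-n", key], capture_output=True, text=True, check=True
        ).stdout
        return int(out.strip())
    except (OSError, subprocess.CalledProcessError, ValueError):
        return None


def core_counts() -> Dict[str, Optional[int]]:
    """Report logical cores plus, on macOS only, performance/efficiency cores."""
    counts: Dict[str, Optional[int]] = {
        "logical": os.cpu_count() or 1,
        "performance": None,
        "efficiency": None,
    }
    if platform.system() == "Darwin":
        # Assumed key names; they may be absent on older macOS or Intel Macs.
        counts["performance"] = _sysctl_int("hw.perflevel0.physicalcpu")
        counts["efficiency"] = _sysctl_int("hw.perflevel1.physicalcpu")
    return counts


print(core_counts())
```

This only reports how many cores of each kind exist. It says nothing about which cores the scheduler actually assigns to each chain.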

awellis · March 13, 2021, 11:32pm

Yes. My impression is that with 4 chains in parallel I’m running on 4 cores. When I ran 6 chains, 4 ran at once and the remaining 2 ran afterwards. I’ve only been playing around with this for half an hour, though.

It’s been hard to tell, tbh, because sampling is so incredibly fast.
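The "4 first, then the remaining 2" pattern is what you would expect if chains are handed to a fixed pool of 4 workers. A rough sketch of that arithmetic, assuming all chains take about the same time (the function name `chain_waves` is hypothetical, and a real worker pool starts the next chain as soon as any worker is free rather than in strict waves):

```python
from typing import List


def chain_waves(n_chains: int, n_workers: int) -> List[List[int]]:
    """Group chain ids into batches that run together on n_workers workers."""
    if n_chains < 0 or n_workers < 1:
        raise ValueError("need n_chains >= 0 and n_workers >= 1")
    chain_ids = list(range(1, n_chains + 1))
    # Slice the chain ids into consecutive groups of at most n_workers.
    return [chain_ids[i:i + n_workers] for i in range(0, n_chains, n_workers)]


print(chain_waves(4, 4))  # [[1, 2, 3, 4]]
print(chain_waves(6, 4))  # [[1, 2, 3, 4], [5, 6]]
```

Under that assumption, 6 chains on 4 workers take roughly twice as long as 4 chains, even though only 2 extra chains were added.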

jbaranowski · April 5, 2021, 7:18am

I have a new M1 Mac mini. If someone provides me with Python-based test code, I can volunteer to run it.
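A starting point for such test code could look like the sketch below. It is not an agreed benchmark. `time_runs` and `benchmark_stan` are hypothetical helper names, the model file and data are whatever you supply, and `benchmark_stan` assumes CmdStanPy and CmdStan are already installed.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_runs(fn: Callable[[], object], repeats: int = 3) -> Dict[str, float]:
    """Call fn `repeats` times and summarize wall-clock seconds."""
    if repeats < 1:
        raise ValueError("repeats must be at least 1")
    timings: List[float] = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        timings.append(time.perf_counter() - start)
    return {
        "min": min(timings),
        "median": statistics.median(timings),
        "max": max(timings),
    }


def benchmark_stan(stan_file: str, data: dict, chains: int = 4) -> Dict[str, float]:
    """Time sampling only; compilation happens once, before the timed runs."""
    # Imported lazily so time_runs stays usable without CmdStanPy installed.
    from cmdstanpy import CmdStanModel

    model = CmdStanModel(stan_file=stan_file)
    return time_runs(
        lambda: model.sample(
            data=data, chains=chains, parallel_chains=chains, seed=1
        )
    )
```

Running the same `benchmark_stan(...)` call with the same model and data on an M1 and on an Intel machine would give comparable sampling times. Fixing the seed keeps the runs as similar as possible.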

jbaranowski · April 5, 2021, 11:05am

What might be interesting for some, when using CmdStan on M1:

Compilation is very fast. I have no idea why, but it is much faster than on an i9.
The last Big Sur update broke my CmdStan installation: “Dyld: Library not Loaded” errors when running from Python, and, when running from the terminal, “error: half args and returns was disabled in PCH file but is currently enabled” and “error: PCH file was compiled for the target ‘x86_64-apple-macosx11.0.0’ but the current translation unit is being compiled for target”. Reinstalling CmdStanPy and CmdStan solved the issue.
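The PCH error points at an architecture mismatch between an earlier build and the current one. Whether that was the actual cause here is not known, but a quick diagnostic sketch is to compare the architecture in the error's target triple with the one the Python process reports. The helper names are hypothetical. `platform.machine()` typically reports `arm64` for a native process on Apple Silicon and `x86_64` for one running under Rosetta.

```python
import platform


def triple_arch(target_triple: str) -> str:
    """Return the architecture part of a triple such as 'x86_64-apple-macosx11.0.0'."""
    return target_triple.split("-", 1)[0]


def same_arch(a: str, b: str) -> bool:
    """Compare architectures, treating 'aarch64' and 'arm64' as the same."""
    aliases = {"aarch64": "arm64"}
    return aliases.get(a, a) == aliases.get(b, b)


pch_target = "x86_64-apple-macosx11.0.0"  # copied from the error message
interpreter_arch = platform.machine()     # architecture of this Python process

if same_arch(triple_arch(pch_target), interpreter_arch):
    print(f"Both are {interpreter_arch}; the mismatch lies elsewhere.")
else:
    print(
        f"Mismatch: PCH built for {triple_arch(pch_target)}, "
        f"Python is running as {interpreter_arch}."
    )
```

If the two differ, rebuilding CmdStan from the same kind of terminal or interpreter that will run it is consistent with the reinstall fix reported above.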

jroon · October 29, 2021, 3:15pm

Has anyone tried Stan on the new M1 Pro or M1 Max chips yet?

Intriguing looking results from some Python test runs here:

tinosai · August 14, 2022, 12:59am

A bit late to the game, but I compiled and installed httpstan and pystan on Mac M1 Max. It is remarkably faster than anything I have tried before on non-arm architecture.
I don’t have a benchmark though.

Bob_Carpenter · August 15, 2022, 9:24pm

Thanks for following up @tinosai.

You might want to try CmdStanPy. It’s lighter weight than httpstan and should be faster. But it doesn’t include log density or gradient calculations if you need those. I’d be curious what the speed comparison looked like on an M1.

But what I really want to know is how the M2 MacBook Pros perform. I’m about to get a new notebook through work and that seems like the obvious choice.
