Multicore Speedups are different between models

betanalpha · July 12, 2017, 6:34pm

This can happen, but if the sampler is well-behave then the differences from chain to chain should be small.

You can do this only if each of the individual chains are well-behaved.

When you run MCMC you have to be very careful that you’re getting an accurate answer, which is hard because there’s no way to prove that a given sampler will be accurate for your specific model. Instead all we have are conditions that we know should not happen for well-behaved chains (i.e. we have necessary conditions but not sufficient ones).

One of the common ways that these pathologies manifest is chains behaving differently depending on where they are in parameter space. This is why we run many chains and compare them with R-hat – if behavior of even one chain deviates from the behavior of the others then we should doubt the validity of all of the chains.

So if you run multiple chains and you see drastically different speeds then it’s likely your sampler can’t handle your model. To verify this check out the R-hats as well as all of the other diagnostics (especially divergences). If the diagnostics look fine and the speed differences aren’t huge (between 1 and 2) then the variation is likely due to small differences in adaptation or core performance, in which case the variations can be ignored.

Topic		Replies	Views
Four chains vs four jobs General cmdstan	28	216	June 19, 2024
Different execution times between chains General rstan	4	553	May 25, 2022
Inconsistent chain speed - does this give a clue about the problem? Algorithms optimization	10	4607	July 20, 2018
Stan on M4 Mac? General	16	664	February 6, 2025
Multi-chain vs single-chain Developers	7	2180	March 7, 2023

Multicore Speedups are different between models

Related topics