Correct way to use MPI with cmdstanpy

mitzimorris · August 26, 2020, 6:04pm

damn - we don’t yet have the correct invocation needed to run MPI - cf discussion in this issue:

github.com/stan-dev/cmdstanpy

Best way to use MPI with cmdstanpy
opened 04:12PM - 11 Aug 20 UTC by grburgess · closed 04:58PM - 19 Mar 21 UTC · duplicate

#### Summary:
What is the proper workflow for running a python script when one wants to use the MPI threading for reduce_sum

#### Description:
I have a python script which will pass some data to a stan model.

```python
import cmdstanpy

model = cmdstanpy.CmdStanModel(
    stan_file=stan_file,
    cpp_options={"STAN_MPI": True},
)

n_chains = 4
n_procs = 12

model.sample(..., parallel_chains=n_chains, threads_per_chain=n_procs)
```

Do I run this with

```bash
mpirun -np 48 python my_script.py
```

? So far, this is all crashing locally although all the mpi checks in Stan-math pass.

#### Current Version:
CmdStan = 2.24
CmdStanPy = 0.9.5

the implementation should be really pretty simple - PRs welcome!

if you’re running reduce_sum without MPI, there’s some documentation here:
https://cmdstanpy.readthedocs.io/en/latest/sample.html#example-high-level-parallelization-with-reduce-sum
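
for the non-MPI case, here's a minimal sketch of what that setup can look like. the helper `threads_per_chain` and the wrapper `sample_with_threads` are names made up for illustration, and splitting the cores evenly across chains is just one assumed policy, not something CmdStanPy does for you:

```python
import os


def threads_per_chain(total_cores: int, chains: int) -> int:
    """Split a core budget evenly across chains (at least one thread each)."""
    return max(1, total_cores // chains)


def sample_with_threads(stan_file: str, data_file: str, chains: int = 4):
    """Compile with threading enabled and run all chains in parallel."""
    # Imported here so the helper above can be used without CmdStanPy installed.
    from cmdstanpy import CmdStanModel

    # STAN_THREADS enables within-chain threading, which reduce_sum relies on.
    model = CmdStanModel(stan_file=stan_file, cpp_options={"STAN_THREADS": True})
    n_threads = threads_per_chain(os.cpu_count() or 1, chains)
    return model.sample(
        data=data_file,
        chains=chains,
        parallel_chains=chains,
        threads_per_chain=n_threads,
    )
```

with 48 cores and 4 chains this gives 12 threads per chain, which matches the numbers in the issue above, but on a single machine and without `mpirun`.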

the script you run on each node can be the entire process, including model compilation - or you can compile the model ahead of time and pass in the exe file location. here’s an example script to run a single chain per node on a cluster:

# Use CmdStanPy to run one chain
# Required args:
# - cmdstanpath
# - model_exe
# - seed
# - chain_id
# - output_dir
# Optional arg:
# - data_file

import sys

from cmdstanpy import CmdStanModel, set_cmdstan_path

usage = """\
run_chain.py <cmdstan_path> <model_exe> <seed> <chain_id> <output_dir> (<data_path>)\
"""

def main():
    # argv[0] is the script name, so the five required args need len(sys.argv) >= 6
    if len(sys.argv) < 6:
        print("missing arguments")
        print(usage)
        sys.exit(1)
    a_cmdstan_path = sys.argv[1]
    a_model_exe = sys.argv[2]
    a_seed = int(sys.argv[3])
    a_chain_id = int(sys.argv[4])
    a_output_dir = sys.argv[5]
    a_data_file = None
    # the data file is the optional sixth argument
    if len(sys.argv) > 6:
        a_data_file = sys.argv[6]

    set_cmdstan_path(a_cmdstan_path)
    mod = CmdStanModel(exe_file=a_model_exe)
    fit = mod.sample(chains=1, chain_ids=[a_chain_id], seed=a_seed, output_dir=a_output_dir, data=a_data_file)
    print(fit)

if __name__ == "__main__":
    main()
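
to go with the script above, here's a rough sketch of how the per-chain command lines could be generated before handing them to whatever scheduler the cluster uses. `chain_commands` is a hypothetical helper, and all of the paths are placeholders. it reuses one seed with a different chain id per job, on the assumption that Stan uses the chain id to offset the random number stream:

```python
import sys


def chain_commands(cmdstan_path, model_exe, seed, n_chains, output_dir, data_file=None):
    """Build one run_chain.py command line per chain (chain ids start at 1)."""
    commands = []
    for chain_id in range(1, n_chains + 1):
        cmd = [
            sys.executable, "run_chain.py",
            cmdstan_path, model_exe, str(seed), str(chain_id), output_dir,
        ]
        # The data file is optional, matching the script's argument handling.
        if data_file is not None:
            cmd.append(data_file)
        commands.append(cmd)
    return commands


if __name__ == "__main__":
    # Placeholder paths: substitute the real locations on your cluster.
    for cmd in chain_commands(
        "/path/to/cmdstan", "/path/to/model_exe", 12345, 4,
        "/path/to/output", "/path/to/data.json",
    ):
        print(" ".join(cmd))
```

each printed line can then go into its own job submission; the chains all write their CSV files to the same output directory.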
