Proper Way to Include Stan Model in Python Package

jdossgollin · October 11, 2017, 5:15pm

Original question posted on Stack Overflow. Please let me know if I can copy and paste your answer to Stack Overflow, or you can do the same (for your credit).

I am writing a package in python (3.x) for working with a particular data set. The package reads this data, accesses it, plots, it, etc. I have also defined Bayesian models using stan and pystan which I would like to implement in this package.

pystan works by pointing to a particular .stan file which specifies the model:

def compile_model(filename, model_name=None, verbose=False, **kwargs):
'''
This will automatically cache models - great if you're just running a
script on the command line.
See http://pystan.readthedocs.io/en/latest/avoiding_recompilation.html
Code by Aki Vehtari: see
https://github.com/avehtari/BDA_py_demos/blob/new_pystan_demos/utilities_and_data/stan_utility.py
'''
from hashlib import md5
import pystan
import pickle

with open(filename) as f:
    model_code = f.read()
    code_hash = md5(model_code.encode('ascii')).hexdigest()
    if model_name is None:
        cache_fn = 'cached-model-{}.pkl'.format(code_hash)
    else:
        cache_fn = 'cached-{}-{}.pkl'.format(model_name, code_hash)
    try:
        sm = pickle.load(open(cache_fn, 'rb'))
    except:
        sm = pystan.StanModel(model_code=model_code)
        with open(cache_fn, 'wb') as f:
            pickle.dump(sm, f)
     else:
            if verbose:
                print('Using cached StanModel')
    return sm

I would like to know, in the context of building a package in python, what is the proper way to point to the filename so that I can call from a fitting function something like:

sm = compile_model(filename=stan_file)

(i.e. how to specify stan_file). My package’s directory structure is quite simple – simplified version is:

 .
├── LICENSE
├── README.md
├── pgkname
│   ├── PyFile.py
│   ├── PyFile2.py
│   └── __init__.py
│   stan
│    └── mod1.stan
└── setup.py

Clarification: the .stan file I would like to link to is ./stan/mod1.stan

ariddell · October 11, 2017, 5:26pm

You’re trying to build a package that one might distribute on PyPI, right?

The main obstacle to distributing a compiled Stan model is that it needs to be compiled for different platforms separately. A compiled model which was compiled on macOS will not work on Linux and vice-versa.

A simple way around this problem would involve compiling (and saving) the desired model during installation with some sort of setuptools hook.

I hope this answer helps. I wish I had better news.

jdossgollin · October 11, 2017, 5:56pm

Yes – I actually am OK with having the model compile the first time the user runs it. My problem is actually just a simpler one of how to correctly point to the stan file – do relative paths work?

ahartikainen · October 11, 2017, 6:10pm

Are the models static?

Why not include them as a variable/class (str) inside the package?

modulename.model_str.stan_model1 == "functions { ...."

edit. Relative path should work or you can even use fileobject. See how stanc function reads in the data.

Edit2. (If external files are needed) maybe the most robust way to read in a file is to use the absolute path ( os.path.join(modulename.__file__, 'model_dir', 'stan_model_file.stan')

github.com

stan-dev/pystan/blob/develop/pystan/api.py

#-----------------------------------------------------------------------------
# Copyright (c) 2013-2015, PyStan developers
#
# This file is licensed under Version 3.0 of the GNU General Public
# License. See LICENSE for a text of the license.
#-----------------------------------------------------------------------------

import hashlib
import io
import logging
import warnings

import pystan._api  # stanc wrapper
from pystan._compat import string_types, PY2
from pystan.model import StanModel

logger = logging.getLogger('pystan')


def stanc(file=None, charset='utf-8', model_code=None, model_name="anon_model",

This file has been truncated. show original

bgoodri · October 11, 2017, 6:22pm

I only know how to do this for R, but you might look at how Facebook does their Python version of prophet:

avehtari · October 17, 2017, 12:31pm

Oops, that code is not written by me. That code was written by Michael Betancourt and I picked it up from his case study and I didn’t notice that the particular file didn’t have the copyright notice included (licence info was only in the directory). I’ve now added the correct copyright and license (Michael Betancourt, BSD3) notice to that file.

jdossgollin · October 17, 2017, 3:02pm

Got it. Thanks for clarifying! I should have looked to the directory’s license file.

betanalpha · October 17, 2017, 9:53pm

To give credit where credit is due, @seantalts wrote that particular function based of an example by @ariddell!

David_Knowles · June 11, 2019, 11:55pm

Sorry to awaken a zombie thread but are there any developments on preferred ways to do this? Ideally I’d like to have the model get compiled as part of setuptools magic.

mitzimorris · June 12, 2019, 12:08am

hi David, I’m not sure if this is what you’re looking for, but you might want to check out the (very) Beta version of CmdStanPy - https://github.com/stan-dev/cmdstanpy
it’s distributed with script install_cmdstan - as part of the install process it compiles the CmdStan example model bernoulli.stan.
https://github.com/stan-dev/cmdstanpy/blob/master/bin/install_cmdstan

note - this package isn’t yet on PyPI - you have to get it from github -

pip install -e git+https://github.com/stan-dev/cmdstanpy

ariddell · June 12, 2019, 2:26am

Nobody has documented how to do this yet.

It might be useful to look at how Prophet does it. Whatever they are
doing clearly works. (I haven’t looked at it.)

David_Knowles · June 12, 2019, 2:50am

Thanks a lot for the pointers, I’ll take a look at prophet and cmdstanpy.

Topic		Replies	Views
How to use your own stan model -- full tutorial Modeling	15	3816	December 12, 2019
Is there a way to cache and reuse a compiled Stan model in pystan 3? PyStan	1	709	August 14, 2021
Pystan3, compiling a model and using with different data Developers pystan , compiler	4	650	October 11, 2023
Changing the location of caching models/results of Stan Developers	3	668	November 15, 2022
Pystan feature request: don’t require `data` at build time PyStan	2	383	May 26, 2021

Proper Way to Include Stan Model in Python Package

Related topics