Hah, that was a misleading error message!
Ah, I’ll wait then
I will discuss with @rok_cesnovar - I think there have been enough minor fixes in the past week to justify a 2.30.1
After installing the experimental version of RStan, you can use the nightly stanc3
as follows:
file.remove(system.file("stanc.js", package = "rstan"))
download.file("https://github.com/stan-dev/stanc3/releases/download/nightly/stanc.js", file.path(find.package("rstan"), "stanc.js"))
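To confirm the nightly stanc.js landed where RStan expects it, a quick check (my sketch, using the same path as in the snippet above):

```r
# Verify the nightly stanc.js replaced the bundled one
stancjs <- file.path(find.package("rstan"), "stanc.js")
file.exists(stancjs)      # should be TRUE after the download
file.info(stancjs)$mtime  # should show the download date, not the install date
```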
Sounds good. Thanks!
Should RStan's stan() honour options(stanc.allow_optimizations = TRUE)? I installed the latest experimental version (with your commits today), but running stan() creates the C++ code without the --O1 flag.
It should. What’s the output of the following:
formals(rstan::stanc)
and
getOption("stanc.allow_optimizations")
> getOption("stanc.allow_optimizations")
[1] TRUE
> formals(rstan::stanc)
$file
$model_code
[1] ""
$model_name
[1] "anon_model"
$verbose
[1] FALSE
$obfuscate_model_name
[1] TRUE
$allow_undefined
isTRUE(getOption("stanc.allow_undefined", FALSE))
$allow_optimizations
isTRUE(getOption("stanc.allow_optimizations", FALSE))
$standalone_functions
isTRUE(getOption("stanc.standalone_functions", FALSE))
$use_opencl
isTRUE(getOption("stanc.use_opencl", FALSE))
$warn_pedantic
isTRUE(getOption("stanc.warn_pedantic", FALSE))
$warn_uninitialized
isTRUE(getOption("stanc.warn_uninitialized", FALSE))
$isystem
c(if (!missing(file)) dirname(file), getwd())
How did you know it doesn’t honour the option? Maybe your code isn’t affected by the O1 optimizations? Also, does rstan::stanc(model_code = <stancode>, allow_optimizations = TRUE) generate different C++ code?
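One way to check (a sketch of my own, not from the thread; it assumes the `cppcode` element of the return value) is to compare the generated C++ with and without the flag:

```r
# Compare generated C++ with and without --O1 (sketch)
code <- "parameters { real x; } model { x ~ std_normal(); }"
cpp_plain <- rstan::stanc(model_code = code)$cppcode
cpp_opt   <- rstan::stanc(model_code = code, allow_optimizations = TRUE)$cppcode
identical(cpp_plain, cpp_opt)  # FALSE if the flag is being honoured
```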
The generated code shows
return std::vector<std::string>{"stanc_version = stanc3 v2.30.0", "stancflags = "};
Works with CmdStanR, with a 50% drop in sampling time. The generated code also misses the --O1 in that line I mentioned above.
I would not rely on the "stancflags = " line for stancjs. That line is generated based on the presumed existence of argv from the system, which won’t be populated under stancjs.
Argh
Agreed, this is an oversight we should fix
I was not aware that this is not respected in stanc.js. Sorry for the misleading information.
Don’t worry. It was useful with CmdStanR, and thanks to this it will be fixed.
When will the O1 optimizations be certified Aki-fresh?
Seriously, @avehtari, thanks for going down this rabbit hole and improving the code base!
I now got stan_model()
with allow_optimizations = TRUE
to work and see a 50% speedup. I don’t have time now to investigate further, or to test stan()
or rstanarm compilation.
I would also like to thank @hsbadr - having the experimental branch of RStan working with develop is great for stress testing these features of the stancjs interface and also testing against more complicated models which people in the R ecosystem have been building up for years
Yes, huge thanks to @hsbadr! He’s the hero of this story.
I have now successfully tested that these work:
stan_model(file=file, allow_optimizations = TRUE)
and
options(stanc.allow_optimizations = TRUE)
stan(file=file, data=data)
Two suggestions:
1. With stan_model(., verbose = TRUE) it would be useful to also print out the stanc flags.
2. There is also an issue that RStan tries to be clever avoiding recompilation, but fails when STANCFLAGS change. I first ran stan_model(file=file1, allow_optimizations = FALSE), and after that running stan_model(file=file2, allow_optimizations = TRUE), where file2 was the same as file1 except for the name and initial whitespace, did not recreate the C++ code but recompiled the cached C++ code with optimizations off.
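Until the cache keys on STANCFLAGS, a workaround sketch (my own suggestion, not from the thread; it assumes the cached compiled model is the .rds file written next to the .stan file when auto_write is enabled):

```r
# Workaround sketch: force a fresh translation when changing stanc flags
options(stanc.allow_optimizations = TRUE)

# Either skip writing/reading the cached model for this call ...
m <- rstan::stan_model(file = "model.stan", auto_write = FALSE)

# ... or delete the cached compiled model next to the .stan file first
rds <- sub("\\.stan$", ".rds", "model.stan")
if (file.exists(rds)) file.remove(rds)
```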
I also was able to install the latest rstanarm, but I’m not yet convinced that the --O1 flag got through.
However, I realized I can check whether rstanarm code has anything that can use SoA, and it seems there is not much speedup expected. For example, bernoulli.stan:
> model <- cmdstan_model(stan_file="bernoulli.stan", compile=FALSE);model$check_syntax(stanc_options = list("debug-mem-patterns", "O1"))
vector[z_beta_1dim__] z_beta: AoS
vector[K_smooth] z_beta_smooth: AoS
vector[smooth_sd_raw_1dim__] smooth_sd_raw: AoS
array[vector[K], hs] local: AoS
array[vector[K], mix_1dim__] mix: AoS
vector[q] z_b: AoS
vector[len_z_T] z_T: AoS
vector[len_rho] rho: AoS
vector[len_concentration] zeta: AoS
vector[t] tau: AoS
vector[K] beta: AoS
vector[K_smooth] beta_smooth: AoS
vector[smooth_sd_1dim__] smooth_sd: AoS
vector[q] b: AoS
vector[len_theta_L] theta_L: AoS
vector[0] inline_hs_prior_return_sym109__: AoS
vector[inline_hs_prior_K_sym110__] inline_hs_prior_lambda_sym111__: AoS
vector[inline_hs_prior_K_sym110__] inline_hs_prior_lambda2_sym113__: AoS
vector[inline_hs_prior_K_sym110__] inline_hs_prior_lambda_tilde_sym114__: AoS
vector[0] inline_hs_prior_return_sym102__: AoS
vector[inline_hs_prior_K_sym103__] inline_hs_prior_lambda_sym104__: AoS
vector[inline_hs_prior_K_sym103__] inline_hs_prior_lambda2_sym106__: AoS
vector[inline_hs_prior_K_sym103__] inline_hs_prior_lambda_tilde_sym107__: AoS
vector[0] inline_hsplus_prior_return_sym94__: AoS
vector[inline_hsplus_prior_K_sym95__] inline_hsplus_prior_lambda_sym96__: AoS
vector[inline_hsplus_prior_K_sym95__] inline_hsplus_prior_eta_sym97__: AoS
vector[inline_hsplus_prior_K_sym95__] inline_hsplus_prior_lambda_eta2_sym99__: AoS
vector[inline_hsplus_prior_K_sym95__] inline_hsplus_prior_lambda_tilde_sym100__: AoS
vector[0] inline_hsplus_prior_return_sym86__: AoS
vector[inline_hsplus_prior_K_sym87__] inline_hsplus_prior_lambda_sym88__: AoS
vector[inline_hsplus_prior_K_sym87__] inline_hsplus_prior_eta_sym89__: AoS
vector[inline_hsplus_prior_K_sym87__] inline_hsplus_prior_lambda_eta2_sym91__: AoS
vector[inline_hsplus_prior_K_sym87__] inline_hsplus_prior_lambda_tilde_sym92__: AoS
vector[0] inline_make_theta_L_return_sym126__: AoS
vector[len_theta_L] inline_make_theta_L_theta_L_sym127__: AoS
matrix[inline_make_theta_L_nc_sym132__, inline_make_theta_L_nc_sym132__] inline_make_theta_L_T_i_sym133__: AoS
vector[inline_make_theta_L_nc_sym132__] inline_make_theta_L_pi_sym137__: AoS
vector[inline_make_theta_L_r_sym142__] inline_make_theta_L_T_row_sym139__: AoS
vector[0] inline_make_b_return_sym145__: AoS
vector[rows(z_b)] inline_make_b_b_sym146__: AoS
matrix[inline_make_b_nc_sym149__, inline_make_b_nc_sym149__] inline_make_b_T_i_sym150__: AoS
vector[inline_make_b_nc_sym149__] inline_make_b_temp_sym153__: AoS
vector[(K + K_smooth)] coeff: AoS
vector[N[1]] eta0: AoS
vector[N[2]] eta1: AoS
vector[inline_ll_clogit_lp_J_sym180__] inline_ll_clogit_lp_summands_sym183__: AoS
vector[inline_ll_clogit_lp_N_g_sym185__] inline_ll_clogit_lp_eta_g_sym187__: AoS
vector[0] inline_pw_bern_return_sym159__: AoS
vector[inline_pw_bern_N_sym160__] inline_pw_bern_ll_sym161__: AoS
vector[inline_pw_bern_N_sym160__] inline_pw_bern_pi_sym162__: SoA
vector[0] inline_pw_bern_inline_linkinv_bern_return_sym4___sym163__: AoS
vector[0] inline_pw_bern_return_sym168__: AoS
vector[inline_pw_bern_N_sym169__] inline_pw_bern_ll_sym170__: AoS
vector[inline_pw_bern_N_sym169__] inline_pw_bern_pi_sym171__: SoA
vector[0] inline_pw_bern_inline_linkinv_bern_return_sym4___sym172__: AoS
vector[(p[inline_decov_lp_i_sym197__] - 1)] inline_decov_lp_shape1_sym193__: AoS
vector[(p[inline_decov_lp_i_sym197__] - 1)] inline_decov_lp_shape2_sym194__: AoS
The specific example I’ve been running is a bit sensitive to random initialization and exact prior specifications, but with logistic regression with a regularized horseshoe prior and p=1536, n=54, stan_glm is now an order of magnitude slower than the corresponding brms-generated code. I know it’s probably difficult to change the rstanarm code as it needs to be very flexible, but pinging @bgoodri so that at least he is aware of the experiment.
And thanks @hsbadr for all the help here and for all the work on getting RStan and rstanarm to work with the latest Stan!