Request for Volunteers to Test Adaptation Tweak

Recently I realized that our adaptation target, which is a heuristic proxy meant to mimic a Metropolis acceptance probability, might be too conservative, ultimately giving a step size that is smaller than necessary. That smaller step size in turn results in lower effective sample sizes and more expensive numerical integrations.
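
For context, here is a rough sketch of the mechanism being tuned (assuming the standard dual-averaging warmup; the heuristic proxy \alpha below is exactly the piece being revisited): warmup adjusts the step size \epsilon until the average of the per-transition acceptance statistic matches the target \delta, CmdStan's adapt_delta (0.8 by default),

\mathbb{E}[\alpha(\epsilon)] \approx \delta.

If \alpha systematically understates the acceptance probability it is meant to mimic, then the adaptation compensates with a smaller \epsilon than necessary, hence more leapfrog steps per transition and a lower effective sample size per unit time.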

I’ve implemented what I believe is a more appropriate adaptation target in stan-dev/stan:feature/updated_stepsize_adapt_target and preliminary results confirm the theoretical intuition, but before making a pull request I want to see how robust the performance is.

Hence I’m reaching out to the community in case you have a second to give it a try on a sophisticated model that you might happen to have lying around. I’m hoping to get the summaries for the adaptation parameters, the effective sample sizes, and the effective sample sizes per unit time for both current develop and this new branch.

For example in CmdStan the comparison might look like

git clone git@github.com:stan-dev/cmdstan.git (if you have SSH set up on GitHub)
git clone https://github.com/stan-dev/cmdstan (otherwise)
cd cmdstan
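# pull in the Stan and Stan Math library submodules that CmdStan needs to build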
git submodule update --init --recursive

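# build the CmdStan tools (including bin/stansummary), then compile and run the model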
make build
make CC=clang++ -j4 O=3 <MODEL_NAME>
./<MODEL_NAME> <cmdstan options>

# Compute sampler parameter summaries
../../../bin/stansummary output.csv | awk 'NR > 5 && NR < 15 {print $0}'

# Compute sum of effective sample sizes
../../../bin/stansummary output.csv | awk 'NR > 14 && NR < 115 {sum += $8} END {print sum}'

# Compute sum of effective sample sizes per time
../../../bin/stansummary output.csv | awk 'NR > 14 && NR < 115 {sum += $9} END {print sum}'

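# switch the Stan submodule to the feature branch, then rebuild and rerun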
cd stan 
git checkout feature/updated_stepsize_adapt_target
cd ..

make clean-all
make build
make CC=clang++ -j4 O=3 <MODEL_NAME>
./<MODEL_NAME> <cmdstan options>

# Compute sampler parameter summaries
../../../bin/stansummary output.csv | awk 'NR > 5 && NR < 15 {print $0}'

# Compute sum of effective sample sizes
../../../bin/stansummary output.csv | awk 'NR > 14 && NR < 115 {sum += $8} END {print sum}'

# Compute sum of effective sample sizes per time
../../../bin/stansummary output.csv | awk 'NR > 14 && NR < 115 {sum += $9} END {print sum}'

but comparisons run in any interface are welcome provided the above information can be communicated.

Thanks to any volunteers who are able to produce comparisons, and to those who make an attempt!


Whoa, how’d you realize this? Hmmm.

Also, wouldn’t the N_eff be dependent on the dataset used with said models? Like, what if someone specifies a garbage model? How is this a good heuristic?

I tried to reproduce the steps but… the first command gives me:

git clone git@github.com:stan-dev/cmdstan.git
Cloning into 'cmdstan'...
git@github.com: Permission denied (publickey).
fatal: Could not read from remote repository.

Please make sure you have the correct access rights
and the repository exists.

git clone https://github.com/stan-dev/cmdstan


What’s the intuition for the new one? Is there any specific case we’d expect it to perform better on?

I’d like to know if there’s something principled behind it, like a diagnostic that was changed, a method, or a mathematical explanation, with mathematics, code examples, etc., to eliminate all ambiguity.

Sorry, the example presumed that you have SSH keys set up with GitHub. As @aornugent notes, you can instead clone over HTTPS with the command git clone https://github.com/stan-dev/cmdstan.

In theory the performance should be uniformly better, but I want more empirical evidence before making any practical claims, hence the request. The intuition is a bit subtle, but if the empirical evidence corroborates it then I’ll write something up about it in the PR.


I tried to follow the steps but line 2 is giving me issues, compounded by the fact that I don’t know what that line does:

~ user$ git clone https://github.com/stan-dev/cmdstan
Cloning into 'cmdstan'...
remote: Enumerating objects: 36, done.
remote: Counting objects: 100% (36/36), done.
remote: Compressing objects: 100% (29/29), done.
remote: Total 6288 (delta 14), reused 24 (delta 7), pack-reused 6252
Receiving objects: 100% (6288/6288), 153.89 MiB | 13.32 MiB/s, done.
Resolving deltas: 100% (3324/3324), done.
~ user$ git submodule update —init —recursive
fatal: Not a git repository (or any of the parent directories): .git

You have to first change directories into the new cmdstan directory. I’ve updated the instructions, although keep in mind that they are only meant as a guideline, and one aimed at those who have used CmdStan before.
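
Concretely, after the clone finishes:

cd cmdstan
git submodule update --init --recursive

(That second command pulls in the Stan and Stan Math library submodules that CmdStan needs to build.)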


Got ya. I had tried that also and got these errors:

error: pathspec '—init' did not match any file(s) known to git.
error: pathspec '—recursive' did not match any file(s) known to git.

I have dabbled with CmdStan before, it’s just been a while, and I don’t tend to do this stuff in the shell, so I’m rusty!

The dashes autoformatted into one during my copy and pasting. Sigh.
To run your model you’ll have to have your data handy in a data.R file and know how to configure the cmdstan executable – if that ends up giving you trouble then don’t worry about trying to force your way through. Thanks for the attempt regardless!
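
For example, a minimal invocation might look like the following (hypothetical settings; adjust the data file name and sampler options to your model):

./<MODEL_NAME> sample data file=data.R output file=output.csv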


Yerp, got things loading. I used to be able to do this so I should be able to remember it. I’ll have a look for an interesting model to try amongst my past dabblings.

Hi again. I got it to run, although I’ve had trouble with the second and third summary commands above, which just give me nan. Nevertheless, here is what the first summary command gives:

First run:

Second run:

The model was a linear multivariate normal model with two Y variables, run on simulated data. In case it is useful I attach both the model linear_mvnormal.stan (1.7 KB) and the data file data.R (72.9 KB). I can try this again on real data if you wish. Note that run 1 took about 43 seconds, while run 2 took about twice that. I’d share the output files but they are huge because I forgot to remove the posterior predictions 🤦‍♂️

Thanks! Try changing the 115 in the second and third commands to 15 + N_{\text{params}}, where N_{\text{params}} is the total number of parameters (not including transformed parameters or generated quantities).
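
For example, if your model had 20 parameters then 15 + 20 = 35 and the second command would become

../../../bin/stansummary output.csv | awk 'NR > 14 && NR < 35 {sum += $8} END {print sum}'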

Hey, I tried it. Sorry, it’s still nan despite trying various numbers. If it helps with the awk command, here is a screenshot of the first run’s csv file:

The problem is the components of the Cholesky factor that are fixed, which return ill-defined effective sample sizes in CmdStan. Can you just grab the first 20 lines or so of the stansummary output that follow the sampler diagnostic summaries you already shared? Thanks.
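
Something like the following should grab those rows, assuming (as in the commands above) that the parameter lines start at row 15 of the stansummary output:

../../../bin/stansummary output.csv | awk 'NR > 14 && NR < 35 {print $0}'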


Oh yeah, that makes sense. Sure, see the attached summary_run1.txt (4.1 KB) and summary_run2.txt (4.3 KB) files. Note that the mu[ , ] entries mark the start of the transformed parameters, so I cut it off a few lines into those.

Great, thanks!
