Timed iteration updates and backward compatibility standards

bgoodri · June 19, 2018, 6:04pm

The stan/services/util/generate_transitions.hpp file does not include the chain ID in its stringstream:

stan-dev/stan/blob/develop/src/stan/services/util/generate_transitions.hpp#L59


      
                                  callbacks::logger& logger, size_t chain_id = 1,
                                  size_t num_chains = 1) {
          for (int m = 0; m < num_iterations; ++m) {
            callback();
          
            if (refresh > 0
                && (start + m + 1 == finish || m == 0 || (m + 1) % refresh == 0)) {
              int it_print_width = std::ceil(std::log10(static_cast<double>(finish)));
              std::stringstream message;
              if (num_chains != 1) {
                message << "Chain [" << chain_id << "] ";
              }
              message << "Iteration: ";
              message << std::setw(it_print_width) << m + 1 + start << " / " << finish;
              message << " [" << std::setw(3)
                      << static_cast<int>((100.0 * (start + m + 1)) / finish) << "%] ";
              message << (warmup ? " (Warmup)" : " (Sampling)");
          
              logger.info(message);
            }

If you want me to add it in RStan or other interfaces, I can do that, but I don’t think interfaces should be responsible for doing things like that.

Bob_Carpenter · June 19, 2018, 6:10pm

I agree. That’s why we’re trying to move all this down to the services layer.

bgoodri · June 19, 2018, 6:14pm

I am definitely open to different ideas about how to do time-based refreshes. I just don’t think adding a refresh_seconds argument that preserves the existing behavior moves the needle very much on accomplishing the two goals of

Printing less crap on the screen, particularly when it has negative value added in short runs
Making it easier to guesstimate “How much longer will this long run take?”

Over months or more realistically years, users would learn to specify the refresh_seconds argument, but in the interim you still have all the costs I mentioned. In contrast, reinterpreting refresh makes immediate progress toward those two goals with not much cost.
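As a rough illustration of what reinterpreting refresh as a number of seconds could look like, here is a minimal sketch. The class name `TimedRefresh`, its `due` method, and the injectable clock are all hypothetical and are not part of Stan; the choice to always report the final iteration is also an assumption made for the example.

```cpp
#include <chrono>
#include <functional>
#include <utility>

// Hypothetical helper (not part of Stan): decides whether a progress
// message is due based on elapsed wall-clock time, not iteration count.
class TimedRefresh {
 public:
  using Clock = std::chrono::steady_clock;
  using NowFn = std::function<Clock::time_point()>;

  // refresh_seconds <= 0 disables progress messages entirely.
  // The clock is injectable so the logic can be exercised without waiting.
  explicit TimedRefresh(double refresh_seconds,
                        NowFn now = [] { return Clock::now(); })
      : interval_(refresh_seconds), now_(std::move(now)), last_(now_()) {}

  // Call once per completed iteration. The final iteration is always
  // reported; otherwise a report is due only when at least
  // refresh_seconds have passed since the previous report.
  bool due(bool is_last_iteration) {
    if (interval_.count() <= 0) return false;
    const Clock::time_point t = now_();
    if (is_last_iteration || t - last_ >= interval_) {
      last_ = t;
      return true;
    }
    return false;
  }

 private:
  std::chrono::duration<double> interval_;
  NowFn now_;
  Clock::time_point last_;
};
```

Under this sketch, a run that finishes in less than `refresh_seconds` prints only its final line, which is the "less output in short runs" behavior described above.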

No, I mean ones like those in rstanarm. If you have a version of foo.stan compiled by CmdStan, then ./foo will still work (albeit with the old progress behavior) and it is pretty easy for the CmdStan user to recompile foo.stan in order to get the new progress behavior. Someone who is using an R package with old compiled models that do not have a refresh_seconds argument and a new RStan that is passing a refresh_seconds argument gets an error message that they can’t do anything about until the R package gets recompiled with newly generated C++.

Bob_Carpenter · June 19, 2018, 6:19pm

I take it that’s because all of our services template the model class? We really need to change this so it’s not a problem going forward. That is, all the services should take references to a base class, not a template. Then we can also speed up compilation.
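To make the template-versus-base-class distinction concrete, here is a minimal sketch. The names `model_base`, `evaluate_at_zero`, and `std_normal_model` are hypothetical and heavily simplified; they are not Stan's actual model interface. The point is only that a service taking a base-class reference is compiled once, rather than being instantiated inside every compiled model.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical, heavily simplified model interface (an assumption for
// illustration; a real model class would have many more members).
class model_base {
 public:
  virtual ~model_base() = default;
  virtual std::size_t num_params() const = 0;
  virtual double log_prob(const std::vector<double>& theta) const = 0;
};

// A "service" written against the base class. It is compiled once and
// works with any model through virtual dispatch, so changing its
// argument list does not require regenerating each model's C++.
double evaluate_at_zero(const model_base& model) {
  std::vector<double> theta(model.num_params(), 0.0);
  return model.log_prob(theta);
}

// Example concrete model: independent standard normals, with the log
// density computed up to an additive constant.
class std_normal_model : public model_base {
 public:
  explicit std_normal_model(std::size_t n) : n_(n) {}
  std::size_t num_params() const override { return n_; }
  double log_prob(const std::vector<double>& theta) const override {
    double lp = 0.0;
    for (double x : theta) lp -= 0.5 * x * x;
    return lp;
  }

 private:
  std::size_t n_;
};
```

The trade-off is one virtual call per model evaluation in exchange for decoupling the service signatures from the compiled model code.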

Cleaning up the base class for models and making this change is the number one priority for me after MPI goes in and @mitzimorris’s refactor of the AST lands. So we should have a solution to this in 2.19, I hope. I really want to start tackling compilation time problems, and this will help with that.

syclik · June 19, 2018, 6:20pm

This is exactly what the interfaces are responsible for. See:

stan-dev/rstan/blob/b4d4c3e86e63f754d29a34b544d5122c8d2b9c1a/rstan/rstan/inst/include/rstan/stan_fit.hpp#L558


      
          
          if (!args.get_append_samples()) {
            writer.write_sample_names(s, sampler_ptr, model);
            writer.write_diagnostic_names(s, sampler_ptr, model);
          }
          
          // Warm-Up
          clock_t start = clock();
          
          std::stringstream prefix_stream;
          prefix_stream << "\nChain " << args.get_chain_id() << ", ";
          std::string prefix = prefix_stream.str();
          std::string suffix = "";
          R_CheckUserInterrupt_Functor interruptCallback;
          
          stan::services::mcmc::warmup<Model, RNG_t,
                                       R_CheckUserInterrupt_Functor>
            (sampler_ptr, args.get_ctrl_sampling_warmup(), args.get_iter() - args.get_ctrl_sampling_warmup(),
             args.get_ctrl_sampling_thin(),
             args.get_ctrl_sampling_refresh(), args.get_ctrl_sampling_save_warmup(),
             writer,

It’s just a matter of configuring the writer properly (the code to do that is already there). CmdStan doesn’t need to do this because it doesn’t work with multiple threads.

Just add the appropriate prefix to the current writer and it’ll get fixed. I thought I submitted a PR to fix it, but maybe I didn’t.
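A minimal sketch of the prefixing idea follows, assuming a logger interface with a single `info` method. Both `simple_logger` and `prefixed_stream_logger` are hypothetical stand-ins written for this example and are not the classes used in RStan or Stan.

```cpp
#include <ostream>
#include <sstream>
#include <string>

// Hypothetical stand-in for a logger interface (an assumption for
// illustration, not Stan's real class).
class simple_logger {
 public:
  virtual ~simple_logger() = default;
  virtual void info(const std::string& message) = 0;
};

// Writes each message to a stream, prepending a fixed per-chain prefix
// that the interface supplies when it constructs the logger.
class prefixed_stream_logger : public simple_logger {
 public:
  prefixed_stream_logger(std::ostream& out, int chain_id) : out_(out) {
    std::ostringstream prefix;
    prefix << "Chain " << chain_id << ", ";
    prefix_ = prefix.str();
  }

  void info(const std::string& message) override {
    out_ << prefix_ << message << '\n';
  }

 private:
  std::ostream& out_;
  std::string prefix_;
};
```

With this arrangement the code that builds the iteration message never needs to know the chain ID; the interface decides the prefix once, at construction time.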

bgoodri · June 19, 2018, 6:24pm

The main thing is for the chain ID to get restored one way or another. But why is it the responsibility of stan/services/util/generate_transitions.hpp to report the Iteration, the percentage of iterations completed, whether it is during or after the warmup period, and the setw but it is the responsibility of the interfaces to report the chain ID?

sakrejda · June 19, 2018, 6:44pm

It makes sense from the stan-dev/stan perspective because each service method is launching one chain, so of course the interface has to tell the code which chain is which at writer construction. We could change services to handle multiple chains but that’s another story… it would be really easy to do with threads with zero impact on performance at this point since threading was already introduced within chains.

bgoodri · June 19, 2018, 6:51pm

I agree about threads. In the meantime, it is probably easiest to add the chain ID in the interfaces. But the services API has to know the chain ID in order to know which file to write to, so it should be able to construct a complete message and just say to the interfaces “print this”.

sakrejda · June 19, 2018, 6:57pm

BTW: My preference is to just keep ‘refresh’ or whatever the argument is called and give it the new meaning. We all seem to agree there’s minimal cost and it’s an aspect of services that downstream should NOT rely on. It will mess up some scripts somewhere a little, in cosmetic terms, but the results will be fine. Thanks for putting the PR together.

My position is that the services should produce something like ‘chain id’, ‘iterations’, etc… and we should provide some functions within stan-dev/stan that can format these into standard messages. That way we could say both “you can rely on the presence of a chain ID and iteration output as an integer” and “you can’t rely on the exact text of the message, so if you do, good luck keeping up”.
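One way to picture this split is a structured record plus a separate formatting helper. The sketch below is only an assumption about how that might look: `progress_info` and `format_progress` are hypothetical names, and the message layout simply imitates the iteration message quoted earlier in this thread.

```cpp
#include <cmath>
#include <iomanip>
#include <sstream>
#include <string>

// Hypothetical structured progress record: the part downstream code
// could rely on.
struct progress_info {
  int chain_id;
  int iteration;         // 1-based index of the iteration just completed
  int total_iterations;  // warmup plus sampling
  bool warmup;
};

// Hypothetical standard formatter: the exact text is a presentation
// detail that callers should not parse.
std::string format_progress(const progress_info& p) {
  const int width = static_cast<int>(
      std::ceil(std::log10(static_cast<double>(p.total_iterations))));
  std::ostringstream msg;
  msg << "Chain [" << p.chain_id << "] Iteration: " << std::setw(width)
      << p.iteration << " / " << p.total_iterations << " [" << std::setw(3)
      << static_cast<int>((100.0 * p.iteration) / p.total_iterations)
      << "%] " << (p.warmup ? " (Warmup)" : " (Sampling)");
  return msg.str();
}
```

An interface that wants a progress bar would read the fields of `progress_info` directly, while one that wants console text would call the formatter and get the same wording as every other interface.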

bgoodri · June 19, 2018, 7:04pm

That is plausible, but we should have some standards to keep what the interfaces print consistent.

sakrejda · June 19, 2018, 7:08pm

Makes sense, I’m imagining that the helper functions should be used to achieve this goal and have well-documented standard usage.

ahartikainen · June 22, 2018, 12:31am

Hi, (sorry if this is off-topic)

In PyStan 3 the sampling progress output is done with the tqdm progress bar library, which has support for multiple progress bars and updates every time a draw is sampled (httpstan is streaming output from the Stan model).

Something similar probably could be done with CmdStan / RStan interfaces.

edit. cc @ariddell

Bob_Carpenter · June 22, 2018, 10:46pm

The issue is whether the interfaces or stan should be responsible for things like refresh and progress updates. The motivation for pulling things down into C++ implementations in Stan is that it enforces uniformity for the interfaces.

PyStan3 really has multiple interfaces if I understand what you’re trying to do:

the server interface to Stan
the client interface to the server
the user interface to the client

avehtari · June 27, 2018, 8:56pm

Thanks @bgoodri for the clear justifications. I now agree with you.

avehtari · June 27, 2018, 9:01pm

R also has a nice progress bar package that works in the terminal too.

I would say interfaces, but they should have an easy way to access the current progress status without needing to read the CSV. But I’m fine with Ben’s proposal now, as it will probably take some time before interfaces can get that information easily.

syclik · June 28, 2018, 12:21pm

I think there’s a better way to do this, but happy to move forward with this change.

My request is that it’s documented correctly. The refresh in seconds is not a guarantee that a message will be written at that time. It means that if you asked for a refresh of 60 seconds and it took 60,000 seconds to run, you won’t necessarily get 1000 messages.
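The arithmetic behind this caveat can be checked with a small simulation. The sketch assumes that the elapsed time is only examined when an iteration finishes and that every iteration takes the same amount of time; `count_messages` is a hypothetical helper, not Stan code.

```cpp
// Hypothetical simulation: counts how many progress messages a run
// would emit if the timer is checked only at the end of each iteration.
// Simulated time is used, so nothing actually sleeps.
int count_messages(int num_iterations, double seconds_per_iteration,
                   double refresh_seconds) {
  int messages = 0;
  double now = 0.0;   // simulated elapsed time
  double last = 0.0;  // simulated time of the previous message
  for (int m = 0; m < num_iterations; ++m) {
    now += seconds_per_iteration;  // one iteration completes
    if (now - last >= refresh_seconds) {
      ++messages;
      last = now;
    }
  }
  return messages;
}
```

Under these assumptions, `count_messages(60000, 1.0, 60.0)` gives 1000 messages for a 60,000 second run, but `count_messages(100, 600.0, 60.0)` gives only 100 for a run of the same length, because a message can never appear more often than once per iteration.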

The better way to do this is just to notify when there’s an iteration (this is what I’ve suggested). Then the interfaces could format the message consistently with functions in Stan, but would have the ability to run a separate progress bar or something in a different thread to handle the user’s request of refreshing at a specific time much more easily. And this behavior that’s requested could also be handled easily.

Actually, it wouldn’t be hard to do this. @bgoodri, are you in a rush? Can this wait for the weekend?

Bob_Carpenter · June 29, 2018, 5:27am

Isn’t that currently what happens with the writers?

syclik · July 7, 2018, 2:29am

No. And that’s why I think we can just do this a lot better.

What’s getting output is a string message that looks like:

Iteration: 2000 / 2000 [100%]  (Sampling)

The logic as to when that gets printed is handled by the service code. I think it’s just a lot cleaner for the services to let the client know that there’s another iteration (instead of jamming a string into something that’s supposed to be written out) and have the client deal with it how it wants. If the client wants to handle a real timer in a separate thread so that it’s actually reported every X seconds, that’s fine. If the client wants to use a progress bar, that’s fine too.
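A minimal sketch of the "just notify on every iteration" design might look like the following. Everything here is hypothetical: `iteration_event`, `iteration_callback`, and `run_iterations` are names invented for the example, and the loop is a stand-in for a service routine, not Stan's actual sampling code.

```cpp
#include <functional>

// Hypothetical event handed to the client after every iteration.
struct iteration_event {
  int chain_id;
  int iteration;  // 1-based
  int total;      // warmup plus sampling
  bool warmup;
};

using iteration_callback = std::function<void(const iteration_event&)>;

// Stand-in for a service loop. It does no formatting and no throttling;
// it only tells the client that another iteration has finished.
void run_iterations(int chain_id, int num_warmup, int num_sampling,
                    const iteration_callback& on_iteration) {
  const int total = num_warmup + num_sampling;
  for (int m = 1; m <= total; ++m) {
    // ... one sampler transition would happen here ...
    const iteration_event ev{chain_id, m, total, m <= num_warmup};
    on_iteration(ev);
  }
}
```

The client's callback can then store the latest event for a timer thread to report every X seconds, advance a progress bar, or print a line every N iterations, without the service code knowing which.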

Sure, we can change this argument, but I think it just falls short. It’s really unsatisfying to me that it’s not actually going to report the iteration number at the specified time. For example, if the requested seconds was 30 and each iteration lasts 600 seconds, then I think it’ll just look broken.

Bob_Carpenter · July 11, 2018, 5:27am

That’s a good point.

The other place I wonder whether it’ll look broken is when we have something like refresh = 10, which used to spew output and now waits 10 seconds.

We’re trying to balance three demands:

1. wanting the interfaces to behave the same way,
2. not wanting to write code multiple times,
3. giving the interfaces flexibility to do whatever they want with the output.

I think @syclik is arguing we want to prioritize (3) over (1) and (2).

I don’t see how this is going to prevent progress bars. Those would go on the iterations, not this refresh thing that’s just a message to the console.

As far as messages to the console go, I thought we were OK with having the basic logging/console output not be under the same backward compatibility requirements as draws, etc. If that’s not everyone’s understanding we need to come to some common understanding so we can make progress.

bgoodri · July 17, 2018, 11:21pm

I am trying to fix this for 2.18 but I don’t understand. Before we were passing strings around, so you could prefix. Now, it just passes this stream_logger thing

stan-dev/rstan/blob/develop/rstan/rstan/inst/include/rstan/stan_fit.hpp#L395


      
            std::vector<int> params_i;
            std::vector<double> constrained_params;
            boost::ecuyer1988 rng = stan::services::util::create_rng(random_seed, id);
            model.write_array(rng, const_cast<std::vector<double>&>(params), params_i,
                              constrained_params);
            return constrained_params;
          }
          /**
          * @tparam Model
          * @tparam RNG
          *
          * @param args: the instance that wraps the arguments passed for sampling.
          * @param model: the model instance.
          * @param holder[out]: the object to hold all the information returned to R.
          * @param qoi_idx: the indexes for all parameters of interest.
          * @param fnames_oi: the parameter names of interest.
          * @param base_rng: the boost RNG instance.
          */
          template <class Model, class RNG_t>
          int command(stan_args& args, Model& model, Rcpp::List& holder,
                      const std::vector<size_t>& qoi_idx,

to Stan and Stan builds the info message.
