RFC: Return of the Monorepo

syclik · June 15, 2018, 6:31pm

I don’t know if we’ve discussed, but does it make sense to just have the monorepo for stan-dev/stan and stan-dev/math? That way CmdStan, RStan, and PyStan are treated the same.

Bob_Carpenter · June 15, 2018, 6:33pm

Yes, we discussed that.

I’d be more inclined to bring RStan and PyStan into the monorepo at this point. The separate repos have been nothing but a pain for testing with upstream dependencies.

ariddell · June 15, 2018, 9:21pm

With PyStan 3 (httpstan, technically), I’ve stopped using a submodule
for stan and just committed the relevant Stan release tarball into the
tree. It works fine so far.

Bob_Carpenter · June 15, 2018, 9:24pm

How do you keep up with the develop branch of Stan that way?

ariddell · June 16, 2018, 12:03pm

When there’s a new release of Stan that requires changes the relevant
changes are made at the same time as the new Stan code is added to the tree.

wds15 · June 16, 2018, 4:45pm

The mono repo is great, but I think we should only include the cmdstan interface, because

we should have a frontend in order to test all our layers … including the top user-facing interface
cmdstan is very light in terms of additional dependencies to have
cmdstan is to me the “vanilla” stan - and it’s a good thing to have a reference

However, if we include cmdstan in the mono-repo - how can rstan and pystan make this mono-repo a subbmodule without the cmdstan stuff?

Bob_Carpenter · June 17, 2018, 5:30am

If that’s only official releases, does that mean you don’t keep up with the develop branch on stan-dev/stan? I guess that’s pretty stable.

That’s the plan to start. CmdStan is lightweight enough I don’t see a problem bringing that in along with the submodules in R and Python. Sounds like Allen isn’t even using submodules for PyStan.

ariddell · June 17, 2018, 12:37pm

Right, PyStan does not really need to update the Stan source code
between releases. If there is a new Stan feature that requires (early)
testing with development Stan code, I think we’d just do this in a
separate branch.

syclik · June 19, 2018, 12:23pm

Thanks, @wds15. I have similar concerns.

I don’t want to elevate CmdStan above RStan and PyStan (or demote it for that matter). I think they should be on equal footing. I’d rather they were all in or all out. I treat the interfaces that use CmdStan as a different class of interface since they don’t write any C++ code directly.

The only things that could break at the CmdStan level when updating Stan happen rarely:

changes to build instructions
API changes

The behavior of the samplers are tested at the Stan level and should be kept there.

In my mind, it’s easier if Stan and Math were merged together leaving CmdStan, RStan, and PyStan as separate repos. But I could be convinced that only CmdStan should be there (against what I think is natural) or that all three interfaces live together. It’d be great to have comprehensive tests across the interfaces, but we’ve never gotten that going.

wds15 · June 19, 2018, 12:29pm

one more point to consider (sorry if already raised) is the reason why we split stan-math apart from stan: As I understood a key advantage of this setup was that the test-burden was much decreased for anything upstream from stan-math.

In practice this means that we do not have to run distributions tests whenever we change things in the language.

Can we still have this convenience with a mono-repo? If the answer is no, then this is a hefty price we are paying here.

seantalts · June 19, 2018, 12:37pm

We will be able to configure testing separately from git history, so we can avoid testing distributions every time we change anything.

I still think that CmdStan:

is fairly simple, something similar to an “example” or bare bones interface representing the minimal complete Stan implementation
from a people perspective is grouped with Stan and Math
is required for end-to-end testing

and so we should include it with the other two repos.

syclik · June 19, 2018, 12:44pm

That’s enough of a justification for me to be behind it. I don’t think anyone is trying trying to make CmdStan more than that, so we’re safe for now.

Bob_Carpenter · June 19, 2018, 6:09pm

CmdStan is different than RStan and PyStan in many relevant ways:

BSD license
pure C++
zero dependencies outside of those in stan and math libs
[from @seantalts] same developers as stan and math (for now anyway)

What’s the advantage of not including CmdStan in the monorepo? I like that it gives you an end-to-end functioning package.

That was a motivation. But we were wrong. This has hugely increased the test burden now that we do upstream tests which we can’t synchronize.

Right, but we do run the other way around, which is much more common.

Yes, but I think we may just go to more combined testing because it’s too hard to keep all the dependencies in place otherwise.

seantalts · October 25, 2018, 5:59pm

Heads up - I’m working with a contractor who is ready to begin this work. I think it might be a reasonable idea to freeze merges to develop(s) for a week while he finishes? Is that feasible? @Bob_Carpenter

syclik · October 26, 2018, 3:31pm

Freezing merges should be ok. I think a week would be fine, but @Bob_Carpenter can reply.

If it doesn’t go as planned, we can always merge into wherever and then reapply the merges afterwards, right? Hopefully we won’t have too many that would need to be reapplied.

seantalts · October 26, 2018, 3:43pm

Yeah, it should be doable and just a few one-offs if we mess up, we’re not super high traffic at the moment.

syclik · October 26, 2018, 3:53pm

Cool. And I’m definitely not opposed to a freeze for a week.

mitzimorris · October 28, 2018, 6:55pm

found this today:

(starting from this: https://stackoverflow.com/questions/33569189/convert-git-repo-with-submodules-to-single-repo)

Bob_Carpenter · October 29, 2018, 5:12pm

Yes, that should be OK. What happens to the PRs in progress now?

Bob_Carpenter · October 29, 2018, 5:13pm

Thanks—that looks like the kind of thing we need.

Topic		Replies	Views
Makefiles PRs about to go in for Math, Stan, and CmdStan Developers	7	535	September 27, 2018
The Stan refactor branch is into github.com/stan-dev/stan Developers maintenance	2	674	January 10, 2017
MathematicaStan Developers	16	1254	February 24, 2017
Schedule for Splitting Apart The Stan Repos? Developers	16	1227	November 7, 2017
Makefile upgrades. Can anyone review them? Developers	3	539	September 25, 2018

RFC: Return of the Monorepo

Related topics