Tool to auto-generate model diagrams?

mike-lawrence · January 21, 2019, 4:36pm

Shot in the dark, but a reviewer asked for a diagram of my model, and while I could do one by hand I thought I’d check if anyone knows any tools that attempt to automate some portion of that task. Obviously a hard problem, but figured it might be common enough that someone might have tried to tackle it.

increasechief · January 21, 2019, 4:39pm

Greta does this. It’s about midway down here under Plotting: https://greta-stats.org/articles/get_started.html

mike-lawrence · January 21, 2019, 4:46pm

Ah, my model was done in Stan; presumably no automatic Stan-to-Greta converters too?

increasechief · January 21, 2019, 4:50pm

No but they do offer help for translating from Stan to Greta: https://greta-stats.org/articles/example_models.html

mike-lawrence · January 21, 2019, 5:00pm

I think it’d be about the same amount of work to convert (and check) the code as it would be to do then diagram by hand.

increasechief · January 21, 2019, 5:06pm

Yeah I think the (most?) interesting aspect to Greta is using TensorFlow to fit probability models in R.

betanalpha · January 21, 2019, 8:39pm

The diagrams to which you refer are for specifying probabilistic graphical models. The Stan Modeling Language, on the other hand, is richer than just probabilistic graphical models which means that we cannot define diagrams for a general Stan program. Moreover, in turns out to be theoretically impossible to translate those limited Stan programs that are equivalent to graphical models to a graphical model and hence graphical model representations like diagrams. Heuristic translations can go a long way, but given the tricky edge cases no one has attempted anything along this line.

increasechief · April 19, 2019, 8:45pm

Came across this nice new R-package for visualizing DAGS yesterday thanks to @bgoodri which may have been helpful to you: ggdag. The video content is here: https://youtu.be/3p5zCXoggtA?t=2483. It’s not auto-generating but it could be the best alternative in R at the moment. What did you end up using?

eee · May 30, 2019, 10:47pm

Exactly what i was searching for, to see the structure and causal relations in my Stan function. The examples are so easy to code, its easier than sketching it on a piece od paper.

andymilne · May 30, 2019, 11:56pm

Also worth checking out dagitty for causal DAGS – the online version has a nice GUI to play with: http://www.dagitty.net/development/dags.html

seantalts · June 14, 2019, 12:52pm

Wait, why is it theoretically impossible to generate a diagram from a Stan model that is equivalent to a graphical model? If anyone reading this wants to learn ocaml and help build such a tool based on the AST in the new compiler, I’m creating an issue here: https://github.com/stan-dev/stanc3/issues/177

sakrejda · June 14, 2019, 1:16pm

See: ICAR. The real question isn’t about theoretical but in practice can you take a model I specify as target+= and make a useful DAG-like viz. out of it.

seantalts · June 14, 2019, 1:39pm

Agreed - we don’t need it to work in 100% of cases including halting-problem-style pathological examples. It’d be great if it worked for 50% of models people write already and helped influence people to write more generative models.

betanalpha · June 14, 2019, 4:15pm

What are commonly classed “probabilistic graphical models” are a very particular subset of probabilistic models, namely those faithfully specified as directed acyclic graphs. Another common class of graphical models are those faithfully specified as undirected graphs. The overlap between these two classes is only partial, with some models faithfully specified by one, both, or none. Additionally there are generalizations of graphical models to expand the scope of faithful representations. See Bishop 8.3.4.

The Stan language goes beyond all of this by requiring only a density representation. Converting a Stan program into a graphical specification will be well-posed only if the model falls into the domain of the corresponding graph type, and then implementable only if one can identify that correspondence in finite time.

The problem with heuristic translations beyond their heuristic nature is that they confuse the intent of the Stan language. If we wanted a pure graphical language then we would have written an entirely different language. Indeed this is the approach that BUGS, PyMC, and others have taken which is why they have such tools.

In my opinion throwing down incomplete heuristics without a hell of a good UX that is able to communicate what the graphical model representation means and what it doesn’t mean is only going to confuse users and hence will be more danger than benefit. And I haven’t seen any discussion of such an UX at all.

Bob_Carpenter · June 14, 2019, 5:26pm

I think it’d be more interesting if it was fast. When I tried Greta, it was super slow. Here’s the thread where I evaluated it. It’d be interesting to know if it’s gotten faster or if there was just something I was doing wrong.

Bob_Carpenter · June 14, 2019, 5:29pm

I agree. I don’t want something that only works on a subset of the Stan language (one with only sampling statments, single assignment to variables, no conditionals, and no local variables.

I think a more viable approach for us would be to define a directed graphical modeling language like BUGS/JAGS/PyMC3 and a translation of that to Stan. That’d let us do all the cool stuff you can do with a directed graphical model like do automatic simulation, allow missing data, etc.

Matthijs · June 14, 2019, 7:08pm

Why not just extract a factor graph (these encompass both directed and undirected graphical models) from a Stan program? Ryan already has implemented the code to do this. Then you can visualize that. This is something that is always possible.

Moreover, it’s easy to check whether a Stan program is a generative model using static analysis. If so, we can give users the option to also visualize it as a DAG if they want. I don’t see the problem.

Not sure if it’s easy to check whether the program corresponds to an undirected graphical model. If so, then we can also allow people to visualize the models that way if they so desire.

jeffreypullin · June 15, 2019, 2:10am

Hi Bob,

After the 0.3 release greta is now much faster. We don’t have specific benchmarks but small models which used to take minutes now sample in a few seconds.

jeffreypullin · June 15, 2019, 2:19am

I do like the idea of a graphical modelling language which compiles to Stan - that sort of gives you the best of both worlds: a high level* UI for those that don’t want to get into the nitty gritty and the more low level Stan interface for those who need to.

I would naively imagine that such graphical language would include less types etc. than Stan

wpetry · August 7, 2019, 5:15pm

Have you seen the bayesvl package? If I follow correctly, this will converts “hand-drawn” DAGs --> Stan code, although I haven’t played around with it much more than the examples. It doesn’t appear to have the inverse (Stan code --> DAG) that was requested in the original post.

Topic		Replies	Views
Compiling a (simple) model graph (the DAG) to a Stan (or BUGS) program General	6	597	February 18, 2023
Greta package? General	4	2917	June 5, 2018
Extracting Factor Graphs for Stan Models from stanc Developers compiler	4	79	February 11, 2025
Stan reading material General	6	543	March 7, 2023
Adding models to Stan manual Developers	8	872	June 27, 2019

Tool to auto-generate model diagrams?

Related topics