Posterior draws objects => recover original array data structure?

wds15 · October 13, 2020, 11:11am

Hi!

I am running a cmdstanR fit and getting back “draws” objects which do not make sense to me in that I can’t deal with them as I want to. The issue is that the returned “draws” objects flatten out all the dimensions, but I don’t want that. Essentially I would like to get the same output format as I am getting things from the “extract” method in rstan - that is a list of all the variables where each variable still has it’s strucutre.

So I do not want to see things like “theta[1]”, “theta[2]”, … etc., but rather “theta” which is structured accordingly (just as the extract method would do it from rstan).

It’s not clear to me how to reformat that in an elegant way. tidybayes does not seem to solve it since it targets tidy data structures.

Many thanks for any help on this.

Sebastian

ahartikainen · October 13, 2020, 11:30am

That wide structure is “original” structure.

To get ndim structure, have you tried posterior package?

(E.g. in CmdStanPy you can use ArviZ InferenceData to get ndim structure idata = az.from_cmdstanpy(fit))

rok_cesnovar · October 13, 2020, 11:38am

Not completely sure, but I think posteriors extract_variable and extract_variable_matrix from posterior can be used for that.

wds15 · October 13, 2020, 11:38am

I have looked at the posterior package, but the documentation is not helping me… maybe I overlook something?

wds15 · October 13, 2020, 11:40am

Nope. extract_variable gives me back a flat 1D vector, but not the original structure.

ahartikainen · October 13, 2020, 1:02pm

Yeah, at least their examples show ndim structure.

But if posterior don’t want to implement that kind of functionality, then there is always option to do it manually (this should then be inside CmdStanR)

Get a table of all theta vars --> (check order -->) reshape to correct order.

Not sure how easy this would be in R (probably similar as in python).

rok_cesnovar · October 13, 2020, 1:15pm

tidybayes has support for posterior draws (and thus cmdstanr as well) on a branch. That branch works directly with cmdstan fit. I have seen tweets from @mjskay with demos of that with gather_draws a while back. Not sure when that will hit cran (guessing posterior needs to be put on cran first).

I remembered we have an issue for that: https://github.com/stan-dev/cmdstanr/issues/183
Though not sure whether this falls under cmdstanr or posterior. My feeling is more the latter, but idk.

Cc: @jonah

Edit: i was reffering to this: https://twitter.com/mjskay/status/1289987974973685760?s=20

rok_cesnovar · October 13, 2020, 1:36pm

In the meantime you can simply use rstan::read_stan_csv(fit$output_files()) to get the rstan stanfit on which you can then use extract.

wds15 · October 13, 2020, 1:37pm

Nope… I tried that and it failed for me.

EDIT: Ok… so this approach fails if there is only warmup in the csv file, but no iterations from the sampling phase.

> rstan::read_stan_csv(scale_fit$warmup$output_files())
Error in `[<-`(`*tmp*`, buffer.pointer, , value = scan(con, nlines = 1,  : 
  subscript out of bounds

but when there are also samples from a sampling phase, it does seem to work. Maybe a rstan bug.

rok_cesnovar · October 13, 2020, 2:18pm

That is a bug in rstan’s read_stan_csv it seems. Will take a look.

mjskay · October 13, 2020, 3:08pm

It sounds like you might be looking for something like the rvar interface I am working on for posterior. It is very close to me making a PR onto the main branch (I got interrupted by the beginning of the fall quarter), but you can see a description of it here or try it out on the rv-like brach

wds15 · October 14, 2020, 6:42pm

The rvar idea sounds like what I am looking for… hopefully you can resume your efforts on this. Looking forward to it.

Bob_Carpenter · August 20, 2021, 4:22pm

That works for me, but it’d be nice to have this in cmdstanr without having to load rstan. One of the main advantages of cmdstanr is not having to install rstan!

I only need the draws, not anything else that read_stan_csv returns. I only want to be able to get the draws in a structured way that lets me avoid having to build strings representing indexed variables.

ahartikainen · August 20, 2021, 5:00pm

I think cmdstanr has this already (similar functionality is in cmdstanpy too)

Topic		Replies	Views
Extract draws object for the posterior package from a stanreg object? rstanarm posterior-package	1	617	January 30, 2021
Extracting draws from cmdstanr: array vs. df General	2	574	September 4, 2022
[Interface roadmap] fit objects and `extract` Developers	44	2352	September 17, 2019
How to extract output General	11	980	August 10, 2020
Combining posterior data from multiple chains when saving .csv output from CmdStanR inference object Other cmdstanr , posterior-package	2	825	February 1, 2022

Posterior draws objects => recover original array data structure?

Related topics