Reference on benefits of rstanarm GAM models over frequentist alternatives?

benskov · April 6, 2020, 9:13am

Hi everybody,

I would like to use rstanarm::stan_gamm4 for a study we’re preparing, and to substantiate my choice of Bayesian parameter estimation (unfortunately, this is still needed) I’ve come across some these claims and was wondering if there were any references to back them:

“The documentation of lme4 and gamm4 has various warnings that acknowledge that the estimated standard errors, confidence intervals, etc. are not entirely correct, even from a frequentist perspective.
A frequentist point estimate would also completely miss the second mode in the last example with stan_nlmer. Thus, there is considerable reason to prefer the rstanarm variants of these functions for regression modeling. The only disadvantage is the execution time required to produce an answer that properly captures the uncertainty in the estimates of complicated models such as these.” (Conclusion in link)
“Estimating these models via MCMC avoids the optimization issues that often crop up with GAMMs and provides better estimates for the uncertainty in the parameter estimates.” (last sentence of first paragraph of the Details in the documentation)

If anyone has such references, examples or similar, I’d be keen to know of their existence.

Cheers,
Ben

emiruz · April 6, 2020, 11:29am

Why not do both? If you’re going to be cross validating your model then a good argument might be to show that the Bayesian version fits the data better in k-fold cross validation or whatever.

benskov · April 20, 2020, 9:51am

The point of the model is not predicting but causal inference on the effect estimates, so we don’t plan on using cross validation; only if PSIS-LOO fails.

The point on incorrect errors directly implies that using the implementation in rstanarm is desirable for the purpose of our analysis, so it would be nice to have a proper reference on this point.

Albeit interesting, the point on convergence problems is more technical and would only become relevant if the model one would fit turned out to fail.

maxbiostat · April 20, 2020, 12:52pm

You can always run simulation-based calibration and show that the model-fitting strategy you’re using is “consistent”.

benskov · April 20, 2020, 12:56pm

Thanks for the pointer, @maxbiostat! Will look into that.

maxbiostat · April 20, 2020, 12:58pm

If you decide to go down that road, maybe @bgoodri or @jonah can give some insight into how to do SBC with rstanarm .

Topic		Replies	Views
Rstanarm - for my own learning, trying to match credible interval to lm() confidence interval rstanarm specification	5	1214	November 3, 2020
Splines in Rstanarm General	4	871	April 23, 2021
Stan_gamm4 output discrepancy Modeling	4	891	November 8, 2018
Marginal and Conditional R2 for Stan's GLMMs Modeling	6	1595	February 1, 2019
Maximum likelihood via rstanarm::stan_glm rstanarm	1	524	December 2, 2022

Reference on benefits of rstanarm GAM models over frequentist alternatives?

Related topics