New projpred is now on CRAN

AlejandroCatalina · October 28, 2020, 4:19pm

It’s now on CRAN with the latest version!

As mentioned in the NEWS.md, the latest release includes:

We have fully rewritten the internals in several ways. Most importantly, we now leverage maximum likelihood estimation to third parties depending on the reference model’s family. This allows a lot of flexibility and extensibility for various models. Functionality wise, the major updates since the last release are:

Added support for GLMMs and GAMMs via lme4 and gamm4 .
Formula syntax support internally that allows for easier building upon projections.
Thanks to the above point, we save some computation by only considering sensible projections during forward search instead of fitting every possible submodel.
We have added a new argument search_terms that allows the user to specify custom unit building blocks of the projections. This can be used to include fixed terms across all projections, for instance. New vignette coming up.
We have fully changed the way to define custom reference models. The user now provides projection fitting and prediction functions (more information in a new upcoming vignette).

More documentation and vignettes to come.

bgoodri · October 28, 2020, 5:44pm

I was wondering, but neither @avehtari nor @paul.buerkner knew, can the new projpred project to a linear model if the original formula is like y ~ s(x) or does the original formula need to be more like y ~ x + s(x)?

AlejandroCatalina · October 28, 2020, 5:58pm

I haven’t personally tried it out but my guess is it would work by passing search_terms = [“x”] to either varsel or cv_varsel. It would also work by hacking the formula slot in the refmodel object to say ref$formula <- y ~ x even though the reference model is a GAM.

Best,
Alejandro

bgoodri · October 28, 2020, 8:20pm

It seems like a linear (sub)model is an important special case of a GAMM, so should we be recommending the y ~ x + s(x) form so that projpred could find linearity without any additional arguments or hacks?

AlejandroCatalina · October 28, 2020, 8:24pm

Wouldn’t that suffer from high correlations as well? I’ll try projecting y ~ x + s(x) and see, but some tests that I did resulted in bad maximum likelihood estimates, because they are harder to find I believe. If the reference model incorporates and properly identifies both terms then both would be projected. Maybe what happens is that fitting y ~ s(x) already fits x + s(x)?

Best,
Alejandro

bgoodri · October 28, 2020, 8:49pm

Hmm. It seems that y ~ x + s(x) does not sample well, even though mgcv is making a rank-deficient spline basis. But varsel(..., search_terms = list("x")) on a model with y ~ s(x) yields

Error in sub[“kl”, i] : incorrect number of dimensions

Am I doing it wrong? There does not seem to be an example of using the search_terms argument.

AlejandroCatalina · October 28, 2020, 9:18pm

Yes, that was my concern, my experiments with models of the type x + s(x) didn’t sample well either. My guess is that s(x) already includes the linear unpenalized term in libraries like mgcv or gamm4, but I haven’t dig into that.

I’ll confirm tomorrow how to properly set the search_terms argument for varsel because it’s a parameter that we haven’t really used yet so it’s usage is a bit unexplored. I will add more examples and documentation for it.

Thanks for reporting!

Best,
Alejandro

AlejandroCatalina · November 9, 2020, 11:55am

I finally came back to this. So it happens that search_terms must include the intercept, so the correct syntax would be search_terms = c("1", "x"). This is something that is not very intuitive for the user so I’ll change it to detect whether it includes the intercept or not and include it myself automatically otherwise.

Sorry for the late response.

Topic		Replies	Views
Projection predictive variable and structure selection for GLMMs and GAMMs Publicity	8	755	October 16, 2020
Projpred: Projection Predictive Feature Selection now in CRAN Publicity	2	790	February 20, 2018
New update for projpred Announcements projpred	4	768	September 25, 2020
New projpred 2.1.1 in CRAN Publicity	1	536	April 28, 2022
Projection predictive feature selection for multilevel phylogenetic models Interfaces projpred	8	1313	March 25, 2021

New projpred is now on CRAN

Related topics