I normally use glmnet for variable selection (tutorial here). The brms documentation says that there is a lasso function, but I am struggling to get a working example. I get the error:
Error: Defining priors for single population-level parameters is not allowed when using horseshoe or lasso priors (except for the Intercept).
Could someone show a simple working example of variable selection using lasso with brms?
Please provide the code you want to get working. Also, I suggest using the horseshoe prior rather than the lasso, since the former provides much better shrinkage.
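For example (a minimal sketch, assuming brms is loaded), the horseshoe prior is set on the same population-level class "b" as the lasso prior:
# horseshoe prior on all regression coefficients
prior(horseshoe(df = 1), class = "b")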
This is a data set from “An Introduction To Statistical Learning”. It should be reproducible and relevant to variable selection via lasso.
library(ISLR)
library(tidyverse)
library(brms)
# drop rows with missing values before fitting
hitters <- Hitters %>% na.omit()
for_lasso <- brm(Salary ~ ., data = hitters)
summary(for_lasso)
You can set a lasso prior as follows:
for_lasso <- brm(Salary ~ ., data = hitters, prior = prior(lasso(), class = "b"))
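A small sketch, using the hitters data from above: if you are unsure which classes priors can be set on, get_prior() lists them before fitting, and prior_summary() shows what was actually used afterwards.
# list all parameter classes/coefficients for which priors can be specified
get_prior(Salary ~ ., data = hitters)
# after fitting, confirm the priors that were actually used
prior_summary(for_lasso)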
How can I be more aggressive or less aggressive with setting coefficients equal to 0? I was assuming that df was the argument for this, but maybe I am wrong. I am seeing that none of the following models have covariates getting set to 0.
Here is a more complete example:
library(ISLR)
library(tidyverse)
library(brms)
# define function to scale variables
my_scale <- function(...) as.numeric(scale(...))
hitters <- Hitters %>%
  na.omit() %>%
  # remove non-numerics before scaling
  select(-NewLeague, -League, -Division) %>%
  # scale all remaining columns
  mutate_all(my_scale)
for_lasso1 <- brm(Salary ~ ., data = hitters, prior = prior(lasso(df = 1), class = "b"),
                  iter = 500, chains = 3)
for_lasso2 <- brm(Salary ~ ., data = hitters, prior = prior(lasso(df = 10), class = "b"),
                  iter = 500, chains = 3)
for_lasso3 <- brm(Salary ~ ., data = hitters, prior = prior(lasso(df = 100), class = "b"),
                  iter = 500, chains = 3)
summary(for_lasso1)
summary(for_lasso2)
summary(for_lasso3)
That’s because you are in a Bayesian framework: there is no absolute shrinkage to zero. See the paper about the Bayesian lasso that I cite in the documentation of ?lasso.
In fact, the lasso prior is a poor shrinkage prior; I would rather suggest using the horseshoe prior instead.
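As a quick check (a minimal sketch using the models fitted above), the population-level estimates are shrunk toward zero but never exactly zero; at best you can see which credible intervals exclude zero.
# population-level coefficients with 95% credible intervals
est <- fixef(for_lasso1)
est
# covariates whose 95% interval excludes zero
rownames(est)[est[, "Q2.5"] > 0 | est[, "Q97.5"] < 0]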
This is the code with the horseshoe priors. After glancing at the paper, it seems as if the Bayesian lasso is a compromise between the lasso and ridge regression, but as you mentioned the coefficients don’t shrink to 0. In the paper they also used a double-exponential prior.
What is the justification for the horseshoe prior?
Also, is it true that the smaller the df, the more regularization, with df = 1 being the most regularized?
for_lasso1 <- brm(Salary ~ ., data = hitters, prior = prior(horseshoe(df = 1), class = "b"),
                  iter = 500, chains = 3)
for_lasso2 <- brm(Salary ~ ., data = hitters, prior = prior(horseshoe(df = 10), class = "b"),
                  iter = 500, chains = 3)
for_lasso3 <- brm(Salary ~ ., data = hitters, prior = prior(horseshoe(df = 100), class = "b"),
                  iter = 500, chains = 3)
summary(for_lasso1)
summary(for_lasso2)
summary(for_lasso3)
https://projecteuclid.org/euclid.ejs/1513306866
I don’t think so. I would say that the regularization is mostly due to the expected number of non-zero coefficients. Even so, you are not going to obtain exact zeros, although you can use the ideas in the projpred package to obtain a model with fewer coefficients that is expected to predict future data about as well.
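One way to encode that guess in brms is the par_ratio argument of horseshoe(), the expected ratio of non-zero to zero coefficients (a sketch, assuming roughly 3 of the 16 scaled covariates are relevant):
# horseshoe with a prior guess of about 3 non-zero out of 16 coefficients
fit_hs <- brm(Salary ~ ., data = hitters,
              prior = prior(horseshoe(df = 1, par_ratio = 3/13), class = "b"),
              iter = 500, chains = 3)
summary(fit_hs)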
I’ll add to Ben’s post: for getting coefficients equal to 0, see http://link.springer.com/article/10.1007/s11222-016-9649-y, and several examples and a video of projpred in https://github.com/avehtari/modelselection_tutorial
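A minimal sketch of that workflow, assuming a recent projpred version (function names have changed across releases), starting from one of the horseshoe fits above:
library(projpred)
# cross-validated search over submodels of increasing size
vs <- cv_varsel(for_lasso1)
# suggested number of covariates to keep
suggest_size(vs)
# covariates ranked by relevance
solution_terms(vs)
# project the posterior onto the suggested submodel
proj <- project(vs, nterms = suggest_size(vs))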
And see also Betancourt’s case study comparing the “lasso” prior and the horseshoe: https://betanalpha.github.io/assets/case_studies/bayes_sparse_regression.html