Clarification on what matrices brms uses to compute smooths

PhDemetri · June 22, 2024, 8:43pm

I’m writing a stan model and would like to use penalized smooths similar to how brms implements s(x).

Thanks to TJ Mahr’s excellet blog post on the topic, I feel like I have a handle on how brms might handle these smooths: there is 1 fixed effect (which appears to be a linear component) plus a bunch of random effects. The random effects are applied to the basis functions to compute the smooth.

I’m looking for clarification on what matrices are extracted from mgcv to be used in the model. Below, I show a small example that I think is correct and compare it to a brms fit. I use the mcycle data, similar to TJ’s post.

The stan model is

data{
  int n;
  int k;
  int k2;
  vector[n] y;
  matrix[n, k] X;
  matrix[n, k2] Z;
}
parameters{
  real Intercept;
  vector[k] beta;
  vector[k2] gamma;
  real<lower=0> sigma;
  real<lower=0> sigma2;
}
transformed parameters{
  vector[n] mu = X * beta + Z *(sigma2 .* gamma ) + Intercept;
}
model{
  Intercept ~ student_t(3, -13.3, 35.6);
  beta ~ normal(0, 1);
  gamma ~ normal(0, 1);
  sigma ~ student_t(3.5, 0, 35.6);
  sigma2 ~ student_t(3.5, 0, 35.6);
  y ~ normal(mu, sigma);
}

The matrices X and Z are extracted in the following way

library(tidyverse)
library(mgcv)

mcycle <- MASS::mcycle %>% 
  tibble::rowid_to_column(var = 'i')

sm <- smoothCon(
  s(times, k=-1),
  data=mcycle,
  absorb.cons = T,
  diagonal.penalty = T
)


re <- smooth2random(sm[[1]], "", type=2)

X <- re$Xf
Z <- re$rand$Xr

When I this model and a similar model using brms, I get very similar estimates. Show below is a plot of the predictions, where the line is the mean of mu from my model, and the dots are the results from predict(fit_brms). Things look pretty good, so I’m hopeful my appraoch is correct, but wanted to check.

Full code for reproducibility

library(tidyverse)
library(mgcv)
library(brms)
library(tidybayes)

mcycle <- MASS::mcycle %>% 
  tibble::rowid_to_column(var = 'i')

# Fit with brms

brms_formula <-  accel ~ s(times, k=10)

fit_brms <- brm(
  brms_formula,
  prior = c(
    prior(normal(0, 1), class = 'b')
  ),
  data = mcycle,
  backend = 'cmdstanr',
  adapt_delta = 0.99
)


pred <- predict(fit_brms) %>% 
  bind_cols(mcycle)


# -------------------------------------------------------------------------



stan_code <- '
data{
  int n;
  int k;
  int k2;
  vector[n] y;
  matrix[n, k] X;
  matrix[n, k2] Z;
}
parameters{
  real Intercept;
  vector[k] beta;
  vector[k2] gamma;
  real<lower=0> sigma;
  real<lower=0> sigma2;
}
transformed parameters{
  vector[n] mu = X * beta + Z *(sigma2 .* gamma ) + Intercept;
}
model{
  Intercept ~ student_t(3, -13.3, 35.6);
  beta ~ normal(0, 1);
  gamma ~ normal(0, 1);
  sigma ~ student_t(3.5, 0, 35.6);
  sigma2 ~ student_t(3.5, 0, 35.6);
  y ~ normal(mu, sigma);
}

'

sm <- smoothCon(
  s(times, k=-1),
  data=mcycle,
  absorb.cons = T,
  diagonal.penalty = T
)


re <- smooth2random(sm[[1]], "", type=2)

X <- re$Xf
Z <- re$rand$Xr

stan_data <- list(
  n= nrow(mcycle),
  y = mcycle$accel,
  X = X, 
  Z = Z,
  k = ncol(X),
  k2 = ncol(Z)
)


stan_code %>% 
  write_stan_file() %>% 
  cmdstan_model() -> model


fit_stan <- model$sample(stan_data, adapt_delta = 0.99)


fit_stan %>% 
  spread_draws(mu[i]) %>% 
  mean_qi(mu) %>% 
  inner_join(mcycle) %>% 
  ggplot(aes(times, mu)) +
  geom_line() +
  # geom_point(data=mcycle, aes(times, accel), color='red', inherit.aes = F) + 
  geom_point(data=pred, aes(times, Estimate ), color='red')

jonah · July 18, 2024, 4:34pm

Sorry nobody responded sooner. What you have looks reasonable, but it’s been a while since I looked at what brms is doing here, so I don’t really remember. The code that brms uses to prepare the data is at

github.com

paul-buerkner/brms/blob/2917e72ec0cbd9cd8ade6465cf44cfaf7c457461/R/data-predictor.R

#' Prepare Predictor Data
#'
#' Prepare data related to predictor variables in \pkg{brms}.
#' Only exported for use in package development.
#'
#' @param x An \R object.
#' @param ... Further arguments passed to or from other methods.
#'
#' @return A named list of data related to predictor variables.
#'
#' @keywords internal
#' @export
data_predictor <- function(x, ...) {
  UseMethod("data_predictor")
}

#' @export
data_predictor.mvbrmsterms <- function(x, data, sdata = NULL, ...) {
  out <- list(N = nrow(data))
  for (r in names(x$terms)) {

This file has been truncated. show original

and if you search for mgcv:: on that page you should find all the function calls to mgcv. I’m pretty sure it’s similar to what you have here.

(Also I agree that @tjmahr’s post is excellent!)

Topic		Replies	Views
Creating matrix of grouped smooths as in brms brms techniques	5	659	August 7, 2021
Group-level/varying/'random' effects syntax in `brms` formula similar to `mgcv` vs `lme4`? brms	2	588	February 24, 2023
Generalised Additive Modelling (GAMs) in Stan General rstan , techniques	4	1379	January 24, 2023
How to input matrix data into brms formula (for signal regression using smooths)? Modeling rstan , matrix , r , brms	5	1123	June 29, 2021
Using design matrices in stan (based on brms code) brms	7	1474	January 4, 2021

Clarification on what matrices brms uses to compute smooths

Related topics