Understanding the Likelihood Model for the Cox Family in brms

kaybenleroll · November 3, 2025, 10:09pm

I am trying to understand the basic approach of the cox() family in brms. My understanding is that the baseline hazard rate is modelled as a spline using the splines2 package, and the covariates then act as proportional hazards on top of this baseline.

Effectively, rather than having a semi-parametric baseline rate like in the classic Cox-PH, you instead have a modelled baseline curve using a spline.

So far, so good.

The problem arises when I try to reconstruct that spline from the parameters output by the model. I appreciate that this output is not currently available in brms, and I am happy to hack out a solution myself, but I do not quite understand the logic in the cox() family code in brms.

I was extracting the spline, but the baseline rate seems to start at 1, so my estimated survival rates are much too low, even though brms and the classic MLE fit mostly agree on the values of the model parameters.

Any pointers are welcome, and I am happy to share code and anything else that might help!

bdeonovic · November 14, 2025, 3:43pm

Let's assume your brms model fit is called bfit. First, let's get the baseline hazard (M-spline) basis matrix. This matrix is based on the time-to-event values being modeled.

s_data <- standata(bfit)
b0 <- brms:::bhaz_basis_matrix(s_data$Y, list(df=5, intercept=TRUE)) ## you might need to modify the arguments passed to the basis function if you changed the defaults

Now let's pull out the posterior samples of the spline coefficients:

sbhaz_post <- rstan::extract(bfit$fit, "sbhaz")$sbhaz ## You'll need to do some more indexing if you stratified your model; I'll leave that as an exercise

Let’s construct a new basis matrix for a large range of time points that we wish to plot. These need to be I-splines, so we set integrate=TRUE:

cb <- brms:::bhaz_basis_matrix( seq(0, max(s_data$Y), by=0.1), basis=b0, integrate=TRUE)
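For reference, here is a sketch of how the two bases relate, assuming the usual M-spline/I-spline construction from splines2. The symbols \gamma_k (the sbhaz coefficients), M_k, I_k and t_L (the lower boundary knot) are notation introduced here for illustration, not names taken from the brms source:

```latex
% Baseline hazard as a weighted sum of M-spline basis functions
h_0(t) = \sum_{k=1}^{K} \gamma_k \, M_k(t)

% Each I-spline is the integral of the matching M-spline,
% taken from the lower boundary knot t_L
I_k(t) = \int_{t_L}^{t} M_k(u) \, du

% So the same coefficients give the cumulative baseline hazard
H_0(t) = \int_{t_L}^{t} h_0(u) \, du = \sum_{k=1}^{K} \gamma_k \, I_k(t)
```

This is why the same sbhaz draws can be multiplied by either basis: the M-spline basis gives a hazard, the I-spline basis gives a cumulative hazard.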

Now samples of the cumulative baseline hazard are given by cb %*% t(sbhaz_post), since cb is the integrated (I-spline) basis.

You can plot it with something like this:

library(tibble)
library(ggplot2)

## rows = time points, columns = posterior draws
cum_haz_draws <- cb %*% t(sbhaz_post)

tibble(
    time = seq(0, max(s_data$Y), by = 0.1),
    cum_haz = rowMeans(cum_haz_draws),
    lower = apply(cum_haz_draws, 1, quantile, 0.05 / 2),
    upper = apply(cum_haz_draws, 1, quantile, 1 - 0.05 / 2)
) |>
ggplot(aes(x = time, y = cum_haz)) +
    geom_line() +
    geom_ribbon(aes(ymin = lower, ymax = upper), alpha = 0.15)
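If what you want is a survival curve rather than the cumulative hazard, the standard proportional hazards relation is S(t | x) = exp(-H0(t) * exp(lp)), where lp is the linear predictor. Below is a minimal sketch of that step. The helper survival_from_cum_haz is hypothetical (it is not part of brms), it assumes a single fixed value of lp rather than per-draw values, and the toy matrix merely stands in for cb %*% t(sbhaz_post):

```r
## Hypothetical helper (not part of brms): convert cumulative baseline
## hazard draws into survival draws via S(t) = exp(-H0(t) * exp(lp)).
## lp = 0 gives the baseline survival curve.
survival_from_cum_haz <- function(cum_haz_draws, lp = 0) {
  stopifnot(is.matrix(cum_haz_draws), length(lp) == 1)
  exp(-cum_haz_draws * exp(lp))
}

## Toy stand-in for cb %*% t(sbhaz_post): 3 time points x 2 posterior draws
toy_cum_haz <- matrix(c(0, 0.5, 1.0,
                        0, 0.4, 0.9), nrow = 3)

toy_surv <- survival_from_cum_haz(toy_cum_haz)

## Posterior mean and 95% interval of survival at each time point
surv_summary <- data.frame(
  surv  = rowMeans(toy_surv),
  lower = apply(toy_surv, 1, quantile, 0.025),
  upper = apply(toy_surv, 1, quantile, 0.975)
)
```

Note that the transformation is applied to each draw before summarising, so the interval reflects the posterior of the survival curve itself rather than a transformed interval of the cumulative hazard.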

bdeonovic · November 20, 2025, 6:28pm

@paul.buerkner could you glance over this and let me know if I did this correctly?

kaybenleroll · November 30, 2025, 7:35pm

That is extremely helpful, thank you!

I think I know how to proceed now and I’ll let you know how I get on with it. Much appreciated.

rtnliqry · February 13, 2026, 9:25am

@bdeonovic Am I correct here that setting integrate = FALSE in bhaz_basis_matrix would return the baseline hazard (h_0(t), i.e. the derivative of the cumulative hazard) rather than the cumulative hazard as you show here?
