Parameter scaling and hitting the maximum tree depth

roeysc · February 23, 2023, 7:20pm

Dear Stan Community,

I’m having some trouble fitting a large model. The model has many moving parts, so I will try to give the jist of it without having to copy all of it here.

In essence, I’m trying to model several dynamical processes on longitudinal data in a hierarchical model. These dynamical processes are weighted by four weights that sum to 1.

Two of the terms in my model are modeled as a scaled fraction of some baseline value called mu_pr:

pow_a_frac[1] = 5.0*Phi_approx(pow_a_pr[1]).*mu_pr[2];
lrn_intrcpt[1] = 5.0*Phi_approx(lrn_intrcpt_pr[1]).*abs(mu_pr[2]);

Here’s the tricky part. If I use a scaling of 5 as in this example, I get 100% of the samples hitting the maximum tree depth, and all(?) the model parameters are stuck tightly around their prior.

If, however, I use a scaling of 2, the model converges with little problem, and only 6% of the samples hit the maximum tree depth.

I’m not sure what to make of it. Is the scaling interfering with the MCMC posterior sampling? Is this related to the step size I’m using? Or am I missing something else completely?

Any help is greatly appreciated! (even just some sympathy).

Thanks,

Roey

betanalpha · March 7, 2023, 9:03pm

Given that the constant scaling should be equivalent to scaling mu_pr[2] my guess is that the constant scaling of 5 makes for a more difficult adaptation during warmup. In particular the default initialization in Stan might lead to more extreme behavior with the larger constant scaling which frustrates early exploration enough that the sampler adaptation ends up in a poor state that then compromises performance in the main sampling phase.

You can confirm this by digging into the Hamiltonian Monte Carlo adaptation configuration, in particular the individual step sizes and inverse metric elements, between the two fits.

roeysc · March 8, 2023, 11:12am

Thanks for this clear explanation, Michael!

Topic		Replies	Views
Convergence of multiple chains Modeling rstan , fitting-issues	5	348	March 27, 2024
Warning on some iterations saturated the maximum tree depth of 10 Modeling rstan , fitting-issues	2	431	October 26, 2023
Saturating the max tree depth for every sample in extremely simple model Modeling	14	1984	December 10, 2018
Model gets stuck - but not quite always Modeling	7	1357	July 18, 2018
Stan adjust MCMC sampling internally on a parameter-by-parameter basis? Modeling fitting-issues , reinforcement-learning , divergences , max_treedepth	3	76	December 10, 2024

Parameter scaling and hitting the maximum tree depth

Related topics