Truncated normal not mixing sometimes

fitting-issues
performance

#21

Ok, at this stage the “performance” you are looking for is a sampler that can get around the distribution and give you a chance to understand the mixing problems. If the stepsize crashes then you have a sampler that can’t even do that… so lowering adapt_delta is the right step here.

So what are the performance problems you are getting?


#22

It was the mixing in general. I will re-run it and post what it looks like with adapt_delta = 0.6. I knew I should have saved it - sorry for not doing so!

As for changing distributions, I’m also keen to do that, given that the normal without truncation works so well.


#23

So this is what I mean by “performance” with adapt_delta = 0.6:

And here is the acceptance:


#24

So stepsize still crashes.


#25

Not as epically as before:
[image]

In better news, it occurred to me that some of the groups (i.e. sites) might have distinct distributions. I previously explored the balance by month and day but not by site. It turns out there are small extreme values accounting for 2%, 4%, and 10% of the values in three of the sites, respectively. Dropping those three sites, the model fits quite nicely in 200 iterations. So, as I suspected, it was something to do with the raw data. I’m still not sure why the truncated normal couldn’t handle all this, especially as these observations aren’t overly common in total, but posterior geometries are well beyond my understanding!


#26

I should add that the other sites do have small extreme values but they’re much less common.


#27

Sigh, it’s not the truncated normal that’s the problem.


#28

A bit of both, I’d say… Why shouldn’t it work if there is a lot of mass near the lower bound? The three sites I mentioned weren’t showing great signs of bimodality, the problems were just a few hundred observations in a dataset of 87k, and it all worked fine without truncation. I’m probably just too ignorant of the underlying maths to see why this is obvious.


#29

The computational problem is that lower bounds in the constrained space map to negative infinity in the unconstrained space. So they tend to require large step sizes, which can be unstable w.r.t. overflow.
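To make that concrete, here’s a minimal sketch in Python/NumPy (purely illustrative; the log transform is the one used for lower-bounded parameters, but the bound and values are made up):

```python
import numpy as np

# Sketch of the lower-bound transform: a parameter x with lower bound lb
# maps to the unconstrained value y = log(x - lb), and back via
# x = lb + exp(y).
lb = 0.0

def unconstrain(x, lb=lb):
    return np.log(x - lb)

def constrain(y, lb=lb):
    return lb + np.exp(y)

# Mass piled up just above the lower bound ends up far out toward
# negative infinity in the unconstrained space:
xs = np.array([1e-1, 1e-4, 1e-8, 1e-16])
print(unconstrain(xs))   # roughly [-2.3, -9.2, -18.4, -36.8]

# Coming back, exp() of a large positive unconstrained value overflows
# in double precision:
print(constrain(710.0))  # inf
```

So a sampler that has to reach the region near the bound needs to travel a long way in the unconstrained space, and the exp() in the inverse transform is where the overflow instability bites.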

The statistical problem is that people think that if the true value of a parameter theta is 1.2, then imposing a prior like uniform(1, 2) won’t affect the posterior mean. That’s true for maximum likelihood, but not for Bayes. Any mass that would’ve fallen below 1 gets pushed above 1, so the estimate will be higher than it would have been under a uniform(-10, 10) prior. In other words, adding a lower bound that truncates some mass pushes that mass above the bound, (usually unintentionally) increasing the posterior mean.
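A quick simulation of that effect (the numbers here are made up for illustration, and simple rejection stands in for the truncation):

```python
import numpy as np

rng = np.random.default_rng(0)

# Suppose the posterior for theta under a wide uniform(-10, 10) prior
# is roughly Normal(1.2, 0.5) (made-up numbers for illustration).
draws = rng.normal(1.2, 0.5, size=100_000)
print(draws.mean())      # ~1.2

# Imposing a uniform(1, 2) prior truncates that posterior to (1, 2);
# the mass that was below 1 is pushed above it, raising the mean.
truncated = draws[(draws > 1) & (draws < 2)]
print(truncated.mean())  # ~1.41, noticeably above 1.2
```

The true value 1.2 is inside the bounds, yet the posterior mean still shifts upward, which is exactly the unintentional bias described above.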


#30

I just scheduled a blog post on Andrew’s blog for tomorrow that provides an example (where tomorrow is 28 November 2017, 13:00 EST).