Normal priors on scale parameter

JoLee · February 18, 2022, 6:37pm

I often use normal prior distributions on, for example, a scale parameter, \sigma, with support (0,\infty). I understand that Stan will transform \sigma to an unconstrained variable behind the scenes.

I also recall, although I have no references, that normal priors are often a better choice than many of the densities with positive support (e.g. a half-Cauchy).

Anyway, I often see papers specifying priors with support to match the constraints of variables (of course, a proper prior must be used). How would I formally state that I used a normal prior on a scale parameter, \sigma > 0?

I don’t want readers saying my prior is improper for \sigma (although technically it is; Stan just does some work behind the scenes).

Would it be correct to say the prior distribution is \sigma \sim N(0,10^6) (0,\infty)? It looks a little strange.

I ask here as I think I have seen many moderators suggest normal priors for variables with positive support since Stan knows how to handle this nicely.

mike-lawrence · February 18, 2022, 11:43pm

For reporting, I’ve seen people just use ~ half-normal(...) rather than ~ N(...).

By the way, a scale of 10^6 will only be sensible for quantities within an order of magnitude or so of that value; be sure not to use such a large value merely to make the prior “super-weakly-informed”.
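To put a rough number on this, here is a sketch that assumes the prior is a half-normal with scale s (s and the cutoff c are symbols introduced here, with s read as a standard deviation, as in Stan's normal). The prior probability that \sigma falls below c is an error function, and with s = 10^6 almost none of the prior mass sits below 10:

```latex
P(\sigma < c) = \operatorname{erf}\!\left(\frac{c}{s\sqrt{2}}\right),
\qquad
P(\sigma < 10) = \operatorname{erf}\!\left(\frac{10}{10^{6}\sqrt{2}}\right) \approx 8 \times 10^{-6}
\quad (s = 10^{6}).
```

Under that assumption, the prior places well over 99.99% of its mass on values of \sigma larger than 10.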

JoLee · February 19, 2022, 12:23am

So would you say \sigma \sim \text{Half-Normal}(0,10) would be appropriate to represent a \text{Normal}(0,10) prior on \sigma, when \sigma has support (0,\infty)?

A 95\% CI for \sigma is [0.70,1.19]. I very often use a \text{Normal}(0,10^6) prior, thinking it worked well as an uninformative prior.

yizhang · February 19, 2022, 12:27am

Not just that: recall that a 1/\sigma^3 term appears in the derivative of the normal lpdf, so a large \sigma risks underflow or at least a loss of precision.
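For reference, the term in question can be written out as follows. This is a sketch using generic symbols y and \mu for the variate and location, which are not taken from the thread:

```latex
\log \text{Normal}(y \mid \mu, \sigma)
  = -\log \sigma - \tfrac{1}{2}\log(2\pi) - \frac{(y-\mu)^2}{2\sigma^2},
\qquad
\frac{\partial}{\partial \sigma} \log \text{Normal}(y \mid \mu, \sigma)
  = -\frac{1}{\sigma} + \frac{(y-\mu)^2}{\sigma^3}.
```

If \sigma is on the order of 10^6, the factor 1/\sigma^3 is on the order of 10^{-18}, which is where the loss of precision could come from.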

mike-lawrence · February 19, 2022, 12:29am

The greater the scale, the more diffuse the prior credibility density, so a scale of 10^6 is highly uninformed.

betanalpha · February 25, 2022, 8:51pm

For some motivation for half-normal priors see Section 3 of Prior Modeling.

For why

real<lower=0> x;
...
x ~ normal(0, 1);

in Stan implements a (truncated) half-normal prior and not a normal prior, see An Introduction to Stan.
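As a minimal sketch of how the fragment above might sit inside a complete program: the data names N and y, the location parameter mu, and the scale of 10 are assumptions made here for illustration, not taken from the thread.

```stan
data {
  int<lower=0> N;        // number of observations (hypothetical name)
  vector[N] y;           // observed outcomes (hypothetical name)
}
parameters {
  real mu;               // location parameter (hypothetical)
  real<lower=0> sigma;   // scale parameter, constrained to be positive
}
model {
  // Because sigma is declared with lower=0, this statement only ever
  // evaluates the normal density on (0, infinity), so the implied
  // prior is half-normal with scale 10.
  sigma ~ normal(0, 10);
  y ~ normal(mu, sigma);
}
```

The missing normalizing factor of 2 is a constant here, so it does not change the posterior that Stan samples from.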

jsocolar · February 25, 2022, 10:21pm

Not quite. Instead, I would say that you don’t have a \mathrm{Normal}(0,10) prior on \sigma. You have a half-normal prior. The fact that we express this in Stan as sigma ~ normal(...) is just a bit of Stan syntax. The prior that is being encoded here, given a positivity constraint on sigma, is half-normal.
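For a write-up, one way to state this formally is sketched below, where s denotes the prior scale (a symbol introduced here, read as a standard deviation, as in Stan's normal):

```latex
\sigma \sim \text{Half-Normal}(0, s),
\qquad
p(\sigma) = \frac{2}{s\sqrt{2\pi}} \exp\!\left(-\frac{\sigma^2}{2 s^2}\right),
\quad \sigma > 0 .
```

The factor of 2 renormalizes the \text{Normal}(0, s) density after truncation to (0,\infty), so the prior is proper.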
