Help understanding prior specification

Longshot408 · January 10, 2020, 11:16pm

Hello all,
Quick and simple question about specifying priors for a logistic regression with stan_glm. Can someone explain exactly what is meant by both “scale” and “location”? I thought I got it, then I read this link and talked to my mentor a bit, and now it’s fuzzy. I dug through the documentation in the user’s guide and manual and couldn’t find explicit definitions or references to either.

Specifically, does the location simply mean “expected regression coefficient”? And if so, in the case of a logistic model with stan_glm, do I have to transform my expected coefficient values before putting them in the prior? Lets say I got a posterior beta weight of 1.3 for a coefficient in experiment one and wanted to set that as the prior location for experiment two. Would the location just be 1.3? Or log(1.3)? Etc.

Does scale mean the Standard Deviation? Or is it also a coefficient value, that is equal to 1 Standard Deviation? (e.g. I input 1.5, which means 1.0 Standard Deviations in that distribution=1.5 on the unit scale)

bbbales2 · January 10, 2020, 11:34pm

I think the scale and location lingo come from wanting to talk about parameters of lots of distributions (student-ts, cauchys), that don’t necessarily have means or standard deviations. For a normal, it is the mean and standard deviation.

These scales and locations by default are adjusted according to your data, so if you’re moving them from experiment to experiment I’d guess you’d want to turn off the automatic adjustment: Prior Distributions for rstanarm Models (all of that is probably useful to read)

If you’re doing this sort of model in brms you can use make_stancode and make_standata to see what priors are actually getting generated in the code. The tradeoff there is then you gotta compile models all the time, but being able to see the actual Stan code helps a lot in terms of answering these questions.

Longshot408 · January 10, 2020, 11:45pm

I’ve read most of those vignettes already, and sadly I’m still confused. Take this line of code for example, from the very last part of that page where he talks about setting more informed priors:

my_prior <- normal(location = c(-10, 0), scale = c(5, 2), autoscale = FALSE)
stan_glm(y ~ x1 + x2, data = dat, prior = my_prior)

I get that -10 could be a coefficient, but 5 for its Standard Deviation? At 3.0 SD’s you’ve captured 99.99% of the data already, so why 5? Or is 5.0 supposed to mean that 1 Standard Deviation in this distribution is equal to a change of 5 units?

bbbales2 · January 10, 2020, 11:54pm

5 and 2 would be the standard deviations of the first and second parameter.

So the prior for the first parameter is mean -10 with standard deviation 5. So 68% of the prior probability is between -15 and -5 in the prior.

Longshot408 · January 11, 2020, 12:00am

AHA! That clears things up. Thank you!

bbbales2 · January 11, 2020, 12:01am

But if you care about the super-specifics of your priors, then maybe do a few experiments to understand what the adjustment is doing.

Topic		Replies	Views
Prior statement in stan Algorithms	1	345	September 14, 2021
Help understanding stan_glm() commands Modeling specification	3	748	December 9, 2019
New to Stan, how do input my prior to the glm model General rstan , techniques , specification	1	509	April 27, 2020
Specifying different priors for different coefficients in RStanARM? rstanarm	6	585	April 2, 2020
Same model with different default priors in rstanarm and brms? rstanarm	4	1298	May 7, 2021

Help understanding prior specification

Related topics