Choosing prior for "overdispersion" in Dirichlet Multinomial distribution

Jyotishka · May 11, 2020, 4:37pm

Sorry for commenting on an old thread, but I was looking for recommendations for modeling the over-dispersion parameter in an integrated Dirichlet-Multinomial model and it seemed there’s an excellent thread already (the one by @stemangiola on the very first post on this thread).

Is there a recommended prior for the over-dispersion parameter in the integrated Dirichlet–multinomial model, or should we treat it as a tuning parameter or estimate it by empirical Bayes? Anything that you’ve found particularly useful? Any practical suggestions or insights would be really helpful.

Context: I was trying to mimic the numerical study in this paper (but using a shrinkage prior rather than spike-and-slab). It seems they have a precision parameter, but I couldn’t find out how to model it.

Full paper here (if anyone is interested):
http://www.stat.rice.edu/~marina/papers/paper_BMCBIOINFO.pdf

martinmodrak · May 12, 2020, 12:43pm

I don’t have any good suggestions for this case in particular, but generally there are two (complementary) ways to choose priors that I like:

- prior predictive checks (as in https://arxiv.org/abs/1709.01449)
- penalized complexity priors, i.e. make the prior favor a simpler model. In your case it might make sense to favor \psi = 0 with something like \psi \sim \mathrm{Beta}(1, q) for a suitable q. Some more background and maths are in https://arxiv.org/abs/1403.4630, but I won’t pretend I completely understand it, nor that my suggestion of prior here follows the maths of the paper.
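To get a feel for what "a suitable q" means, here is a minimal sketch of a prior summary, a crude stand-in for a full prior predictive check. It assumes (the thread does not confirm this) that \psi is the intra-class-correlation form of overdispersion, \psi = 1 / (1 + \sum_k \alpha_k), in which case the DM variance equals the multinomial variance times 1 + (n - 1)\psi for n trials. The function names are hypothetical and only the Python standard library is used.

```python
import random

def beta1q_cdf(x, q):
    """CDF of Beta(1, q) at x, which has the closed form 1 - (1 - x)**q."""
    return 1.0 - (1.0 - x) ** q

def variance_inflation(psi, n):
    """DM variance relative to the multinomial variance for n trials.

    ASSUMPTION: psi = 1 / (1 + sum(alpha)); other parametrizations differ.
    """
    return 1.0 + (n - 1) * psi

def prior_summary(q, n, draws=20000, seed=1):
    """Median and 90% quantile of the variance inflation implied by Beta(1, q)."""
    rng = random.Random(seed)
    infl = sorted(variance_inflation(rng.betavariate(1.0, q), n)
                  for _ in range(draws))
    return {"median": infl[draws // 2], "q90": infl[int(0.9 * draws)]}

# Compare how strongly different q values pull the model toward psi = 0.
for q in (1, 5, 20):
    print(q, round(beta1q_cdf(0.1, q), 3), prior_summary(q, n=100))
```

Larger q puts more prior mass near \psi = 0, i.e. near the plain multinomial. Checking whether the implied variance inflation looks plausible for your counts is one way to pick q before fitting anything.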

Jyotishka · May 12, 2020, 1:34pm

Thanks a lot @martinmodrak !

I love PC priors and have used them before (for a different problem), and it makes a lot of sense to think in that direction. I was actually using a Beta(\frac{1}{2}, \frac{1}{2}) prior on \psi, in the same spirit as the regular horseshoe, i.e. favouring \psi \approx 0 and \psi \approx 1 (but it leads to more divergences, higher \hat{R}, etc.). Maybe a Beta(1, \beta) or Beta(\alpha, 1) with \alpha < 1 will work better.

I’ll also check the prior sensitivity: that ought to give us some insights.

martinmodrak · May 12, 2020, 2:12pm

If you are having divergences, one thing I would check is whether, as \psi \rightarrow 0, you are actually passing large or infinite parameters to the DM distribution, which might cause numerical issues that manifest as divergences.

I am not sure how to handle this well, but I guess some form of mathematical rearrangement to give a numerically stable implementation of DM in this parametrization should be possible (I don’t see it immediately though…).
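One possible rearrangement, sketched here in Python rather than Stan, uses the rising-factorial form of the DM pmf. It assumes the parametrization \alpha_k = p_k (1 - \psi) / \psi, which may not be the one used in the paper or the earlier thread. Writing each Gamma ratio as a product and multiplying numerator and denominator by \psi^n gives

\log P(x) = \log n! - \sum_k \log x_k! + \sum_k \sum_{j=0}^{x_k - 1} \log(p_k (1 - \psi) + j \psi) - \sum_{j=0}^{n-1} \log((1 - \psi) + j \psi),

which never forms \alpha_k explicitly and reduces to the multinomial at \psi = 0. The function names are hypothetical.

```python
import math

def dm_logpmf_psi(x, p, psi):
    """Dirichlet-multinomial log pmf, stable as psi -> 0.

    ASSUMPTION: alpha_k = p_k * (1 - psi) / psi, with 0 <= psi < 1 and all
    p_k > 0. Cost is O(sum(x)), so this suits moderate counts.
    """
    n = sum(x)
    out = math.lgamma(n + 1) - sum(math.lgamma(xk + 1) for xk in x)
    for xk, pk in zip(x, p):
        for j in range(xk):
            out += math.log(pk * (1.0 - psi) + j * psi)
    for j in range(n):
        out -= math.log((1.0 - psi) + j * psi)
    return out

def dm_logpmf_lgamma(x, p, psi):
    """Textbook lgamma form; needs psi > 0 and degrades as psi -> 0."""
    a = [pk * (1.0 - psi) / psi for pk in p]
    total, n = sum(a), sum(x)
    out = math.lgamma(n + 1) - sum(math.lgamma(xk + 1) for xk in x)
    out += math.lgamma(total) - math.lgamma(total + n)
    out += sum(math.lgamma(ak + xk) - math.lgamma(ak) for ak, xk in zip(a, x))
    return out

def multinomial_logpmf(x, p):
    """Plain multinomial log pmf, the psi = 0 limit."""
    out = math.lgamma(sum(x) + 1) - sum(math.lgamma(xk + 1) for xk in x)
    return out + sum(xk * math.log(pk) for xk, pk in zip(x, p))

x, p = [3, 1, 6], [0.2, 0.3, 0.5]
print(dm_logpmf_psi(x, p, 0.3), dm_logpmf_lgamma(x, p, 0.3))
print(dm_logpmf_psi(x, p, 0.0), multinomial_logpmf(x, p))
```

The two forms should agree for moderate \psi. For very small \psi the lgamma version subtracts huge, nearly equal terms and is undefined at \psi = 0, while the product form stays finite. Whether this actually removes the divergences would need to be checked in the real model.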

I am also moving this discussion to a new thread for clarity.
