Marginalize over denominators in mixture of binomials

martinmodrak · February 11, 2020, 4:10pm

Thx @maxbiostat for tagging me. First note the thread at Sum of binomials when only the sum is observed where I discussed a similar topic. It turns out it is important how you choose N_1 and I think that the process you describe is actually identical to this one:

For each element:

Flip a biased coin (with prob \tau) independently
If the coin was heads, choose \hat{\theta} = \theta_1 else choose \hat{\theta} = \theta_2
Flip a biased coin with prob \hat{\theta}, if it is heads, add one to X

If that is so, then X \sim \text{Binomial}(N, \tau \theta_1 + (1-\tau) \theta_2) and you cannot infer any information about \theta_1, \theta_2, \tau individually.

If the process is actually different, the saddlepoint approximation mentioned in the paper linked by @maxbiostat might be sensible.

I wrote about implementing a saddlepoint approximation for sum of negative binomials here: https://www.martinmodrak.cz/2019/06/20/approximate-densities-for-sums-of-variables-negative-binomials-and-saddlepoint/ which discusses all the nuts and bolts to get it running in Stan.

For negative binomials the approximation was not very useful as there are simpler approximations that still work good. Binomials are however different (see the thread I linked earlier). Saddlepoint is however very slow to compute.

Best of luck with your model!

Topic		Replies	Views
Saddlepoint approximation for sums of negative binomials in Stan Publicity	1	822	July 3, 2020
Sum of binomials when only the sum is observed Modeling	21	3697	November 10, 2021
Mixture of Binomial Question Modeling	2	533	January 20, 2021
Marginalizing out unknown binomial sample size when observations are proportions Modeling	4	409	August 15, 2023
Marginalizing a double binomial-Poisson hierarchical distribution Modeling stan-math , ecology	7	872	June 16, 2022

Marginalize over denominators in mixture of binomials

Related topics