How to compute P-value for Mixture Model

yab · November 30, 2020, 2:22am

Hello! I’m fitting the data using two component mixture model 𝒚𝒊 ~ 𝑾𝟎𝒊𝒑(.|𝝀𝟎𝒊)+ 𝑾𝟏𝒊𝒑(.|𝝀𝟏𝒊). Where 𝑊0𝑖 and 𝑊1𝑖 represent the mixing probability, and 𝒑(.|𝝀) represent the Poisson distribution. After I pass through some processes using Stan package, I got like the following output:

yi	W1i
5	0.4
2	0.7
10	0.6
2	0.4

Finally, I define the new latent variable 𝑍𝑖, 𝑖 = 1,…,𝑛 that indicates the category of observation group, i.e., whether it is in the first or second category. The indicator variable has two outcomes (0 and 1), and it follows Bernoulli distribution, 𝑍𝑖~𝐵𝑒𝑟𝑛𝑜𝑢𝑙𝑙𝑖(𝑊1𝑖), for 𝑖 = 1, 2,…, 𝑛 and it is concluded that the observation 𝑖 is in the second group (I call it significant observations) whenever 𝑃(𝑍𝑖 = 1|𝑌) is bigger than a cutoff value, say 0.5.

my question is that:

is it need to use statistical significance or FDR to select significant observations, instead of using one ad hoc number (cutoff of the posterior probability of 𝑊1𝑖 > 0.5)?

Max_Mantei · December 3, 2020, 10:54am

Hi yab!

I’m not really sure, but I would say “it depends”. In a Bayesian approach you don’t “need” statistical significance thresholds. What constitutes a significant observations should IMO come from your domain expertise. If the notion is “it’s a significant observation when it’s more likely to be in category 1 than category 0”, then the W1i > 0.5 threshold makes sense. However, it could be a significant observation if it almost surely falls into category 1 and then you’d probably want to go for something like W1i > 0.95 or something along those lines. But that’s more of a decision (as in decision theory) than an estimation issue I would say.

I hope this was at least a bit helpful. Maybe others have more/different ideas…

Cheers,
Max

Topic		Replies	Views
Mixture Bayesian Poisson Regression Model Modeling	45	3900	April 17, 2019
Cumulative probit model: predicting the probability an observation is in a category or higher (i.e. above or below a single threshold) General brms	4	585	August 2, 2022
First Steps in Reporting Bayes Results - Stuck in Frequentist Thinking General	2	808	January 25, 2022
Multimodality issues in regression model with mixture prior Modeling techniques , fitting-issues	4	1020	August 29, 2019
Bayesian Noob: Interpreting Mixture Model Output Modeling techniques , interpret-results	0	633	June 5, 2019

How to compute P-value for Mixture Model

Related topics