Given a fitted bimodal Gaussian mixture model, how can I verify from the model parameters that a model is identifiable or not?
It essentially means that the model parameters remain ambiguous no matter how much data you have. If you try to fit a Gaussian Process to pure noise for instance, the noise level and the length scale are not jointly identifiable because the data can be explained just as well as pure noise (the âcorrectâ inference) or by a very rapidly varying function with no noise.
In your case, whether or not itâs been fittted already and what the estimated parameter values, isnât relevant since identifiability is a property of the model itself, not any particular instance of it. GMMs in general are not identifiable because there is no difference between \theta_1 \mathcal{N}(0, 1 / \theta_1^2) + (1 - \theta_1)\mathcal{N}(0, 1/(1 - \theta_1)^2) and \mathcal{N}(0, 1) where the latter is just a degenerate mixture model with \theta_1 = 1.
There is a symmetry which makes it impossible to distinguish between various different parameter settings.
Hello! This topic comes up quite regularly. The following posts and links seem relevant:
- Blog post: Identifying non-identifiability
- Trouble with Gaussian Mixture Model
- Identifiability of Gaussian mixture mode
and the ever relevant:
I donât think this is accurate . A two mixture Gaussian is made identifiable by a simple ordering constraint on the locations.
Typically you just fit your model with at least 4 chains and youâll see the Rhats of your parameters converge to something close to 1 if your model is identified. Itâs unlikely to happen by accident.
@hhau Thank you for these.
I went through âhttps://betanalpha.github.io/assets/case_studies/identifying_mixture_models.html#42_breaking_the_labeling_degeneracy_with_non-exchangeable_priorsâ
In this model, how could I draw posterior samples ?