That doesn’t sound right: categorical_rng does not just select the highest-probability category, it samples in proportion to the probabilities. If \theta = (0.25, 0.75), then on average one in four individuals is assigned to the first category.
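If you want to convince yourself, here is a minimal standalone sketch (my own example, run with algorithm=fixed_param): about 25% of the draws of z should come out as category 1.

transformed data {
  vector[2] theta = [0.25, 0.75]'; // the example weights from above
}
generated quantities {
  // each iteration samples a fresh category in proportion to theta
  int z = categorical_rng(theta);
}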
Anyway, the mixture weight theta is essentially a prior over category assignments, and categorical_rng(theta) samples the category of a new, unobserved individual. You can recover the categories of observed individuals by applying Bayes’ theorem and sampling from the posterior assignment probabilities. This is straightforward: the posterior is the prior-weighted likelihood, normalized to sum to one.
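Concretely, writing $y_n$ for logod[n], the posterior assignment probability is

$$
\Pr(Z_n = k \mid y_n) = \frac{\theta_k \, \mathrm{Normal}(y_n \mid \mu_k, \sigma_k)}{\sum_{j=1}^{K} \theta_j \, \mathrm{Normal}(y_n \mid \mu_j, \sigma_j)},
$$

which is exactly what the generated quantities block below computes on the log scale.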
generated quantities {
  vector[N] log_lik;          // pointwise log-likelihood, for LOO
  int<lower=1, upper=K> Z[N]; // posterior draw of each individual's group index
  for (n in 1:N) {
    vector[K] ll;
    for (k in 1:K) {
      // log prior weight plus component log-likelihood
      ll[k] = log(theta[k]) + normal_lpdf(logod[n] | mu[k], sigma[k]);
    }
    log_lik[n] = log_sum_exp(ll);                 // log marginal likelihood of observation n
    Z[n] = categorical_rng(exp(ll - log_lik[n])); // sample from normalized posterior probabilities
  }
}
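As a side note, exp(ll - log_lik[n]) is just softmax(ll), so you could write Z[n] = categorical_rng(softmax(ll)) instead; and if your Stan version has categorical_logit_rng, Z[n] = categorical_logit_rng(ll) skips the explicit normalization entirely.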