Latent Dirichlet Allocation with Binomial response

capplestein · October 12, 2022, 8:24pm

The standard LDA example given in the STAN manual assumes purely a bernoulli presence/absence response. I’m trying to edit this example for a binomial response where I have a proportion given by the number of species | number of total species hits.

I have two more variables added to the data for this response: the numerator (specieshits[N]) and the denominator (tothits[M]). I cannot figure out how to modify gamma in order to incorporate this type of response instead of a purely binary one.

data {
  int<lower=2> K;               // num communities
  int<lower=2> V;               // num species
  int<lower=1> M;               // num sites
  int<lower=1> N;               // total species instances
  int<lower=1,upper=V> w[N];    // species n
  int<lower=1,upper=M> doc[N];  // site ID for species n
  int specieshits[N];           //number of species hits at site M for species n
  int<lower=1> tothits[M];      // total number of all species hits at site M
  vector<lower=0>[K] alpha;     // community prior
  vector<lower=0>[V] beta;      // species prior
}
parameters {
  simplex[K] theta[M];   // community dist for site m
  simplex[V] phi[K];     // species dist for community k
}
model {
  for (m in 1:M)
    theta[m] ~ dirichlet(alpha);  // prior, proportion of each community at each site
  for (k in 1:K)
    phi[k] ~ dirichlet(beta);     // prior, proportion of each species within each community
  for (n in 1:N) {
    real gamma[K];
    for (k in 1:K)
      gamma[k] = log(theta[doc[n], k]) + log(phi[k, w[n]]);
    target += log_sum_exp(gamma);  // likelihood;
  }
}

Topic		Replies	Views
[Solved] Beta binomial with multilevel partial pooling Modeling	3	1055	February 12, 2018
Binomial Mixtures with a Mixture of Dirichlet Process Prior Modeling	2	509	April 7, 2022
Bayesian Beta Binomial model with latent draw Modeling rstan , hierarchical-model	6	591	August 1, 2020
Hierarchical overdispersed count models Modeling binomial-response , count-data	2	655	November 10, 2021
Partially pooled beta binomial model Modeling	9	1646	April 12, 2019

Latent Dirichlet Allocation with Binomial response

Related Topics