# How to write the poisson_lpmf for the product of two Poisson distributions

Let `Wi = (Xi, Yi), i = 1, 2, ..., n` be a set of two-dimensional
random vectors, and assume that:

`(𝑋, 𝑌) ~ 𝑤0 𝑓(.|𝜆0, 𝜇0) + 𝑤1 𝑓(.|𝜆1, 𝜇1) + 𝑤2 𝑓(.|𝜆2, 𝜇2)`

This is a three-component mixture model, and `𝑓(.|𝜆, 𝜇)` is the product of two Poisson distributions. Is it correct to write the `poisson_lpmf` in this way?

```stan
model {
  // ...
  for (i in 1:N) {
    lp = log(p[i, 1]) + poisson_lpmf(y1[i] | mu2[i] * exp(z1))
                      + poisson_lpmf(y2[i] | mu2[i]);
    lp = log(p[i, 2]) + poisson_lpmf(y1[i] | mu1[i])
                      + poisson_lpmf(y2[i] | mu1[i]);
    lp = log(p[i, 3]) + poisson_lpmf(y1[i] | mu1[i])
                      + poisson_lpmf(y2[i] | mu1[i] * exp(z2));
  }
  target += log_sum_exp(lp);
}
```


So far as I can see, the Stan code matches your intentions. Generally, a good way to test is to generate some mock data and see if you can recover the parameters. It’s fairly unlikely that your model will do a good job if the code doesn’t work.
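A minimal illustration of this mock-data test, using a single Poisson rate rather than the full mixture (the value of `true_lambda` here is an arbitrary assumption, chosen just for the sketch):

```python
import numpy as np

rng = np.random.default_rng(0)
true_lambda = 4.0                    # assumed "true" value for the test
y = rng.poisson(true_lambda, size=10_000)

# The Poisson MLE is just the sample mean; a working fitting pipeline
# should recover something close to the value used to simulate.
lambda_hat = y.mean()
```

The same recipe scales up to the mixture model: fix all parameters, simulate `(y1, y2)` pairs, fit, and check the posterior covers the values you simulated with.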


in this equation, I want to specify the following relationship:

`𝜆0 > 𝜇0, 𝜆1 = 𝜇1 and 𝜆2 < 𝜇2`.

```stan
transformed parameters {
  vector[N] mu1 = exp(alpha1 + x1 * beta1);
  vector[N] mu2 = exp(alpha2 + x2 * beta2);
}

// Likelihood
for (i in 1:N) {
  lp = log(p[i, 1]) + poisson_lpmf(y1[i] | mu1[i])
                    + poisson_lpmf(y2[i] | mu1[i] - z1);
  lp = log(p[i, 2]) + poisson_lpmf(y1[i] | mu1[i])
                    + poisson_lpmf(y2[i] | mu1[i]);
  lp = log(p[i, 3]) + poisson_lpmf(y1[i] | mu2[i] - z2)
                    + poisson_lpmf(y2[i] | mu2[i]);
}
target += log_sum_exp(lp);
```

I compiled the code this way, but I am still confused about the likelihood part. I have different covariates for `y1` and `y2`. But when I write it as above, e.g. in the first `lp`, to express `mu1 > mu2` I declared `mu2 = mu1 - z1`. Since I used `mu1` only, I lose the covariates of `y2`. The same is true for the others.

Therefore, is there any technique that keeps the covariates of `y1` and `y2` in each `lp`, conditional on the relationship `𝜆0 > 𝜇0, 𝜆1 = 𝜇1 and 𝜆2 < 𝜇2`?

The loop overwrites `lp`, so `target += log_sum_exp(lp);` must be inside the loop.
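To illustrate in Python (the weights, rates, and data below are made-up numbers, not from the model above): the three component terms for observation `i` are collected into one array, log-sum-exp'd, and the result accumulated once per observation, inside the loop:

```python
import numpy as np
from scipy.stats import poisson

rng = np.random.default_rng(0)
N = 5
y1 = rng.poisson(3.0, N)
y2 = rng.poisson(3.0, N)
p = np.full((N, 3), 1.0 / 3.0)                          # per-observation mixture weights
rates = np.array([[2.0, 3.0], [3.0, 3.0], [3.0, 4.0]])  # (rate for y1, rate for y2) per component

total = 0.0
for i in range(N):
    # lp holds all three component terms for observation i at once ...
    lp = np.array([np.log(p[i, k])
                   + poisson.logpmf(y1[i], rates[k, 0])
                   + poisson.logpmf(y2[i], rates[k, 1])
                   for k in range(3)])
    # ... and the log-sum-exp happens here, once per i, inside the loop.
    total += np.logaddexp.reduce(lp)
```

Assigning to a scalar `lp` three times and calling `log_sum_exp` after the loop, as in the code above, keeps only the last component of the last observation.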

The code you’ve written has covariates for both `mu1` and `mu2` and encodes a relationship between the mus and lambdas (there is no `lambda0` or `mu0`, though?). But you want a relationship between `mu1` and `mu2`, right? I don’t quite understand what your goal is.

YES!

```stan
parameters {
  // ...
  real<lower=0, upper=100000> d; // hyperparameter that gives more flexibility to the Dirichlet distribution
}

transformed parameters {
  vector[N] mu1 = exp(alpha1 + x1 * beta1);
  vector[N] mu2 = exp(alpha2 + x2 * beta2);
}

model {
  real lp;
  beta1 ~ normal(0, 3);
  alpha1 ~ normal(0, 3);
  beta2 ~ normal(0, 3);
  alpha2 ~ normal(0, 3);
  z1 ~ normal(0, 3);
  z2 ~ normal(0, 3);

  for (i in 1:N) {
    p[i] ~ dirichlet(a[i] * d);
  }

  // Likelihood
  for (i in 1:N) {
    lp = log(p[i, 1]) + poisson_lpmf(y1[i] | mu1[i])
                      + poisson_lpmf(y2[i] | mu1[i] - z1);
    lp = log(p[i, 2]) + poisson_lpmf(y1[i] | mu1[i])
                      + poisson_lpmf(y2[i] | mu1[i]);
    lp = log(p[i, 3]) + poisson_lpmf(y1[i] | mu2[i] - z2)
                      + poisson_lpmf(y2[i] | mu2[i]);
  }
  target += log_sum_exp(lp);
}
```

Here, I want to express `mu1 > mu2` in the first `lp`, `mu1 = mu2` in the second, and `mu1 < mu2` in the third. To achieve this, I used the expression above: for example, in the first `lp` I substitute the value of `mu2` with `mu1 - z1`, where `z1` is a positive number.

My question is: if I substitute `mu2` with `mu1 - z1`, I ignore `mu2 = exp(alpha2 + x2*beta2)` and instead use `exp(alpha1 + x1*beta1) - z1`. But `mu2` should be estimated from `exp(alpha2 + x2*beta2)`, so this substitution may cause some misestimation. Therefore, would you please suggest some technique that expresses the relation between the `mu`'s in each `lp` (rather than the technique of adding or subtracting positive numbers)?

And to give more flexibility to the Dirichlet distribution (in `p[i] ~ dirichlet(a[i]*d)`), I gave the hyperparameter `d` a uniform prior. But I have read somewhere that the Stan team discourages uniform priors. Therefore, would you please tell me which priors would give more flexibility to the Dirichlet distribution?
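For intuition on what `d` does (a numpy sketch with an arbitrary base simplex `a`, not your `a[i]`): the scale of the concentration vector controls how tightly draws concentrate around `a`, with small `d` scattering draws toward the corners of the simplex and large `d` pinning them near `a`. A weakly informative positive prior on `d` (e.g. a gamma or half-normal) is a common alternative to a hard uniform, though the right choice depends on your model.

```python
import numpy as np

rng = np.random.default_rng(1)
a = np.array([0.5, 0.3, 0.2])   # hypothetical base simplex

# Spread of the first simplex component under Dirichlet(a * d):
# the standard deviation shrinks as d grows.
stds = {}
for d in (0.5, 5.0, 500.0):
    draws = rng.dirichlet(a * d, size=2000)
    stds[d] = draws[:, 0].std()
```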

This is what I don’t understand. Whether `mu1 > mu2` or `mu1 < mu2` is solely a function of whether `x1 < x2`. What is the difference between the mixture components? If all you want is to calculate the probability that `mu1 < mu2`, then it’s not a mixture model.

Usually a good guide to model construction is to think about how you would simulate fake data.
Let’s say someone gives you an algorithm that supposedly solves your inference problem. A simple but effective way to test that the algorithm works is:

1. assume values for parameters `alpha`, `beta`, etc
2. simulate fake data conditional on those parameter values
3. use the algorithm to estimate the parameters from the fake data
4. compare the estimates to the known “true” parameter values

Can you describe how you would do step 2?
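For concreteness, here is one way step 2 might look in Python for a model with this structure. All numeric values are arbitrary assumptions, the mixture weights are fixed rather than drawn from a Dirichlet for simplicity, and the clamping of the subtracted rates at a small positive floor is an assumption worth noticing: the Stan code's `mu - z` subtraction can produce negative Poisson rates.

```python
import numpy as np

rng = np.random.default_rng(42)

# Step 1: assumed "true" parameter values (arbitrary choices).
N = 200
alpha1, beta1 = 0.5, 0.8
alpha2, beta2 = 0.2, -0.4
z1, z2 = 1.0, 1.0
weights = np.array([0.3, 0.4, 0.3])   # fixed mixture weights, for simplicity

x1 = rng.normal(size=N)
x2 = rng.normal(size=N)
mu1 = np.exp(alpha1 + x1 * beta1)
mu2 = np.exp(alpha2 + x2 * beta2)

# Step 2: pick a component for each i, then draw (y1, y2) from the
# corresponding pair of Poisson rates, mirroring the three lp terms.
comp = rng.choice(3, size=N, p=weights)
rate1 = np.select([comp == 0, comp == 1, comp == 2],
                  [mu1, mu1, np.maximum(mu2 - z2, 0.01)])
rate2 = np.select([comp == 0, comp == 1, comp == 2],
                  [np.maximum(mu1 - z1, 0.01), mu1, mu2])
y1 = rng.poisson(rate1)
y2 = rng.poisson(rate2)
```

If a step like the clamping feels wrong while simulating, that usually means the model itself needs rethinking, which is exactly what this exercise is for.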
