Given a standard model like y_i \sim \mathrm{Poisson}\left(\lambda\right), the calculation of the log-likelihood for use with the loo package is straightforward:
vector[N] log_lik;
for (n in 1:N) {
  log_lik[n] = poisson_lpmf(y[n] | lambda);
}
with x_i and y_i jointly distributed.
In this case, how can I write the log-likelihood calculation in the generated quantities of my model?
Thank you in advance!
Thanks @avehtari for the quick answer. I would like to compare the two models proposed by Jake Thompson for modeling soccer scores.
Here are the two models:
As I expected, the Poisson terms are independent conditional on the parameters; thus the likelihood is the product of those terms, and the log-likelihood is the sum of them.
Given vector/array arguments, _lpmf and _lpdf functions sum the log probabilities / densities together, so to get the individual terms we need a for loop in the generated quantities block.
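For a model of home and away scores, a sketch of that loop might look like this (assuming match-level rates lambda1 and lambda2 and observed scores h_goals and a_goals; these names are illustrative):

```stan
generated quantities {
  vector[N] log_lik;
  for (n in 1:N) {
    // per-match joint log-likelihood: the home and away scores are
    // conditionally independent given the rates, so their log-pmfs add
    log_lik[n] = poisson_lpmf(h_goals[n] | lambda1[n])
                 + poisson_lpmf(a_goals[n] | lambda2[n]);
  }
}
```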
This will compute the joint log-likelihood of h_goals[n] and a_goals[n]. PSIS-LOO computes what would happen if the nth log_lik term were removed from the target, which in this case corresponds to leaving out both observations h_goals[n] and a_goals[n].
Here’s my take on the bivariate Poisson (with a common rate lambda_common), which is a bit more tedious but IMO more precise. It does not treat the goals themselves as independent.
// Log-pmf of the bivariate Poisson: the shared component z is
// summed out by enumerating z = 0..min(goal1, goal2).
real bivariate_match_lpdf_f(int goal1, int goal2, real lambda1,
                            real lambda2, real lambda_common) {
  int m = min(goal1, goal2);
  vector[m + 1] log_terms;
  for (z in 0 : m) {
    int r1 = goal1 - z;  // home goals not from the common component
    int r2 = goal2 - z;  // away goals not from the common component
    log_terms[z + 1] = r1 * log(lambda1) - lgamma(r1 + 1)
                       + r2 * log(lambda2) - lgamma(r2 + 1)
                       + z * log(lambda_common) - lgamma(z + 1);
  }
  return -(lambda1 + lambda2 + lambda_common) + log_sum_exp(log_terms);
}
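A sketch of how this function could be wired into a model, assuming data h_goals/a_goals with match-level rates lambda1/lambda2 as elsewhere in the thread (all names are illustrative):

```stan
model {
  for (n in 1:N) {
    // the user-defined function returns the joint log-pmf of both scores
    target += bivariate_match_lpdf_f(h_goals[n], a_goals[n],
                                     lambda1[n], lambda2[n], lambda_common);
  }
}
generated quantities {
  vector[N] log_lik;  // one term per match, for use with the loo package
  for (n in 1:N) {
    log_lik[n] = bivariate_match_lpdf_f(h_goals[n], a_goals[n],
                                        lambda1[n], lambda2[n], lambda_common);
  }
}
```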
My version did not assume independence either, but it’s likely that numerical accuracy can be improved by rearranging the computation. Thanks for sharing the code!
I might be deeply wrong about this and misunderstand something about the topic, but here are the joint distributions I get from using both approaches (with a large lambda_common = 4, and lambda1, lambda2 = 1.2, 0.9). The left-side approach considers the common goals to be the same for any given match (which is how I understand the correlation between the home/away goals). My understanding is that the right-side approach (the original approach from this topic) ignores the correlation. I have found the correlation essential for football modeling; I don’t think the common lambda is even identifiable without it (well, maybe as an intercept). I may have misunderstood the code or the approach of this topic, but those are two very different distributions.
The terms in the first model are independent conditional on the parameters, so when you fix the parameters you should see an independent bivariate distribution. The two scores still have dependency due to the common lambda3 parameter (and the prior structure affects the strength of that dependency).
Looking at the bivariate Poisson in Wikipedia, we see where the difference comes from. To make the difference clearer, the original model in the question could in theory be rewritten as
We really can’t write the model like this in Stan, as Y1, Y2, and Y3 are unknown discrete variables and Stan does not support sampling of discrete variables. However, as Y1, Y2, and Y3 are bounded above by h_goals and a_goals, they have compact support, and we can integrate them out by enumerating all possible combinations with the equation on Wikipedia and with your code. This model has stronger dependency, including dependency conditional on fixed parameters.
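Writing this out: with X_1 = Y_1 + Y_3, X_2 = Y_2 + Y_3, and independent Y_j \sim \mathrm{Poisson}(\lambda_j), summing over the possible values z of the shared component Y_3 gives the marginal pmf (the same sum the function earlier in the thread enumerates on the log scale):

$$
P(X_1 = k_1,\, X_2 = k_2) = e^{-(\lambda_1+\lambda_2+\lambda_3)} \sum_{z=0}^{\min(k_1,k_2)} \frac{\lambda_1^{\,k_1-z}}{(k_1-z)!}\,\frac{\lambda_2^{\,k_2-z}}{(k_2-z)!}\,\frac{\lambda_3^{\,z}}{z!}
$$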
I don’t know which one of these models is more realistic for soccer scores.
I think my approach is a proper marginalized version of the bivariate Poisson, and it has been working well for me on massive models (80k matches, hierarchical bivariate Poisson regression; here I’m just trying to say that they converge very well, etc.). To be honest, I don’t really understand the point of the original model in this topic, because two independently sampled sums of Poissons is the same thing as just two independent Poissons with the summed rates.
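For the record, the additivity fact behind that last sentence: when the components are drawn independently for each margin (rather than sharing one draw of the common component across both scores), the sums collapse to plain Poissons,

$$
Y_1 \sim \mathrm{Poisson}(\lambda_1),\quad Y_3 \sim \mathrm{Poisson}(\lambda_3),\quad Y_1 \perp Y_3 \;\Longrightarrow\; Y_1 + Y_3 \sim \mathrm{Poisson}(\lambda_1 + \lambda_3),
$$

so the two margins end up as independent Poissons with rates \lambda_1 + \lambda_3 and \lambda_2 + \lambda_3.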
I commented on this topic to make sure that whoever wants to implement Bivariate Poisson in Stan takes this into consideration if they stumble upon this page.
I don’t think anyone is disagreeing with that, and that is what I also said in my previous message.
Cool
I’m not an expert in match modeling, so I’m not commenting on the point of that model, but I agree that it would be better not to call it bivariate Poisson, since that term is commonly used for the other model.
Thanks again for doing that! Could you also share your favorite paper / blog post on the bivariate Poisson? I did link to Wikipedia, but Wikipedia has a rather strange choice of reference for the bivariate Poisson.
@valyagolev I might have mentioned this before, but I was trying to reproduce some soccer scores models as a way to train myself and deepen my understanding of both statistics and Bayesian modeling. I’m genuinely glad if your approach outperforms others. Thanks a lot for sharing your ideas and code with the community!
Please be aware that what is called bivariate Poisson at that link is not an actual implementation of the bivariate Poisson; it’s an implementation of independent Poissons for the home and away goals.
There are some papers by Karlis and Ntzoufras that define and use it; I will do a bit more research and edit Wikipedia to help others avoid this unfortunate confusion. (Wikipedia’s formulation is correct, but I will add a couple of plots to make clearer the difference between the bivariate case and simply having two independent Poissons as above.)