Multinomial with non-integer data

shira · June 12, 2019, 4:27pm

Suppose we wanted to implement this in Stan:

http://www.stat.columbia.edu/~gelman/research/published/misterp.pdf
deff

But multinomial_lpmf(int[] y, vector theta) doesn’t allow non-integer data. What do you recommend?

Thanks!!

andre.pfeuffer · June 13, 2019, 5:15am

vector[D] theta[N];
vector[D] y_real[N];
for(i in 1:N)
  target += sum(y_real[i] .* (theta[i] - log_sum_exp(theta[i])));

where N number of observations, D dimension of simplex, y_real …

But the paper talks about binomial.

Updated. Added sum(). Should work without too.
same: target += sum(y_real[i] .* log_softmax(theta[i])));

shira · June 13, 2019, 3:18pm

thanks so much!! the log-likelihood for the Multinomial is a bit different from your expression, I think? See e.g. p.272 of Agresti’s Categorical Data Analysis:

andre.pfeuffer · June 14, 2019, 1:16am

That’s just the J - 1 expression. Look at the PMF given in

https://en.wikipedia.org/wiki/Multinomial_distribution

p_1 + ... + p_k =1, the softmax.
log(p_i) = theta[i] - log\_sum\_exp(theta[i]))

The factor with n! / x1! … /x_k! can be omitted for calc. the likelihood, it
contains only constants.

For identifiable reasons your theta’s should sum-to-zero or contain one element being 0,
the reference element. Ref. Stan manual.

shira · June 14, 2019, 6:30pm

thanks so much!! got it.

(I didn’t catch at first that you switched theta from probabilities to logit scale)

RobertoCerina · June 24, 2024, 5:52pm

Hi all – sorry to resurrect this thread, I was trying to implement this and I’m getting strange results so I thought I’d check in. I’m implementing as follows:

data {
    int<lower=1> N; // number of observations
    int<lower=2> J; // number of choices
    vector[J] y_real[N]; // number of supporters for each choice
}

parameters {
    vector[J] beta; // baseline rate of support
    real<lower = 0> beta_sd;
}

transformed parameters {
    matrix[N,J] mu_s; 
    vector[J] beta_s = beta * beta_sd;

    for(j in 1:J) mu_s[1:N,j] = beta_s[j] ;
}

model {
  beta ~ std_normal();
  beta_sd ~ std_normal();

  for (i in 1:N) target += sum(y_real[i] .* log_softmax(mu_s[i,1:J]'));
}

Does this look vaguely correct ? There are more parameters in the actual model but I’ve slimmed it down to the bare minimum.

Thank you !

Topic		Replies	Views
Integer parameters Modeling	4	2763	July 3, 2017
Specifying a Multinomial Logistic Regression with Unpooled Estimates Modeling pystan , specification , multinomial-response	3	1178	November 29, 2021
Request for Multinomial example Modeling	8	3720	August 6, 2019
Possibility of Logit Parameterized Multnomial in Stan Modeling	1	371	April 23, 2020
Multinomial regression with restriction Modeling	0	533	December 27, 2018

Multinomial with non-integer data

Related topics