Weighted logistic regression

Guido_Biele · May 23, 2020, 7:18pm

If you just want to reduce the number of calls to the likelihood, sufficient statistics is a different and probably also the best way to go.

If you have following data

data {
  int<lower=0> N_unique;   // number of unique rows in x
  int<lower=0> d;
  matrix[N_unique, d] x;
  int<lower=0> U[N];       // number of cases in each row of x
  int<lower=0> Y[N];       // number of cases in each row of x with value 1
}

You should be able to use the binomial distribution for your likelihood:

model {
  theta0 ~ normal(0, 1);
  theta ~ normal(0, 1);
  target += binomial_logit_lpmf(Y | U, theta0 + x*theta)
}

No need for a loop here, because binomial_logit_lpmf is vectorized. Here is the Stan documentation for the binomial_logit_lpmf: https://mc-stan.org/docs/2_22/functions-reference/binomial-distribution-logit-parameterization.html.

Also check the Stan documentation for something like “exploiting sufficient statistics”.

Topic		Replies	Views
Error when fitting a Bernoulli logit model with weights Modeling rstan , techniques , fitting-issues , specification	1	544	November 5, 2020
Error in Stan code when modelling a weighted logistic regression model Modeling rstan , fitting-issues , specification	3	518	April 6, 2021
Survey weighted regression Modeling	34	9235	May 27, 2022
Weighted loglikelihoods in mixture model PyStan	1	1013	February 3, 2020
The effect of weights on the resulting estimates Modeling	1	612	May 22, 2020

Weighted logistic regression

Related topics