Coin Flip Probability Based On Additional Attributes

mattiasthalen · October 7, 2019, 9:16am

Hi,

I’m trying to get a probability plot for several coins with different attributes.
Let’s say that the attributes are diameter, weight, shape & thickness (all categorical).
I’ve understood that in bayes this turns out to be the following:
P(A | B \cap C \cap D \cap E) = \frac{P(B \cap C \cap D \cap E | A)\times P(A)}{P(B \cap C \cap D \cap E)}

If it was just a simple coin my setup would be this:
P(A | B) = \frac{P(B | A)\times P(A)}{P(B)}

data {
	int <lower = 1> N; // number of observations
	int <lower = 0, upper = N> k; // number of successes
}
parameters {
	real<lower = 0, upper = 1> theta; // probability of success
}
model {
	theta ~ beta(1, 1); // prior
	k ~ binomial(N, theta); // likelihood
}

I’m a bit lost on where to start this using rstan.

nhuurre · October 7, 2019, 3:52pm

For binary outcomes predicted by several attributes you might want to start with a logistic regression.

emiruz · October 11, 2019, 11:05am

If it was just one coin I guess you’re asking, what’s the chance of it landing heads given y=(y_1,..,y_n) observations where y_i\in \{0,1\}. So we are interested in P(\theta\mid y). We assume y \sim bernoulli(\theta) and \theta \sim beta(\alpha,\beta). Wikipedia has a lovely page listing lots of common conjugate priors here. You can also calculate them yourself. In your case, Beta is a conjugate prior for the Bernoulli so the posterior form is quite simple: \theta \mid y \sim beta(\alpha + \sum_i^n{y_i},\beta + n - \sum_i^n{y_i}). You don’t need to do any Stan modelling for that. If you want to do it for several coins, I’d suggest just doing it for each coin category separately.

If on the other hand you want to create a model which enables you to guess \theta given a set of factors, you can turn your categorical factors (weight, size, etc) into dummy variables (i.e. 0 or 1 for every possible value) and then do a standard logistic regression; treating \theta as your response variable and your dummy variable factors as your binary predictors.

For example, let’s take just weight as a factor. Say it can have 3 values. We’d thus turn weight into 3 binary valued vectors w_1,w_2,w_3, let o be the binary outcome. We model o \sim bernoulli\_logit(c+ \alpha w_1 + \beta w_2 + \gamma w_3). The posterior parameters c, \alpha, \beta, \gamma then inform how to estimate P(o=1\mid w) = \theta for any given combination.

E.g. (untested)

data {
	int <lower = 1> N;
        int<lower=0,upper=1> o[N];
        int<lower=0,upper=1> w1[N];
        int<lower=0,upper=1> w2[N];
        int<lower=0,upper=1> w3[N];
}
parameters {
	real<lower = 0> c;
	real<lower = 0> alpha;
	real<lower = 0> beta;
	real<lower = 0> gamma;
}
model {
  for (n in 1:N)
    o[n] ~ bernoulli_logit(c + alpha*w1[n] + beta*w2[n] + gamma*w3[n]);
}

Topic		Replies	Views
Bayesian Beta Binomial model with latent draw Modeling rstan , hierarchical-model	6	674	August 1, 2020
Prior Predictive Checks Modeling	3	3381	October 17, 2018
How to write a finite discrete mixture model in stan Modeling specification , cognitive-science	11	713	March 29, 2021
Doubts when modelling the probability of a team beating another team Modeling specification	3	583	November 23, 2020
Weighted Beta-Binomial Bayesian model Modeling rstan , specification	8	1006	September 11, 2023

Coin Flip Probability Based On Additional Attributes

Related topics