Repeated measure logistic regression

ali · October 25, 2022, 6:58am

hello

dear all

I want to perform a bayesian logistic regression on my repeated measure data. I have a problem with specifying the formula and specifying random effects. would you please help me to fix that?
i want to have a random effect based on participant id.
yielding (0,1) is my response variable and TTA is my predictor and I want have a random effect based on parti_id
i used this formula but I get an error:
brm(data = df2,
family = binomial,
yielding ~ 1 + TTA + (1 | parti_id),
prior = c(prior(normal(0, 2), class = Intercept),
prior(normal(0, 2), class = b),
prior(normal(0, 2), class = sd)),
iter = 2500, warmup = 500, chains = 4, cores = 4,
seed = 21)

I get this error:
Warning: Using ‘binomial’ families without specifying ‘trials’ on the left-hand side of the model formula is deprecated.Only 2 levels detected so that family ‘bernoulli’ might be a more efficient choice.

do you have any idea about this? very much appreciated your response.

matti · October 25, 2022, 7:51am

Hi @ali

Welcome to the forum. That’s not an error, but a warning telling you that you might find the HMC sampling to be more efficient if you specify

brm(
  data = df2,
  family = bernoulli,  # bernoulli instead of binomial
  yielding ~ 1 + TTA + (1 | parti_id),
  prior = c(
    prior(normal(0, 2), class = Intercept),
    prior(normal(0, 2), class = b),
    prior(normal(0, 2), class = sd)
  ),
  iter = 2500, warmup = 500, chains = 4, cores = 4,
  seed = 21
)

I also recommend using more iterations for the warmup phase so that the final samples start off clean. The default of using half of samples for warmup makes sense usually.

By the way, when you want to show some code, it is much easier to read if you wrap it in three backticks, like this

```
# code goes here
```

Hope that helps.

ali · October 26, 2022, 9:29am

thanks @matti

I really appreciate your help. it worked!

yes, I will follow your advice next time I ask questions here.

best,
Ali

Solomon · October 26, 2022, 2:19pm

@matti’s suggestion to use the Bernoulli likelihood is great. If you’d like to stay with the binomial likelihood, you can use code like this:

brm(
  data = df2,
  family = binomial,
  yielding | trials(1) ~ 1 + TTA + (1 | parti_id),
  prior = c(
    prior(normal(0, 2), class = Intercept),
    prior(normal(0, 2), class = b),
    prior(normal(0, 2), class = sd)
  ),
  iter = 2500, warmup = 500, chains = 4, cores = 4,
  seed = 21
)

Notice the | trials(1) syntax on the left side of the model formula. That’s what the warning message was referring to with the phrase: “Using ‘binomial’ families without specifying ‘trials’ on the left-hand side of the model formula is deprecated.” With the | trials(1) syntax, you are explicitly telling brm() each row in the yielding variable corresponds to a single Bernoulli trial.

matti · October 26, 2022, 10:37pm

Great to hear it helped. If you don’t mind you can mark the post as the solution to help others find the “right” answer quickly in the future :)

jerlich · October 31, 2022, 4:10pm

If you have discrete levels of TTA it will be much faster to compute the sum of yielding (by TTA, parti_id) and then fit the model as a binomial.

github.com/paul-buerkner/brms

Recommendation of Bernoulli over binomial

opened 01:04PM - 19 May 20 UTC

closed 03:41PM - 19 May 20 UTC

jerlich

documentation

In the [families](https://github.com/paul-buerkner/brms/blob/master/vignettes/br…ms_families.Rmd) vignette it says: > binomial and bernoulli families are distinguished in brms as the bernoulli distribution has its own implementation in Stan that is computationlly more efficient. This suggests (at least it did to me when I was learning about `brms` and Stan) that it is preferred to use Bernoulli over binomial. We have a data set with many repeated choices and binomial versions of our models fit more than 20x faster than the Bernoulli versions. So, I think I would recommend removing or rewording this statement. Note: there is also an **a** missing from _computationlly_.

Topic		Replies	Views
Bayesian Repeated Measures Logistic Regression Modeling techniques , fitting-issues , specification	4	1562	February 25, 2020
Bernouilli/categorical model where responses don't vary within levels of the grouping variable Modeling specification , brms	2	498	March 21, 2022
Analysis of two binomials Modeling brms	5	241	May 9, 2024
Translating brms model into equation brms specification	5	1565	February 11, 2021
'Trials ' missing in model description with brm brms	3	2392	September 14, 2023

Repeated measure logistic regression

Related topics