I have a dataset where I am looking at how many customers a store receives per hour (Customers
). When there are customers, I am also interested in understanding how much each individual customer has spent. There are some hours where there are no customers, which I am trying to study using a hurdle model. However, this means that in those cases there is no money spent, so the column Spent
would be NA. My data looks something like this:
Hour Spent Customers
1 20 1
1 10 2
2 NA 0
3 NA 0
4 10 2
5 15 1
6 10 3
7 NA 0
I’m trying to fit a model of this type:
model <- brms(bf(Customers ~ Hour + Spent,
hu ~ Hour + Spent),
data = df,
family = hurdle_poisson())
However I’m not sure how to handle these NAs. Technically they are not missing data as there was no data to collect at that time, so I’m not sure if imputation or handling it with the mi
function are appropriate ways to handle it.
I would appreciate any suggestions that may come up.