Comparing counts

Hey there,

I have a dataset on animals released at various locations and the number of them recorded in subsequent years. A simplified version of the data set could look like this:

Release_ID Occasion n
S1 Released 150
S1 2022 46
S2 Released 150
S2 2022 12
S3 Released 150
S3 2022 37

The real data set contains 11 Release points and counts for the years 2022, 2023, and 2024. What would be a good way to compare the resulting records statistically among each other in a Bayesian framework?

Thanks for your help!

What question are you trying to answer with these data? Estimate a survival rate? If so is it assumed to be constant or variable across sites? Across years? Are the number recorded in subsequent years assumed to be an exhaustive census or is there a possibility of non-detection of surviving individuals?

I already used a continuous-time survival model on the complete detection data. Here, I would like to compare the number of detected animals per year across the different stocking locations to determine whether there is a statistically significant difference in the number of recorded individuals per year among the release site groups. I hope that makes it clearer.

Is there any difference between studying site-specific variation in the number of recorded individuals versus site-specific variation in survival rates?

I mean, the survival model is based on a couple of assumptions and interprets the decreasing detection rate as an increasing mortality rate. This is basically about the reviewer requesting a statistical test for this figure, to show how similar or dissimilar the detection patterns are.

You mean you want to compute a p-value? Is the null hypothesis that they’re all the same and you want to measure the significance of the rejection of this null hypothesis? For that, the simplest thing I can think of is a chi-squared test (though please take this advice with a grain of salt—my knowledge of hypothesis tests is more textbook driven than experience driven).

Calculating significance is not really something we do in Bayesian statistics, so if you literally want the above, then just choose a frequentist test and report the p-value. You could fit a model with a single recapture probability and a model with varying probabilities and compare them using Bayes factors, but I wouldn’t recommend that for all the reasons Gelman et al. go over in the Bayesian Data Analysis chapter on model comparison (free pdf on line at book’s home page).

If you want to do this in a Bayesian way, you have to set up what the quantity of interest is that you want to measure because point null hypothesis have zero probability when measured using probability theory.

1 Like

Because you begin by releasing the same number of individuals at each site, “significant” variation in subsequent counts is conceptually the same thing as “significant” variation in the apparent survival rate. The count is precisely a constant times the apparent survival rate.

1 Like