Hi everyone,
I’m working on a Jolly-Seber (mark-recapture) model, specifically the superpopulation approach, where the entry probabilities are modeled as a simplex. We have individuals i = 1, \dots, I that enter the study during one of j = 1, \dots, J surveys, so that the entry occasion b_i \sim \mathrm{Categorical} (\boldsymbol{\gamma}), where \sum_{j=1}^J \gamma_j = 1. A natural uninformative prior distribution is \gamma \sim \textrm{Dirichlet}(\boldsymbol{1}), which works fine.
I wish to extend this to account for unequal survey intervals \tau_j, where we expect a higher entry probability for surveys that had a longer period of time between it and the last survey. One way to do this would be via Dirichlet regression, e.g. in Stan:
parameters {
simplex[J] gamma;
real alpha, beta;
}
transformed parameters {
vector[J] mu;
mu[1] = 0; // pin first entry probability for identifiability
mu[2:J] = alpha + beta * tau;
}
model {
gamma ~ dirichlet(softmax(mu));
}
However, this has some divergent transitions, I think for the same reason that the centered parameterisation of Gaussian random effects gets them. Also, I’d prefer to think of the \tau's as an offset, where we have some baseline rate that’s offset by the survey length, and therefore I’d rather not estimate \beta. If we think of generating random Dirichlet variables by simulating \textrm{Gamma}(\alpha, 1) variates and normalising them, I want to find a good way to model the entry “rates” and to interpret them as the expected number of entries per survey.
The following two models (I’ll call them centered and non-centered) give the same loo output and parameter estimates:
parameters {
simplex[J] gamma;
}
model {
gamma ~ dirichlet(ones_vector(J))); // centered
}
And “non-centered”:
parameters {
vector<lower=0>[J] z;
}
transformed parameters {
vector[J] gamma = z / sum(z);
}
model {
z ~ gamma(1, 1); // non-centered
}
What would be a good way to incorporate the survey intervals here? I’ve tried a few things, including just adding log(tau)
or with coefficients, but I want to find a good principled approach to include the survey intervals. Note that the first entry probability doesn’t have a survey interval as it’s the first occasion. Therefore, I wish to estimate it separately from the remaining 2:J surveys.
Thanks,
Matt