While reading the rstanarm
vignette on “Estimating Ordinal Regression Models with rstanarm” by @jonah and @bgoodri , I came across the following section (under “Example”), where the authors fit a model relating tobacco consumption to explanatory variables. The ordinal outcome is measured in grams/day of tobacco: 0-9, 10-19, 20-29, 30+.
However, as the authors state, the cutpoints on the latent scale are actually known, unlike other applications of ordinal models, such as Likert scale outcomes.
The authors state:
Since these cutpoints are actually known , it would be more appropriate for the model to take that into account, but
stan_polr
does not currently support that.
I’m interested in this idea, but I haven’t been able to find any references on this topic. Could anyone help point me in the right direction?
It seems related to interval-censored survival data: for example, a patient can only be diagnosed with a disease when they visit the clinic, so the time of onset of the disease is known to have occurred between visits. Like time of onset to diagnosis, in the vignette example the true tobacco use is positive, and unknown but within a known range.
Link to the vignette here: Estimating Ordinal Regression Models with rstanarm • rstanarm