Clarification on Censored Data Models

Ellis_Scharfenaker · August 10, 2018, 2:03pm

I would like to create a partially synthetic dataset (along the lines of Stephen P. Jenkins’ “Measuring inequality using censored data:
a multiple-imputation approach to estimation and inference”, but Bayesian). As a first step I’m sampling one random value from the posterior of each censored value and attaching it to the observed data.

The code I’m running is just a basic example using censored lognormal data:

stan.mix.cen.test="
data {
  int<lower=0> N_obs;
  int<lower=0> N_cens;
  real y_obs[N_obs];
  real U;
}
parameters {
  real<lower=U> y_cens[N_cens];
  real<lower=0> mu;     
  real<lower=0> sigma;  
} 
model {
  y_obs ~ lognormal(mu,sigma);
  y_cens ~ lognormal(mu, sigma);
}"

set.seed(123)
y <- rlnorm(5000,10.5,.65)
U<-100000
N.cens <- length(y[y>U])
ycens <- y[y<U]
dataList = list(y_obs = ycens , N_obs = length(ycens), U=U, N_cens = N.cens)
cenfit = stan(model_code=stan.mix.cen.test, data=dataList,
               chains=4 , iter=200 , warmup=200) 

stan.out = data.frame(extract(cenfit))

 ys = stan.out[,1:N.cens]
 y.cens.samp <- apply(ys,2,sample,size=1)

post.data = data.frame("y"=c(as.numeric(ycens),as.numeric(y.cens.samp)))

Ellis_Scharfenaker · August 10, 2018, 9:22pm

I realize that taking the maximum posterior value of y_cens is not the right way of creating the synthetic data.

Bob_Carpenter · August 22, 2018, 9:57am

There’s a chapter in the user’s guide section of the manual that explains how to code up censoring and truncation in Stan.

Topic		Replies	Views
Modeling censored data whose probability of observation is proportional to the observation value Modeling specification	7	450	August 10, 2018
Stan Users Guide - 4.3 Censored Data Modeling fitting-issues	0	297	August 23, 2023
Coding hurdle model with censored data Modeling rstan , specification	2	638	November 24, 2022
Censored data is not required in modelling Modeling	6	572	February 4, 2019
Integrating out censored data in negative binomial model Modeling specification	8	1399	August 1, 2018

Clarification on Censored Data Models

Related topics