Brms: Estimating residual correlation between two variables with measurement error

tomkeaney · November 16, 2023, 3:45pm

I have a model that attempts to estimate the residual correlation between two traits using the brms multivariate syntax, where both traits are measured with some error. Testing with some simulated data, the model without measurement errors (or with error specified for one of the traits) samples efficiently, but fails to identify the true correlation (as expected). However, the model with measurement errors struggles to explore the target distribution for the residual correlation (as evidenced by low E-BFMI, low ESS and slow sampling), but gets much closer to the true correlation.

# simulate some instructive fake data with a true high correlation
sigma_true <- matrix(c(1,0.8,0.8,1),2,2)
 
n <- 500
 
# noise greater than signal: independent errors between the two traits
measurement_error <- 
  tibble(a_me = runif(n,0.5,3), 
         b_me = runif(n,0.5,3))
 
# true unknown effects
z<-rmvnorm(n,c(0,0),sigma_true)
 
# estimates are true effects plus measurement errors    
y<-z
y[,1]<-y[,1]+rnorm(n,0,sqrt(measurement_error$a_me))
y[,2]<-y[,2]+rnorm(n,0,sqrt(measurement_error$b_me))
 
data <- 
  as_tibble(y) %>% 
  rename(a_obs = V1, b_obs = V2) %>% 
  bind_cols(as_tibble(z) %>% 
  rename(a_true = V1, b_true = V2)) %>% 
  bind_cols(measurement_error)
 
# the model
 
bf_a <- bf(a_obs | mi(a_me) ~ 1)
bf_b <- bf(b_obs | mi(b_me) ~ 1)
 
fit <- brm(bf_a + bf_b + set_rescor(TRUE),
            prior = c(prior(normal(0, 0.25), class = Intercept, resp = aobs),
                      prior(normal(0, 0.25), class = Intercept, resp = bobs),
                      prior(exponential(2), class = sigma, resp = aobs),
                      prior(exponential(2), class = sigma, resp = bobs),
                      prior(lkj(3), class = rescor)),  
            iter = 6000, warmup = 2000,
            control = list(adapt_delta = 0.9, max_treedepth = 15),
            data = data, chains = 4, cores = 4, seed = 1)

Is my approach for estimating the residual correlation reasonable? What is it about having error for both outcomes that leads to poor sampling? Any thoughts would be great! Happy to move from brms to stan if neccessary.

Operating System: Windows 11
brms Version: 2.20.4

jsocolar · December 11, 2023, 5:01am

Sorry it took so long for you to get a response. Your approach is reasonable in general, but it doesn’t surprise me that it yields some sampling problems, which likely are arising from difficulties in identifying the residuals as separate from the measurement errors.

No idea if this is part of the problem, but one thing that I note is that your lkj(3) prior on the residual correlation matrix disfavors high correlations like the ones that you use in your data generating process. Do things get any better if you switch to lkj(1), which might be more consistent with your data?

Topic		Replies	Views
Trouble with Measurement Error in brms brms fitting-issues	3	516	October 5, 2020
Applying loo to multi-variate measurement error model in BRMS Modeling loo , brms	7	694	August 16, 2021
Measurement error models in brms with error on both independent and dependent variables brms specification	2	2929	February 3, 2021
Brms measurement error model where x and y measure same latent variable brms	1	357	August 11, 2023
Multivariate models with different families and missing data brms	4	1478	June 5, 2018

Brms: Estimating residual correlation between two variables with measurement error

Related topics