Model comparsion for linear regression using loo and Bayesian R2

Michelle · October 18, 2022, 8:51pm

Hello,

I have 4 linear regression models and their only difference is one column of the design matrix (the other 20 columns are all the same). The task is to set a model selection strategy to select only one model among 4. I first considered loo_compare to select those who have considerably better elpd_loo (elpd_diff > 2*se_diff), then from the “surviving” ones, I prioritized Bayesian R2 to select one(s) that have similar elpd_loo but probably better fitting. In the end, if there is still more than one model left, I plan to use the absolute z score of the parameter corresponding to the column that is different across 4 models. I have two questions could you please help?

elpd has se_diff to indicate whether the difference is considerable, but R2 and z scores do not. There will be always a numerical difference such as mode1 R2 = 0.86123, model2 R2 = 0.86133, but I am not sure the difference has any practical meaning. Is there a way to tell if the difference between R2s is big enough? I guess I shouldn’t compute the distribution of R2 difference by doing R2(model1) - R2(model2) since the samples are not paired. Then, should I do one-way ANOVA? or maybe just set an arbitrary R2 difference like larger than 0.01 is big enough? Same issue for z score.
Does it make sense to select one model using zscore of the parameter corresponding to the column that is different across 4 models after loo and R2 can not tell a difference? Note that the only difference across 4 models is one column in the design matrix

Thanks a lot

Topic		Replies	Views
Statistical significance of model comparison using ELPD Modeling loo	2	231	October 3, 2024
Loo_compare issue Modeling	5	1198	August 31, 2020
Comparing non-nested models using elpd_diff Modeling techniques , loo	2	617	February 9, 2021
Model selection with loo and bridge sampling brms	1	1461	February 23, 2021
If elpd_diff/se_diff > \|2\|, is this noteworthy? brms techniques , loo , cross-validation	21	4009	April 3, 2021

Model comparsion for linear regression using loo and Bayesian R2

Related topics