MRP for average predictive comparisons?

landerso · December 2, 2020, 9:06pm

In analyzing survey data, where some demographic groups are oversampled and average predictive comparisons for the predictors are of interest because a logit or a beta regression model was used, should weighting as in MRP be used? The sampling strata were included in the model.

In addition there are survey data from two years, and this oversampling was not present in the first year.


* Operating System: Windows 10
* rstanarm Version: 2.21

martinmodrak · December 7, 2020, 8:41pm

Yes, I would say MRP would be a reasonable way to model this data. The lack of oversampling in the first year shouldn’t be an issue (in fact, it could make your life easier). But if the representation changed drastically between years, there is a risk that the non-response structure changed in ways that go beyond the demographic variables you can adjust for… (and that’s not something any statistical package will help you with)

Best of luck with your model!

landerso · December 8, 2020, 7:49pm

I appreciate your reply; I can use the help! If I am looking at just the second year, which is oversampled, and I want an average predictive comparison for the predictors averaged over all the groups, do I need weighting then, or is it enough that I have strata in the model?

martinmodrak · December 8, 2020, 9:11pm

I am not sure I understand the question well (I also should have noted earlier that I am not an expert on surveys - I am just relaying second-hand knowledge). The way I understand MRP is that you fit a model that lets you predict how a single person in each stratum you can distinguish would respond. Then you generate the right number of predictions for each stratum based on your demographic data. I don’t think weighting enters the model directly. But I would feel more comfortable if @lauren checked my reasoning here (as she’s the resident survey expert).
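
One common way to write the poststratification step is sketched below. The symbols are assumptions introduced for illustration: $J$ is the number of strata (cells), $N_j$ is the population count of cell $j$ taken from external demographic data, and $\theta_j$ is the model-based prediction for cell $j$ (for an average predictive comparison, the average difference in expected outcome within that cell).

$$
\theta^{\mathrm{PS}} = \frac{\sum_{j=1}^{J} N_j \, \theta_j}{\sum_{j=1}^{J} N_j}
$$

In this form the counts $N_j$ appear only when the predictions are combined, not in the likelihood, which is consistent with weighting not entering the model directly.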

landerso · December 9, 2020, 4:02pm

Because I want to average over the values given by respondents for the other continuous predictors, I am using posterior_epred and passing it the dataset, or a subset of the dataset, as the new data. Therefore the oversampled groups are overrepresented in the new data. I didn’t know if including the sampling strata in the model took care of that or not.

martinmodrak · December 14, 2020, 9:00pm

I think I understand - for poststratification you would want to pass as new data for prediction a dataset that mimics your expected population structure. I.e. a dataset with the “correct” number of respondents in each group. Does that make sense?
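
To make the idea concrete, here is a minimal sketch in plain Python of combining per-group posterior draws with population counts. Everything in it is an assumption for illustration: the group names, the made-up draws (standing in for differences between two matrices of expected values such as `posterior_epred` would return), and the population counts. It is not rstanarm code.

```python
# Sketch: poststratify per-group posterior draws of a predictive comparison.
# All numbers and names below are made up for illustration.

def poststratify(draws_by_cell, counts):
    """Return one count-weighted average across cells for each posterior draw."""
    total = sum(counts[c] for c in draws_by_cell)
    n_draws = len(next(iter(draws_by_cell.values())))
    return [
        sum(counts[c] * draws_by_cell[c][d] for c in draws_by_cell) / total
        for d in range(n_draws)
    ]

# Three hypothetical posterior draws of the comparison in each group.
draws_by_cell = {
    "group_a": [0.10, 0.12, 0.08],
    "group_b": [0.30, 0.28, 0.32],
}

pop_counts = {"group_a": 900, "group_b": 100}    # assumed population structure
sample_counts = {"group_a": 50, "group_b": 50}   # group_b is oversampled

# Weighting by population counts versus by the (distorted) sample counts.
poststratified = poststratify(draws_by_cell, pop_counts)
sample_weighted = poststratify(draws_by_cell, sample_counts)

print(poststratified)
print(sample_weighted)
```

With these made-up numbers the sample-weighted average is pulled toward the oversampled group, while the population-weighted one is not. The result is still one value per posterior draw, so uncertainty intervals can be computed from it as usual.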

landerso · December 15, 2020, 5:54pm

Yes, that does make sense. I could sample from the overrepresented groups in the data to get the correct proportions.
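
A rough sketch of that resampling idea, in plain Python with made-up data, might look like the following. The rows, the per-row predicted values, and the population shares are all hypothetical. It resamples with replacement using weights (a variation on subsampling the overrepresented groups), and it also shows a weighted average of the original rows, which targets the same quantity without resampling noise.

```python
import random
from collections import Counter

# Hypothetical respondents: (group, predicted value for that row).
rows = [("a", 0.10)] * 50 + [("b", 0.30)] * 50   # group "b" is oversampled
pop_share = {"a": 0.9, "b": 0.1}                 # assumed population shares

n = len(rows)
sample_share = {g: c / n for g, c in Counter(g for g, _ in rows).items()}

# Row weight = population share / sample share of the row's group.
weights = [pop_share[g] / sample_share[g] for g, _ in rows]

# Option 1: weighted average over the original rows.
weighted_mean = sum(w * y for w, (_, y) in zip(weights, rows)) / sum(weights)

# Option 2: resample rows with those weights to build the "new data".
rng = random.Random(0)  # fixed seed so the sketch is reproducible
resampled = rng.choices(rows, weights=weights, k=10_000)
share_b = sum(g == "b" for g, _ in resampled) / len(resampled)

print(weighted_mean)
print(share_b)
```

Either option relies on the respondents within each group being representative of that group in the population; if they are not, finer strata would be needed.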
