Hi everyone,
I’m currently analyzing the correlation between Paid Leave and various demographic variables and have been using the Leave Module Weights for this analysis.
I have a couple of questions regarding the correct approach for calculating standard errors.
-
Weighting Approach: Is it better to use aweight or pweight in STATA for this type of analysis? I came across a post mentioning that using the summarize function with aweights yields incorrect standard errors. Could someone confirm if employing svyset with pweights is indeed preferable for accurate standard error estimation?
-
Optimizing Standard Error Calculations: To compute standard errors, I use the following code, where lvwt represents the Leave Module weight variable, and rlvwt_1 to rlvwt_160 denote the replication weights. Are these codes appropriate for the task, or is there a way to improve or refine them?
STATA codes:
svyset [pweight= lvwt], vce(brr) brrweight(rlvwt_1- rlvwt_160)
R codes:
df_wt ← df %>%
as_survey_rep(
weights = lvwt,
repweights = starts_with(“rlvwt_”),
type = “BRR”)
Thanks in advance for your guidance and advice!