This question is directed to Tim Moreland–
I wrote a few weeks back re: Veterans Supplement Income Information, and received a very helpful response explaining how it’s necessary to link respondents to their outgoing rotation month. I have since been using the new variable “cpsidp” to link these respondents accordingly. Within the article “Making full use of the longitudinal design of the CPS,” the authors mention that when using “cpsidp” a small number of liked records do not match up on sex, race, or age. I have found this to be true in my merging as well, and am able to bypass the mismatches in age and sex by including those variables in my merge (in using STATA-- merge 1:1 cpsidp age sex using filename.dta). But of course with race you run into the additional difficulty of needing to recode the variable due to its change in categorical structure in 2003.
My question: What is the best way to locate and remove these mismatches from my data after/while merging? I am only interested in hanging onto the “plausible” matches, as referenced by the article above, rather than “all” matches linked by “cpsidp,” as, to give an example, the process currently matches up a handful of veteran responses to org-respondents below the age of 10. I am tempted to add more demographic / identifiable variables into the “merge” command in order to sift out the unlikely matches, but the more variables I add, the more similar the process seems to how I was merging pre-“cpsidp.” Is there an easier way?
Thank you in advance for any help you’re able to provide.