Hello,
I’ve been trying to supplement tables from ACS 5 year data (EEO tabulation) for 2006-2010 with ACS 5-year data for the same period from IPUMS. I’ve been flustered for about 24 hours since there seem to be considerable discrepancies in summary stats between the two sources, which makes me wonder if I’m doing something wrong.
For example, ACS 5 year data estimate (for the 2006-2010 period) available via AFF for the number and distribution of janitors (occ code 4220 in 2010) is the attached screenshot, showing a total of 2.6 million with 68.3% male and 31.7% female.
On the other hand, using perwt as weights, the ACS 5-year data for the same period indicates a slightly different total:
. tab sex if occ2010==4220 & datanum==5 [fw=perwt]
Sex | Freq. Percent Cum.
------------±----------------------------------
Male | 2,185,828 67.52 67.52
Female | 1,051,432 32.48 100.00
------------±----------------------------------
Total | 3,237,260 100.00
I’m wondering what accounts for the discrepancy between them (wrong weights?) Given the differences, it seemed a bit unwise to move on to the more finegrained data before making sure that I’m looking at the same apples.
Thank you!