I had been working with the 1% samples. I recently downloaded the full count census data and I’m getting significantly different numbers. I’m trying to figure out where I’m going wrong.
For instance, the 1920 full count data from Oregon shows 9,995 cases where the individual was categorized as working in the logging industry (code 306 in the IND1950 variable).
On the other hand, the 1920 1% sample from Oregon with Person Weight (PERWT) applied shows 16,772 cases where the individual was categorized as working in the logging industry (again using IND1950).
This is a person-level analysis so PERWT is the correct weight to apply to the 1% sample in this case, right?
I understand that the full count data and 1% sample will not alighn perfectly. Still, I didn’t expect the numbers to be that different. Like I said, I’m just trying to figure out why the numbers are so off. Am I making a mistake working with the 1% sample or with the full count data?
Thanks for your help!