Person-household record match

skogan · September 23, 2024, 5:12pm

Have set up Chicago-only 1-year PUMS person and household datasets for 2012-2022 (no 2020). Want to attach household record to each person record. Sorted by sample and serial, 54% of household records are still not unique. Is there a third mysterious variable that will produce unique household records that I can then attach to person records with the same unique sort?

Using 1-year to look at trend; plan to push it a bit further back in time.

Isabel_Pastoor · September 24, 2024, 1:16pm

By default, IPUMS USA data are person-level microdata, meaning each row or observation is a person. Each column is a variable that describes a person-level characteristic or household-level characteristic; household-level variables are automatically appended to person records.

In IPUMS USA, the two variables SAMPLE and SERIAL uniquely identify households. The three variables SAMPLE, SERIAL, and PERNUM uniquely identify persons. While SAMPLE and SERIAL uniquely identify households, if your data extract includes person records as well as household records (as you stated), you will still have multiple observations associated with many of the households; many households include multiple household members. If your data extract includes only household records, then you will have just one observation per household.

If you prefer to create a data extract that includes only household records (and no information about persons), you can select this option when you create your data extract:

Topic		Replies	Views
What don't I understand? USA	3	127	September 26, 2024
How is each IPUMS-USA record uniquely identified?	1	416	January 22, 2013
Key identifier for household and personal data USA	6	732	September 18, 2020
How does IPUMS CPS variable SERIAL behave across years and downloads? CPS	2	809	March 31, 2015
How is an IPUMS-CPS record uniquely identified? CPS	1	871	January 23, 2013

Person-household record match

Related topics