I pulled data from IPUMS to help inform a project my team is working on. The questions my colleague wants to answer are about presence of children in households by race of householder by units in structure. This is not a pre-packaged Census/ACS table, so I am turning to the PUMS data.
However, I’m a little unsure of my analysis, as this is the first time I’ve really used the PUMS data to prepare a custom table. I don’t have anyone at my workplace to turn to for double-checking my work, so I hope this forum might help answer my question.
To summarize, I was able to extract the 26,000+ person records (12,000+ housing records) of the five primary PUMAs of the city I’m looking at (Portland, OR). Then I did some recoding (e.g., units in structure reduced to “multi-family, single-family, other”; and whether the household has children). Then I kept the householder record and put that in a pivot table. I then summed the HHWT (household weight) field. My results seem right, but I’m not sure if this was the correct way to do this.
One person I asked, who admitted they don’t really use HHWT field so they don’t really know, said I should not have kept only one record per household. But this doesn’t seem to agree with the advice on the HHWT documentation.
An additional question I have is whether the HHWT field should sum to the total number of households within the PUMAs selected. If so, it seems my HHWT field is under-counting the number of households/units. The sum of my HHWT field (one record per household) is 224,876 (using 2014 5-yr ACS), but using the data in FactFinder the total housing units comes to 224,876, and the occupied unit count is 251,512. Why is there a discrepency?
Below is the table I came up with. Would anyone be able to check my numbers or general approach? Thank you for reading.