Household counts larger than expected

Ethan_Jantz · February 18, 2022, 8:01pm

I believe I was able to figure this out. There is no PERNUM variable in household-level data extracts, but by ensuring there were no duplicate SERIAL values I was able to get a count within the ACS5 estimate ± the margin of error. See below:

library(ipumsr)
library(dplyr)

ddi <- ""
ipums_path <- ""

ipums_data <- read_ipums_micro(ddi = ddi,
                               data_file = ipums_path)

IL_hh_pums <- ipums_data %>%
  filter(GQ == 1, STATEFIP == 17) %>%
  distinct(SERIAL, .keep_all = T) %>%
  group_by(STATEFIP) %>%
  summarize(households = sum(HHWT))

IL_hh_acs5 <- tidycensus::get_acs(
  survey = "acs5",
  year = 2019,
  geography = "state",
  state = "IL",
  variables = c("households" = "B11012_001"),
  output = "wide"
)

IL_hh_pums # 4,844,000
IL_hh_acs5 # 4,846,134; moe 10,459

Topic		Replies	Views
Why do I fail to replicate published ACS aggregates? USA	2	219	March 2, 2023
Obtaining Total Household counts for 1990 and 2000 USA	1	328	May 25, 2021
Inconsistency with PUMS data USA	2	311	April 13, 2021
Household Count Mismatching USA	4	272	March 1, 2022
As a new user, can someone check my analysis that used HHWT field? USA	2	637	September 28, 2016

Household counts larger than expected

Related topics