Hi,
I am working on a project that needs to create a tract-level map for Chicago-Naperville-Elgin, IL-IN-WI Metropolitan Statistical Area (MSA). To define the MSA boundaries, I use the MET2013, STATE and PUMA variables in 2019-2023 IPUMS 5-year dataset, and the 2020_Census_Tract_to_2020_PUMA crosswalk file on the Census Bureau website.
However, as I subset the Chicago MSA map based on the Tract-PUMA crosswalk results, I found an error: The map includes Ogle County (17141), which should not belong to Chicago MSA area. It also misses three other counties: Grundy (17063), Jasper (18073) and Newton (18111).
I compared the IPUMS data and crosswalk file and found that this discrepancy is due to one PUMA–State code is 17, and PUMA 5-digit code is 03700. In the PUMA file, this PUMA is classified under County FIPS 17000 (unidentifiable), while in the crosswalk file, it is under County FIPS 17141, which is assigned to Ogle County on the map.
So my question are: is the crosswalk file classifies PUMA 173700 to Ogle County by mistake, or is it the latest reclassification for Chicago MSA boundaries? If it is a mistake, which county should I assign this PUMA code to, to correct the boundaries and figures on the tract-level map? And what happened to the other three missing counties in PUMA codes that are supposed to be included in Chicago MSA?
Also, what is the best way to extract a tract-level MSA shapefile in R? I use the IPUMS + crosswalk files for other MSAs, which appear to work well, but now the Chicago map got me suspicious about the validity of this strategy. I am attaching a screenshot of the Chicago MSA map that I created earlier for reference, which includes Ogle County due to the tract-PUMA classification.
Any insights would be greatly appreciated! Thanks!