Data-Documentation Mismatch: US 2000-2005 IND Codes

Hi,

I’ve identified a discrepancy between the US 2000-2005 dataset and its accompanying documentation regarding IND codes. Specifically:

  • The dataset contains IND codes ranging from 17 to 126 that are not included in the documentation
  • Conversely, the documentation lists several codes that don’t appear in the actual dataset

Could you help clarify whether this indicates an issue with my data processing, or if there’s an alternative reference I should consult for the complete code definitions?

Thank you so much!

Thanks for flagging this discrepancy. I was able to confirm that the link contained in the IND code section for United States 2000-2005, links to the Industry codes for a different variable (US2005_IND1950, US 2005 Industry, 1950 basis), instead of the US 2005 unrecoded industry variable and codes page. I have notified the IPUMS International team of this and the said a fix will go out with their next data release.

In the meantime, you can reference the 2003-2007 ACS/PRCS Industry Codes from IPUMS USA because the US2005 sample in IPUMS International is the 2005 American Community Survey (ACS, which is also available through IPUMS USA). Note that the IPUMS International’s IND variable for US2005 is less detailed than what is available through IPUMS USA, so when you reference the 2003-2007 ACS IND codes, only the first three digits will be relevant for your IPUMSI analysis. I verified that it is otherwise an accurate list of codes between IPUMSI and IPUMS USA unrecoded industry variables.