1880 & 1900 city codes listed as "0" for New York, Newark, Jersey City

Code Label
3150 Jersey City, NJ
4610 New York, NY
4630 Newark, NJ

According to the city code section on the IPUMS website there should be 1880 & 1900 data available for these samples.

I am using R to view this data in a data frame, and all city codes are listed as “0”.

Has this happened to anyone else? I am wondering if this is an error with the IPUMS data, or with how I access the data.
Please let me know your thoughts.

Hi Dan,

A couple of questions that would help me answer your question:

  1. Did you create the extract on the IPUMS USA website or through IPUMS R?
  2. What format did you download the data in (e.g., CSV, Stata, text [.dat])?
  3. How are you loading the data into your R data frame?

I created a CSV format extract from the 1880 and 1900 full count datasets, and a second extract from the 1880 (10%) and 1900 (5%) samples. I included the CITY and SEX variables in my extract. I then used the case selection option to retain only records in Jersey City, New York, Brooklyn, and Newark.

For the full-count dataset, these are the record counts I have for the four cities:

# A tibble: 4 × 2
# Groups:   CITY [4]
   CITY       n
  <dbl>   <int>
1  3150  328090
2  4610 4666542
3  4611  566641
4  4630  383301

For the sample datasets, these are the record counts:

# A tibble: 4 × 2
# Groups:   CITY [4]
   CITY      n
  <dbl>  <int>
1  3150  22801
2  4610 299249
3  4611  59739
4  4630  26873

So, I am definitely finding data for those city codes in the 1880 and 1900 samples. Answers to the questions I posed above may provide more insights.

Sincerely,
Dave Van Riper
IPUMS

Hello Dave,

I used the “SELECT CASES” option to select Jersey City, New York, and Newark, like you mentioned. This solved my error, the city codes are now listed for me.

Thank you for the reply.