Code Label
3150 Jersey City, NJ
4610 New York, NY
4630 Newark, NJ
According to the city code section on the IPUMS website there should be 1880 & 1900 data available for these samples.
I am using R to view this data in a data frame, and all city codes are listed as “0”.
Has this happened to anyone else? I am wondering if this is an error with the IPUMS data, or with how I access the data.
Please let me know your thoughts.
Hi Dan,
A couple of questions that would help me answer your question:
- Did you create the extract on the IPUMS USA website or through IPUMS R?
- What format did you download the data in (e.g., CSV, Stata, text [.dat])?
- How are you loading the data into your R data frame?
I created a CSV format extract from the 1880 and 1900 full count datasets, and a second extract from the 1880 (10%) and 1900 (5%) samples. I included the CITY and SEX variables in my extract. I then used the case selection option to retain only records in Jersey City, New York, Brooklyn, and Newark.
For the full-count dataset, these are the record counts I have for the four cities:
# A tibble: 4 × 2
# Groups: CITY [4]
CITY n
<dbl> <int>
1 3150 328090
2 4610 4666542
3 4611 566641
4 4630 383301
For the sample datasets, these are the record counts:
# A tibble: 4 × 2
# Groups: CITY [4]
CITY n
<dbl> <int>
1 3150 22801
2 4610 299249
3 4611 59739
4 4630 26873
So, I am definitely finding data for those city codes in the 1880 and 1900 samples. Answers to the questions I posed above may provide more insights.
Sincerely,
Dave Van Riper
IPUMS
Hello Dave,
I used the “SELECT CASES” option to select Jersey City, New York, and Newark, like you mentioned. This solved my error, the city codes are now listed for me.
Thank you for the reply.