Download samples for specific cities

Stacey_Jones · July 27, 2020, 6:37pm

Hello-For teaching purposes, I would like to download random samples of roughly 1000 observations from about 50-100 US cities, with about 20 basic demographic and economic variables such as sex, age, race/ethnicity, household size, employment status, income, education. Would IPUMS USA be an appropriate source for this, and if yes, could someone point me to the process for downloading data for a specific city (or perhaps I could download all cities at once, then sort, which would mean starting with 50K to 100K observations…). Help!

KariWilliams · July 30, 2020, 3:16pm

IPUMS USA would certainly do the trick. This video tutorial provides a basic overview of how to create a custom dataset with IPUMS USA. You can leverage the “Select cases” tool in the final stages of submitting your data extract request to keep only cases with certain values (e.g., certain metropolitan areas) for the variables that you have included in your extract (e.g., CITY, METRO, or MET2013).

Stacey_Jones · September 8, 2020, 5:08am

Thank you for getting back to me. I’ve been working on the above- downloading random samples of roughly 1000 observations from about 50-100 US cities, with about 20 basic demographic and economic variables such as sex, age, race/ethnicity, household size, employment status, income, education. Because it is for teaching purposes-an introductory course- I would like to avoid weighting, but still include economic variables such as income, education and employment. The only samples with these variables seem to be the weighted ones. Any suggestions? Is there a method for transforming a weighted sample into a pseudo-unweighted sample? Or a fairly recent unweighted sample that with a more extensive set of variables than the 2010 10% sample?

KariWilliams · September 11, 2020, 6:43pm

The sample design of the ACS requires the use of sample weights for producing accurate estimates. I can understand that weighting might be outside of the realm of an introductory course; I recommend either skipping weights in analysis (possibly covering the concept briefly and letting students know that the results they get are not necessarily accurate) and/or using the 2010 10% sample for a subset of analyses where you don’t require the more detailed topical coverage of the ACS/long-form. The ACS Data Users Group (and corresponding forum) may be able to offer suggestions about transforming the PUMS as you describe, but I am not familiar with methods that allow for this.

Topic		Replies	Views
Random selection of people (how to with survey package) USA	3	747	November 22, 2021
Programmatically pull USA Sample IDs through IPUMSPY package? USA	1	172	February 14, 2024
Downloading the complete census for the USA full count (1850-1930) USA	1	182	November 3, 2023
Variables for entire United States USA	1	425	November 29, 2017
How do I decide what IPUMS-USA sample to use from any given year?	1	303	January 12, 2013

Download samples for specific cities

Related topics