Why do roughly half the states have so few respondents in ACS 2016?

claireah · October 30, 2018, 12:24am

I extracted data from UPUMS USA looking at certain demographic information (race, income, age) and also pulled STATEICP and STATEFIP (and county and MET2013 and PUMA). I’d like to look at this info on a state by state basis, so sorted with STATEFIP codes.

When organizing by state, roughly half the states have less than 100 respondents, the rest have an (N) of dozens or hundreds of thousands. For instance, CA has 340k, and CT has 26. Why is this, and is there a way to make the data less …patchy? I have 5,264,018 rows of data.

Thanks

Michelle_Pratt · October 30, 2018, 7:01pm

Would you be able to share how you are tabulating the data? Are you further limiting cases based on certain criteria? Based on the total number of observations you have in your data, it sounds like you are working with a fully unzipped file, however, looking at your last extract, the minimum number of unweighted observations for a state should be Alaska with 8,797.

Feel free to share your code here or email us at ipums@umn.edu. If you have not narrowed your data in any way, I would recommend re-saving/unzipping your extract just in case something odd happened in the unzipping process.

Topic		Replies	Views
I am using 2007-2011 ACS data. When I run the frequencies for stateicp & statefip there are only 14 states. Why?	2	441	November 18, 2013
Aggregating Observations by State + County Fips Codes USA	2	463	November 24, 2021
Is there a way to download IPUMS data by geographic region (i.e. by state)? USA	1	1086	November 30, 2015
Incomplete 100% data sets USA	1	268	May 4, 2020
I'm trying to get complete count data for two California counties (Merced and Los Angeles) from the 1850 U.S. Cens INTERNATIONAL	2	373	March 2, 2017

Why do roughly half the states have so few respondents in ACS 2016?

Related topics