Discrepancies between online tabulations and 1y dataset aggregations

Mathew_Baumann · January 29, 2021, 10:16pm

Hello!
I’ve been working with a PUMA-level educational attainment dataset built from the ACS 1-year files (version 7), and checking it against the tabulations found here:
https://data.census.gov/cedsci/table?q=South%20dakota%20education%20age&t=Educational%20Attainment&tid=ACSST1Y2016.S1501&moe=false&tp=false&hidePreview=false

I was hoping someone might know why these would be different – such as a specific versioning change that would lead to different results.

Thank you!

Example of discrepancies (aggregations use the “perwt” weighting variable)
South Dakota 2010, ages 25+, by educational attainment:

less than HS:
tab: 55,367.416
aggregated 1y: 51,503

HS grad:
tab: 168,231.764
aggregated 1y: 164,355

some college / AA:
tab: 168,764.143
aggregated 1y: 163,565

college grad:
tab: 140,015.677
aggregated 1y: 150,447
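
To put the gaps above in relative terms, here is a minimal Python sketch that takes the eight figures exactly as posted and reports the absolute and percent difference per category, plus the overall totals. The dictionary names and the `compare` helper are just illustrative choices, not part of any ACS or IPUMS tooling.

```python
# Figures copied from the post (South Dakota, 2010, ages 25+).
# "tabulated" = published table values, "aggregated" = PERWT-weighted 1-year sums.
tabulated = {
    "less than HS": 55367.416,
    "HS grad": 168231.764,
    "some college/AA": 168764.143,
    "college grad": 140015.677,
}
aggregated = {
    "less than HS": 51503,
    "HS grad": 164355,
    "some college/AA": 163565,
    "college grad": 150447,
}


def compare(tab, agg):
    """Return {category: (difference, percent difference)}, microdata minus tabulation."""
    result = {}
    for category, tab_value in tab.items():
        diff = agg[category] - tab_value
        result[category] = (diff, 100.0 * diff / tab_value)
    return result


diffs = compare(tabulated, aggregated)
for category, (diff, pct) in diffs.items():
    print(f"{category:<16} {diff:>12,.1f} {pct:>8.2f}%")

# Totals across the four categories, to see how much of the gap is in the
# overall population count rather than in its distribution.
tab_total = sum(tabulated.values())
agg_total = sum(aggregated.values())
print(f"{'total':<16} {agg_total - tab_total:>12,.1f} "
      f"{100.0 * (agg_total - tab_total) / tab_total:>8.2f}%")
```

Laid out this way, the totals sit much closer together than the individual categories do, so most of the gap is in how people are distributed across attainment levels rather than in the overall 25+ population.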

skolenik · February 1, 2021, 10:14pm

Publicly available microdata are a subsample of the full ACS, while the Census website reports results based on the full ACS sample. For small geographies, such as SD, the discrepancies will be more pronounced. The hope is that the two numbers are not significantly different from one another (although this is hard to test formally, since the samples are not independent).

Matthew_Bombyk · February 2, 2021, 2:21pm

To follow up on @skolenik's answer, you can read more about the differences between the ACS PUMS and the full sample here.

Mathew_Baumann · February 3, 2021, 1:10am

Ahh ok, I didn’t realize I was working with a subsample. Is this just a measure taken to ensure confidentiality?

Matthew_Bombyk · February 3, 2021, 5:52pm

I believe that’s the reason.
