Census: 100% vs sample

Benjamin_Couillard · September 20, 2024, 7:52pm

Greetings to the community,

I am curious about why aggregates sometimes differ between census tables based on 100% counts and tables based on samples. Other documentation says the samples are weighted to match the 100% counts, so I would have expected the aggregates to match.

For example:
The total number of households in the 1990 100% STF1 data (source table NP24, NHGIS code E22): 91,993,582.
The total number of households in the 1990 sample STF3 data (source table NP27, NHGIS code EUL): 91,947,410.

Obviously this difference of about 46,000 (46,172 households, or roughly 0.05%) is small relative to the size of the nation, but I intend to use a statistical method where it might matter that the two totals do not match.

Thanks,
Ben

JonathanSchroeder · September 20, 2024, 8:53pm

The differences between these two sources are due to the Bureau’s choice to maintain whole numbers for all sample-based estimates (rather than fractional estimates) and to maintain additivity between subgroup counts and totals. There’d be no way to adjust all of the sample-based numbers to match the corresponding 100%-count numbers perfectly without either allowing some of the subgroup counts to be fractions or reducing their accuracy by rounding them more severely.
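To illustrate the trade-off described above, here is a minimal Python sketch. The subgroup counts and the 100%-count total are made-up toy numbers, not census figures, and the rescaling step is only an assumption for illustration, not the Bureau's procedure. It shows that forcing a sample-based total to match a 100% count either produces fractional subgroup counts or, after rounding, breaks additivity.

```python
from fractions import Fraction

# Hypothetical sample-based subgroup estimates (toy numbers, not census data).
sample_subgroups = [10, 10, 10]
sample_total = sum(sample_subgroups)   # 30, additive by construction
full_count_total = 31                  # hypothetical 100%-count total

# Rescale every subgroup so the total matches the 100% count exactly.
scale = Fraction(full_count_total, sample_total)
scaled = [count * scale for count in sample_subgroups]

# Option A: keep fractions. The total matches, but counts are not whole numbers.
scaled_total = sum(scaled)
all_whole = all(value.denominator == 1 for value in scaled)

# Option B: round each subgroup. Counts are whole, but they no longer sum
# to the 100%-count total.
rounded = [round(value) for value in scaled]
rounded_total = sum(rounded)

print(scaled_total == full_count_total)  # True
print(all_whole)                         # False
print(rounded_total)                     # 30
```

In this toy case each rescaled subgroup is 31/3, so whole numbers and additivity to the matched total cannot both be kept.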

This outcome is also due to the weighting design. To tally the sample-based counts, the Bureau assigns a whole-number weight to every sample response and then sums these whole-number weights to produce the estimates. Whole-number weights also make it impossible for every weighted total to match every 100%-count total exactly.
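A toy example may make the whole-number-weight constraint concrete. The sketch below is not the Bureau's weighting algorithm. The three sample responses, the categories "A", "B", and "C", and the control totals are all hypothetical. It checks every whole-number weight assignment by brute force and finds that none reproduces all three control totals, even though fractional weights do.

```python
from fractions import Fraction
from itertools import product

# Hypothetical sample responses, each listing the categories it belongs to.
sample = [{"A", "B"}, {"B", "C"}, {"A", "C"}]

# Hypothetical 100%-count totals per category (toy numbers, not census data).
controls = {"A": 5, "B": 5, "C": 5}

def weighted_totals(weights):
    """Sum each response's weight into every category it belongs to."""
    totals = {category: 0 for category in controls}
    for weight, categories in zip(weights, sample):
        for category in categories:
            totals[category] += weight
    return totals

# Fractional weights of 5/2 reproduce every control total exactly.
fractional = [Fraction(5, 2)] * len(sample)
fractional_match = weighted_totals(fractional) == controls

# Try every whole-number weight from 0 to 5 for each response. Larger weights
# would overshoot a control total of 5 on their own, so this search is exhaustive.
integer_matches = [
    weights
    for weights in product(range(6), repeat=len(sample))
    if weighted_totals(weights) == controls
]

print(fractional_match)  # True
print(integer_matches)   # []
```

The three category totals together count every weight twice, so they would have to sum to an even number, but the controls sum to 15. Real weighting involves far more responses and control totals, but the same kind of conflict can arise.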

For more information about the Bureau’s approach to sampling and weighting for long-form census data, see the “Technical Documentation” for the corresponding datasets, particularly the sections on the “Accuracy of the Data”.

Benjamin_Couillard · September 20, 2024, 9:07pm

Makes perfect sense. Thank you, Jonathan.
