Rules of thumb for determining if tract-level data are too noisy/unreliable?

Alexandra_Lee · March 20, 2019, 12:16am

I’m using tract-level estimates from a 5-year ACS sample (number of renter-occupied households), and am trying to filter out unreliable data. It’s clear when estimates are extremely unreliable (e.g., the margin of error is larger than the estimate), but are there generally accepted thresholds for deciding whether an estimate is reliable enough to use? For example, a margin of error no more than X, with a sample size of at least Y?
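To make the question concrete, here is a minimal sketch of the kind of filter being asked about. It converts each published MOE to a standard error (ACS MOEs are published at the 90 percent confidence level, so SE = MOE / 1.645) and then computes a coefficient of variation (CV = SE / estimate). The function names, the sample tract records, and especially the `max_cv=0.30` cutoff are assumptions for illustration only; the cutoff is a placeholder, not an official or generally accepted threshold.

```python
import math

Z_90 = 1.645  # ACS MOEs are published at the 90% confidence level


def coefficient_of_variation(estimate, moe):
    """Return CV = SE / estimate, with SE = MOE / 1.645.

    A zero estimate has an undefined CV, so it is treated as infinitely noisy.
    """
    if estimate == 0:
        return math.inf
    return (moe / Z_90) / estimate


def is_reliable(estimate, moe, max_cv=0.30):
    """Flag an estimate as usable if its CV is at or below max_cv.

    max_cv is a placeholder (assumption), not an official cutoff.
    """
    return coefficient_of_variation(estimate, moe) <= max_cv


# Hypothetical tract records: renter-occupied households and their MOEs.
tracts = [
    {"geoid": "A", "renters": 1200, "moe": 150},
    {"geoid": "B", "renters": 80, "moe": 95},   # MOE larger than the estimate
    {"geoid": "C", "renters": 400, "moe": 180},
]

kept = [t["geoid"] for t in tracts if is_reliable(t["renters"], t["moe"])]
```

With these made-up numbers, tract B is dropped because its MOE exceeds the estimate, while A and C pass the placeholder cutoff. Tightening or loosening `max_cv` changes which tracts survive, which is exactly the choice the question is about.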

Looking at the Census Bureau’s guidance on using margins of error (https://www.census.gov/programs-surveys/acs/guidance/training-presentations/acs-moe.html), I understand how to determine if two estimates are statistically different. But if I’m trying to draw conclusions about trends across all tracts, and therefore need a reliable dataset of tract-level data, how can I go about using only the tract estimates that are sufficiently un-noisy?
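For reference, the pairwise comparison described in that kind of guidance can be sketched as follows. Each MOE is converted to a standard error, the two are combined as the square root of the sum of squares, and the difference is judged significant at the 90 percent level if the resulting Z statistic exceeds 1.645 in absolute value. The function name and the sample numbers are hypothetical, and the sketch assumes the two estimates are independent.

```python
import math

Z_90 = 1.645  # critical value for the 90% confidence level used by ACS MOEs


def significantly_different(est1, moe1, est2, moe2, z_crit=Z_90):
    """Test whether two ACS estimates differ at the 90% confidence level.

    Assumes the two estimates are independent (no covariance term).
    """
    se1 = moe1 / Z_90
    se2 = moe2 / Z_90
    se_diff = math.sqrt(se1 ** 2 + se2 ** 2)
    if se_diff == 0:
        # No sampling error reported for either estimate: compare directly.
        return est1 != est2
    z = (est1 - est2) / se_diff
    return abs(z) > z_crit


# Hypothetical renter-household estimates for two tracts.
print(significantly_different(1200, 150, 1000, 120))  # difference of 200
print(significantly_different(1200, 150, 1100, 120))  # difference of 100
```

This answers the two-estimate question only. It does not by itself say which tracts are reliable enough to include in an analysis across all tracts.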

John_Sullivan · March 21, 2019, 12:21am

I think there are some suggestions for dealing with this issue toward the end of this paper:
Folch, David C., Daniel Arribas-Bel, Julia Koschinsky, and Seth E. Spielman. (2016). Spatial Variation in the Quality of American Community Survey Estimates. Demography, 53:5, pp. 1535-1554.

Alexandra_Lee · March 21, 2019, 9:25pm

Thanks John! This paper was helpful in describing the non-random pattern that MOEs follow in the ACS data. Though I might hope for a straightforward set of rules, it looks like their recommendations would vary on a case-by-case basis.

I found a PDF of the working paper version here: https://ecommons.cornell.edu/bitstream/handle/1813/38122/Folch-etal_2014.pdf?sequence=2&isAllowed=y
