Why are there fewer counties in the 1950 1% sample compared to the 1940 1% sample?

NTurner · February 16, 2018, 1:57pm

I downloaded data from 1930-2010.

I used the 1% or 5% samples.

In 1930 and 1940 using the 1% samples, I have roughly 3,000 observations at the state-county level.

In the 1950 1% sample I get 145 observations, in the 1960 5% sample I get 435 observations, and in the 1970 1% sample I get 145 observations.

Am I doing something incorrect? Or, was there a change in the data/sampling that leads to fewer counties in later years?

Thanks-

Nick

JeffBloem · February 16, 2018, 6:44pm

You haven’t done anything incorrect. Beginning in 1950, the lowest level of geography identifiable in public use data is the PUMA (public use microdata area). PUMAs are sometimes identical to counties, but often not. Therefore, from 1950 onward, only some counties in some samples can be identified. More details are available in the COUNTYFIPS variable description. This recent blog post explains the issue and offers some ideas for alternatives.
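
To check how many counties are actually identified in each sample, something like the following may help. It is only a minimal sketch: it assumes the extract has been read into rows with YEAR, STATEFIP, and COUNTYFIP columns, and that COUNTYFIP is coded 0 when the county cannot be identified (please confirm both against the variable description for your extract). The function name and the toy rows are made up for illustration.

```python
from collections import defaultdict


def count_identified_counties(rows):
    """Count distinct identified (state, county) pairs per census year.

    Assumption: each row is a dict with YEAR, STATEFIP and COUNTYFIP keys,
    and COUNTYFIP == 0 means the county is not identified.
    """
    counties = defaultdict(set)
    for row in rows:
        year = int(row["YEAR"])
        county = int(row["COUNTYFIP"])
        pairs = counties[year]  # make sure the year appears even with no counties
        if county != 0:
            pairs.add((int(row["STATEFIP"]), county))
    return {year: len(pairs) for year, pairs in sorted(counties.items())}


# Toy rows (not real data): one 1950 record has an unidentified county.
sample = [
    {"YEAR": 1940, "STATEFIP": 1, "COUNTYFIP": 1},
    {"YEAR": 1940, "STATEFIP": 1, "COUNTYFIP": 3},
    {"YEAR": 1940, "STATEFIP": 1, "COUNTYFIP": 1},
    {"YEAR": 1950, "STATEFIP": 1, "COUNTYFIP": 0},
    {"YEAR": 1950, "STATEFIP": 6, "COUNTYFIP": 37},
]

print(count_identified_counties(sample))
```

Records with an unidentified county are skipped rather than counted as a single county, so a drop in the per-year count reflects fewer identifiable counties, not fewer records.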
