I extracted data from UPUMS USA looking at certain demographic information (race, income, age) and also pulled STATEICP and STATEFIP (and county and MET2013 and PUMA). I’d like to look at this info on a state by state basis, so sorted with STATEFIP codes.
When organizing by state, roughly half the states have less than 100 respondents, the rest have an (N) of dozens or hundreds of thousands. For instance, CA has 340k, and CT has 26. Why is this, and is there a way to make the data less …patchy? I have 5,264,018 rows of data.