Help with FAMINC variable

I am sharing a few ideas about what might be happening to give you these numbers. Please follow up if you have questions after reviewing them or continue to have problems replicating the household counts you expect.

A good way to calculate household estimates is to filter on PERNUM == 1, which keeps only the first person listed in each household (generally the household head), so each household is counted once rather than once per member. It looks like you may be trying to estimate the number of households for a particular state (please correct me if I'm wrong), in which case you can filter to that state, or use group_by in R to get a separate total for each state. Also be sure to filter on ASECFLAG == 1 if you included any Basic Monthly data in your extract, because the Basic Monthly samples use a different weight variable than the ASEC. These data training exercises might be helpful to you for best practices in using IPUMS data with your statistical package, in your case R. Below is some example code you could use to estimate the number of households in Florida (2018), which comes out to about 8.6 million:

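library(dplyr)

# One record per household, Florida only (STATEFIP 12), ASEC records only
data %>%
  filter(PERNUM == 1 & STATEFIP == 12 & ASECFLAG == 1) %>%
  group_by(STATEFIP) %>%
  # Summing the ASEC household weight gives the estimated number of households
  summarize(n = sum(ASECWTH))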
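If you want totals for several states at once, the same approach works without the state filter. Here is a minimal sketch on a small made-up data frame (cps_toy and all of its values are invented for illustration and are not real CPS estimates; your extract would take its place):

```r
library(dplyr)

# Toy stand-in for an IPUMS CPS extract; every value here is invented.
cps_toy <- data.frame(
  SERIAL   = c(1, 1, 2, 3, 3, 3, 4),        # household identifier
  PERNUM   = c(1, 2, 1, 1, 2, 3, 1),        # person number within household
  STATEFIP = c(12, 12, 12, 13, 13, 13, 12), # 12 = Florida, 13 = Georgia
  ASECFLAG = c(1, 1, 1, 1, 1, 1, 2),        # last row stands in for a non-ASEC record
  ASECWTH  = c(1500, 1500, 2000, 1800, 1800, 1800, NA)
)

households_by_state <- cps_toy %>%
  # Keep one ASEC record per household
  filter(PERNUM == 1, ASECFLAG == 1) %>%
  group_by(STATEFIP) %>%
  # Sum the household weight within each state
  summarize(households = sum(ASECWTH), .groups = "drop")

print(households_by_state)
```

Because the filter runs before the sum, the non-ASEC record (which has no ASEC household weight in this toy example) never reaches sum(ASECWTH).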

One other factor to consider if you are getting conflicting numbers is the difference between families and households. Not all household members are necessarily family members. The U.S. Census Bureau defines a family as a group of two or more people related by birth, marriage, or adoption who reside together, whereas a household includes everyone who lives in the housing unit, related or not. This comes into play with FAMINC, which is recorded at the household level but measures family income. HHINCOME, by contrast, is a household-level variable that measures the income of the whole household (you can read more about this here).
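To make the family/household distinction concrete, here is a toy sketch. It does not use real IPUMS variables: the column names hh_id, related_to_head, and income, and all of the values, are hypothetical and only illustrate why the two income concepts can differ.

```r
library(dplyr)

# Hypothetical person-level data: household 1 is a couple plus an
# unrelated roommate; household 2 is a couple only.
people <- data.frame(
  hh_id           = c(1, 1, 1, 2, 2),
  related_to_head = c(TRUE, TRUE, FALSE, TRUE, TRUE),
  income          = c(40000, 30000, 25000, 50000, 20000)
)

income_compare <- people %>%
  group_by(hh_id) %>%
  summarize(
    # Family income: only members related to the household head
    family_income    = sum(income[related_to_head]),
    # Household income: everyone living in the unit
    household_income = sum(income),
    .groups = "drop"
  )

print(income_compare)
```

In household 1 the roommate's income counts toward household income but not family income, so the two totals differ; in household 2 they match. The same logic is why estimates based on FAMINC and HHINCOME may not line up.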