Analyzing multiply imputed family income variables in NHIS

Esther_Lamidi · January 7, 2021, 7:44pm

I am posting my question here hoping someone could help. I, however, understand that this may be more of an analytic question than a question about the data that I’m using.

I would like to include a measure of family income (not poverty status) in my logistic regression analysis of self-rated health using the 2000-2018 NHIS data. To deal with the missing information on family income as indicated by the variable incfam97on2, I rely on the five imputed family income variables (incimp1-incimp5). Rather than using the imputed income categories on those variables, I would like to 1) use the midpoint of each interval of the imputed income variables, 2) convert those values (midpoints) to 2018 dollars, and 3) adjust the income values in 2018 dollars, derived in step 2, for family size. Given the popular view that imputed variables should not be transformed after imputation, I wonder if I could do my own multiple imputation as follows: 1) use incimp1 as the “original” income variable and treat cases with imputed values (i.e. impyfamflag1 == 1 | impyfamflag1 == 2) as missing cases; 2) transform the incimp1 variable with missing/unimputed values as described in steps 1-3 above - convert intervals to midpoints, convert to 2018 dollars, and adjust for family size; 3) multiply impute the values of the new transformed variable.
I am open to other suggestions. Thank you.

Matthew_Bombyk · January 20, 2021, 3:31pm

I’m not an expert in multiple imputation, but I will share some resources that might be helpful to you. First you can take a look at the IPUMS NHIS data briefs, which include sample code used in the analyses. Brief #2 uses imputed variables. I would also recommend reviewing the technical documentation on multiple imputation in NHIS from NCHS. You may have luck contacting the NHIS program directly with your question, since they employ the statisticians who developed the imputation methods. You may also want to post your question on stats.stackexchange.com or Statalist.

To clarify, are you proposing to create the imputed variables yourself, instead of relying on INCIMP1-INCIMP5? For what it’s worth, your method seems reasonable to me.

Esther_Lamidi · January 24, 2021, 2:33pm

Thank you so much. I ended up using the five imputed variables from IPUMS.

Topic		Replies	Views
Using NHIS Imputed Income Data (1997-2018) HEALTH SURVEYS	1	328	September 26, 2023
Income variable - Imputated variable HEALTH SURVEYS	1	659	November 8, 2019
USE OF IMPUTED VAR "INCIMP5" HEALTH SURVEYS	1	501	January 17, 2019
NHIS Income Data Post 2019 HEALTH SURVEYS	1	136	April 23, 2024
NHIS point estimate imputed income pre-2008 HEALTH SURVEYS	2	13	December 5, 2024

Analyzing multiply imputed family income variables in NHIS

Related topics