Analysis approach to find associations between disease outcomes within specific demographic groups

Manuel_Marte · January 9, 2023, 7:52pm

Hi all,

Please excuse my naivety as I am not sure how to best formulate my question.

Here is a toy problem demonstrating what I would like to accomplish:
Imagine we have four groups, Male and female adults without hypertension, and male and female adults with hypertension.

Is there an analysis that can, as I see it, (1) sample from the “without” groups to create “matching cohorts” on all variables of interest but hypertension, to (2) estimate associations between various other variables (some acting as covariates of no interest) and the emergence of hypertension in the latter group? E.g., something like a survival analysis, but without a time variable, given that IPUMS data are cross-sectional without repeated measures. References to other papers would also be appreciated.

Best,
MJM

Matthew_Bombyk · January 12, 2023, 4:55pm

What you are describing is a matching analysis. There are several varieties. Matching on exact combinations of variables as you describe is known as “exact matching”. Other common methods are propensity score matching, full matching, and coarsened exact matching. Exact matching is equivalent to running a linear regression using “hypertension” as the outcome variable and dummy variables for every combination of covariates, as explained in this Stata Blog post, however this requires all covariates to be discrete and generally would require a very large sample size in order to ensure a sufficient number of matches. The other matching methods are more flexible in what counts as a match, and thus have less stringent data requirements. Other approaches to this type of analysis are logistic, probit, and linear probability models, all of which are linear index models. All of these methods have plenty of resources freely available online. I hope that helps.

Topic		Replies	Views
How to set up a "time" variable for a Cox PH model to examine correlations to a diagnosis HEALTH SURVEYS	2	421	May 17, 2022
Identifying matching individuals within a household in the ASEC and Monthly samples over time. CPS	2	602	December 31, 2018
Weights for mortality analysis (IPUMS_NHIS)	2	269	February 23, 2021
Can I analyze the intergenerational change in wages in IPUMS? USA	2	384	September 26, 2017
Which weight to use when analyzing a subpopulation? HEALTH SURVEYS	3	685	January 10, 2022

Analysis approach to find associations between disease outcomes within specific demographic groups

Related topics