R2/explained variance for ACS data

Molly_Richard · February 10, 2023, 1:29am

I received a reviewer comment on a paper in which I am using IPUMS data. It reads:

“What is typical for regression R2 for American Community Survey Microdata? The R2 for the four models seem quite high, and providing a reference point would be useful for readers.”

Currently, I’m exploring research on aggregation bias to attempt to respond to this comment, but I wanted to see if any other users or staff have thoughts on how to approach the question. In the study, the dependent variable is an MSA-level variable created using IPUMS microdata, and the independent variables are also derived from the ACS (estimates downloaded directly from the Census).

Thanks for any thoughts!

Ivan_Strahof · February 15, 2023, 3:04pm

I don’t have an answer to the question of what a typical R^2 is for regressions using ACS data, and it seems unlikely to me that you will find an answer to such a broad question. Some relationships involve only a few well-observed variables and will therefore generate higher R^2 values when the correct variables are specified in the regression model. Other relationships are much more complex and involve unobserved variables that cannot be included in the model, which lowers the R^2. The expected R^2 will depend on the relationships you’re evaluating in your paper and on what previous research has found on the specific issue you’re studying.
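
To make this concrete, here is a rough simulation sketch in Python (using NumPy). It is not based on ACS or IPUMS data: the variable names, group counts, and noise scale are all made up for illustration. It compares the R^2 of the same relationship when a relevant variable is left out, and when individual records are averaged up to a group level (a stand-in for an MSA-level analysis like the one described in the question).

```python
import numpy as np

def r_squared(y, X):
    """R^2 from an OLS fit of y on the columns of X plus an intercept."""
    X = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return 1.0 - resid.var() / y.var()

rng = np.random.default_rng(0)

# Hypothetical sizes: 200 "MSAs" with 500 simulated respondents each.
n_groups, n_per_group = 200, 500
group = np.repeat(np.arange(n_groups), n_per_group)

# Each predictor has a group-level component plus individual variation.
x1 = rng.normal(size=n_groups)[group] + rng.normal(size=group.size)
x2 = rng.normal(size=n_groups)[group] + rng.normal(size=group.size)

# Individual-level noise is large relative to the signal (an assumption).
noise = rng.normal(scale=3.0, size=group.size)
y = x1 + x2 + noise

# Individual-level fits: both predictors, then with x2 unobserved.
r2_micro_full = r_squared(y, np.column_stack([x1, x2]))
r2_micro_omitted = r_squared(y, x1)

def group_mean(v):
    """Average an individual-level array within each group."""
    return np.bincount(group, weights=v) / n_per_group

# Group-level fit on means: individual noise largely averages out.
r2_group_full = r_squared(
    group_mean(y),
    np.column_stack([group_mean(x1), group_mean(x2)]),
)

print(f"individual level, both predictors: {r2_micro_full:.2f}")
print(f"individual level, x2 omitted:      {r2_micro_omitted:.2f}")
print(f"group means, both predictors:      {r2_group_full:.2f}")
```

In this toy setup, dropping a relevant variable lowers the R^2, and fitting the same model on group means gives a much higher R^2 than the individual-level fit, because averaging removes most of the individual-level noise. How strongly either effect shows up in a real analysis depends on the data, so this is only an intuition aid and not a benchmark for ACS models.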
