I received a reviewer comment from a paper in which I am using IPUMS data that reads:
“What is typical for regression R2 for American Community Survey Microdata? The R2 for the four models seem quite high, and providing a reference point would be useful for readers.”
Currently, I’m exploring research on aggregation bias to attempt to respond to this comment, but I wanted to see if any other users or staff have thoughts on how to approach the question. In the study, the dependent variable is an MSA-level variable created using IPUMS microdata, and the independent variables are also derived from the ACS (estimates downloaded directly from the Census).
Thanks for any thoughts!