Weighting for State-Industry Level Estimates

Chantal_Pezold · June 27, 2023, 6:53am

Hi All,

I would like to construct state-industry level measures of employment, similar to what Paul Goldsmith-Pinkham, Isaac Sorkin and Henry Swift do in their replication code below:

collapse (sum) indwt = perwt_cz (firstnm) geo' indind_digits’ if age>=18 & full_time==1, by(geo'_indind_digits’ year)

What I am unsure about is what the weight should look like. They construct it as a combined weight of the standard person-level weight and “afactor”, which is simply measuring the share of a given “puma” population in a commuting zone, where the latter is the geographical unit that you are interested in.

rename afactor cz_wt
label var cz_wt “Commuting Zone Weight”
gen perwt_cz = perwt * cz_wt
label var cz_wt “Person * Commuting Zone Weight”

Will I need to construct a comparable “State Weight”?

Thanks for your help!

Ivan_Strahof · June 28, 2023, 8:19pm

The problem these researchers appear to be trying to solve is that the data does not report what commuting zone a sample household resided in and that the smallest geographic unit that is identified, public use microdata areas (PUMA), are not contiguous with commuting zones. The code you shared therefore appears to estimate commuting zones as weighted combinations of the PUMAs they contain. However, this is not an issue when constructing state-level measures since PUMAs do not cross state boundaries. The state that a respondent household resides in is identified in the variable STATEFIP.

Topic		Replies	Views
Using HH Weight & P Weight for same analysis in R USA	1	350	October 8, 2021
zero wages USA	3	351	April 19, 2018
What weighting procedure do I use to calculate the state-year level mean for an individual-level variable? USA	3	950	May 23, 2018
Accuracy of aggregate industry and occupation employment counts in 2011-2015 ACS sample USA	0	390	November 7, 2017
Assigning PERWT Variable USA	4	244	March 6, 2024

Weighting for State-Industry Level Estimates

Related topics