Hi,
I am currently working with CPS data, focusing on an occupation with a relatively small group of workers (e.g., school bus drivers in public schools). My goal is to see how the size of this workforce changes by year to better understand its labor dynamics.
Since this analysis uses monthly CPS data (from 2012 to 2024), I have been using the wtfinl weight, following the recommendation on the IPUMS website. Here’s a brief outline of what I’ve done so far:
-
Created a dummy variable for whether an individual has this occupation using
occandind(e.g.,is_busvariable). -
Set the survey design with:
svyset [pweight = wtfinl]. -
Ran:
svy: tab year is_busto get the number of workers in this occupation by year.
For some reason, the number of people in the occupation is quite larger than what I have seen reported in the literature using ACS data (and I had expected CPS to show smaller numbers, since it is a monthly survey data.
Question: Does this approach seem methodologically reasonable to you?
I have checked around online and looked through past questions (i.e., link), but I did not find anything directly addressing this. Maybe the question is too basic for someone to have posted about before.
My plan is to move on to analyzing financial-related variables using earnwt, but before doing so I want to make sure that my current approach makes sense.
I’d really appreciate any suggestions, advice, or corrections you might have.
Thanks so much,
K


