Hi!
Sorry if this is a basic question but I can’t seem to find the answer anywhere.
I have have a file of CPS ASEC data that I want to merge with the MORG files available at NBER. Specifically, I’d like to look at union coverage by year x state x occupation/class/industry (kinda like what unionstats.com does). However, I’m not sure how class, occupation, and industry are coded in the MORG file, so I’m not sure which variables I’d need in my ASEC file (e.g. OCCLY, OCC10LY, etc). Thanks!
It sounds like you are interested in merging harmonized CPS data from IPUMS with the NBER merged outgoing rotation group (MORG) files. Note that IPUMS CPS offers the outgoing rotation group data together with each ASEC (1990-onwards) and Basic Monthly Survey (1983-onwards) sample. The Outgoing Rotation Groups (ORG) variable selection page displays the harmonized ORG variables that IPUMS offers, including the variable UNION. An extract with both ASEC and ORG variables will have a single observation per person, with responses from each component already merged together onto the same person-record.
The NBER MORG file will code occupation and industry based on the Census Bureau classification system that was used contemporaneously (see the MORG documentation for reference); the corollary to these unharmonized values are available in IPUMS CPS as OCC, IND, OCCLY, and INDLY. I am linking to a blog post and forum discussion that include information on merging IPUMS and NBER versions of the CPS data.
Thank you Ivan, this is super helpful. Because I would rather have dates the 1979-1983 data, I am looking to use the MORG dataset from NBER if at all possible. What I’m thinking is that the best solution might be to use the broadest codes of OCCLY/INDLY which are consistent throughout (using the first and second digit of the codes). Would this make sense?
Thanks for clarifying. This approach makes sense, though you should note that both OCCLY and INDLY use the 1970 classification system in samples from 1971-1982 and the 1980 system in samples from 1983-1991. The two systems are very different (compare 1970 OCC codes to 1980 OCC codes); using only the first digit might not provide you with comparable groups. If you require bridging one or more of these code vintages, I recommend using our harmonized occupation and industry variables: OCC1990 and IND1990. These code occupations and industries across vintages into consistent groups from the 1990 census using a modal assignment protocol (see the variable documentation for more information).
To make use of these harmonized variables with the MORG files, you’d first need to create a crosswalk using IPUMS CPS data (see this blog post for a step-by-step guide). You will need to create the OCC and IND crosswalks individually for each code vintage you plan to use. For example, you can request the 1980 ASEC with both OCC and OCC1990 in order to collapse it into an occupation code crosswalk for samples from 1971-1982. Once you have this crosswalk, you can then use it to recode the vintage-specific 3-digit occupations in the MORG files to OCC1990. You can also aggregate OCC1990/IND1990 into more general occupation/industry groups by using the headings in the variable codes tab.
Thanks! If I understand correctly, this will be my plan of attack:
I will download the OCC codes for each decade of interest and map it to the OCCLY90 variable. I’m not sure how to deal with the issue that the codes don’t always go from X → Y but sometimes go from X ->Y and X-> Z. My guess is that the best solution is to move the X’s to the modal category between Y and Z (use weighted count and see if there’s more Ys than Zs).
The harmonized occupation codes already do this work for you! Each OCC/OCCLY will always map to a single OCC1990/OCC90LY code using a modal assignment procedure. This procedure is based on a series of technical papers published by the Census Bureau where observations are double coded into the occupational schemes of the current and previous census year (see OCC1990 for more information). Depending on how specific the occupations that you want to study are, you might additionally consider aggregating occupations into broad categories implicit in the 1990 scheme in order to increase comparability across code vintages.