Code for cleaning original NSF NSCG Data

Kadeem_Noray · November 6, 2019, 5:45pm

I recently downloaded the NSCG 2015 and 2017 files from NSF, and noticed that they are somewhat different than the IPUMS versions of previous NSCG data. Is there some prewritten code (ideally in Stata) for making these data sets comparable quickly? Also, when will the NSCG 2015 and 2017 data be released on IPUMS? Thanks.

JeffBloem · November 6, 2019, 7:40pm

Unfortunately, the grant that funded the existing IPUMS Higher Ed project is no longer operational. Therefore, at this time we do not have plans to integrate the 2015 and 2017 NSCG samples into IPUMS Higher Ed. Also, the IPUMS harmonization process is not executed in Stata or any other statistical software and so we do not have any pre-written code that harmonizes these data.

Tomas_Bascolo · April 9, 2025, 1:28pm

Hello, I have a similar question regarding this. In the original NSCG dataset there are many variables that are present across the every year of the survey but are not present in the IPUMS extracts. For instance, variables related to location of education (e.g. MRRGNX, does not show for any of the years when only selecting NSCG sample. Is this intended? Is there anyway to create an extract only with NSCG data with all the variables?

Ivan_Strahof · April 11, 2025, 4:45pm

While IPUMS Higher Ed provides data from three surveys — the National Survey of College Graduates (NSCG), the National Survey of Recent College Graduates (NSRCG), and the Survey of Doctorate Recipients (SDR) — we only release the SESTAT (Scientists and Engineers Statistical Data System) subsamples of the NSCG and the NSRCG. This subsample consists of fewer respondents, including only those with science or engineering degrees or occupations, and also includes fewer variables. For example, MRRGN is available only as a restricted use variable in the NSCG SESTAT sample.

You can merge respondents from an IPUMS Higher Ed data extract with the full NSCG file containing MRRGN and any other variables of interest (see the public use files page) using REFID.

Tomas_Bascolo · April 24, 2025, 7:19am

Thank you for the clarification Ivan!

Topic		Replies	Views
NSRCG2001 not included HIGHER ED	2	527	September 1, 2020
Undergraduate GPA variable HIGHER ED	2	631	September 1, 2020
Higher Ed-NSCG graduation year	1	418	September 7, 2018
Example API Export With ACS IPUMS NHGIS	1	279	August 28, 2023
IPUMS CPS Data Extract with all Variables versus CPS Data Extracts from Census website CPS	1	471	October 29, 2021

Code for cleaning original NSF NSCG Data

Related topics