PERWT in a new data extract sums to very low value

ron.campbell · November 11, 2019, 11:32pm

I’ve just begun working with an IPUMS extract of the 2013-2017 ACS. As a test, I summed PERWT / 100 (to account for the implied decimal), grouping by state. The values for CA, TX and FL were each in the hundreds of thousands, obviously far too low. I then tried for a national total and got about 6.5 million, again obviously far too low. The import instructions in both the txt and xml files place PERWT at the same position in the file. The record count is about 18.9 million. Any ideas?

ron.campbell · November 13, 2019, 7:11pm

I resolved this by re-importing the data using the R readr library, importing the data as a fixed-width format, specifying all column widths and names. I suspect there’s a problem in the IPUMS R file that may have led to my issue. In any case, when I hand-built the import file, the problem vanished.
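For anyone who wants to see what a hand-built fixed-width import looks like, here is a minimal sketch in Python rather than R. The column layout (`STATEFIP` in the first 2 characters, `PERWT` in the next 10 with 2 implied decimals) and the sample lines are assumptions made up for illustration; the real positions must come from the codebook that ships with your own extract.

```python
from collections import defaultdict

# Hypothetical layout for illustration only: (name, start, end, implied_decimals).
# Positions are 0-based and end-exclusive; take the real ones from your codebook.
LAYOUT = [
    ("STATEFIP", 0, 2, 0),
    ("PERWT", 2, 12, 2),
]

def parse_line(line):
    """Slice one fixed-width record and apply implied decimals exactly once."""
    record = {}
    for name, start, end, decimals in LAYOUT:
        raw = int(line[start:end])
        record[name] = raw / 10 ** decimals if decimals else raw
    return record

def weighted_totals(lines):
    """Sum PERWT within each STATEFIP code."""
    totals = defaultdict(float)
    for line in lines:
        rec = parse_line(line)
        totals[rec["STATEFIP"]] += rec["PERWT"]
    return dict(totals)

# Made-up sample records, not real ACS data.
sample = [
    "060000012300",  # state 06, PERWT 123.00
    "060000004550",  # state 06, PERWT 45.50
    "480000010000",  # state 48, PERWT 100.00
]
totals = weighted_totals(sample)
print(totals)
```

Because the scaling happens in exactly one place (`parse_line`), no further division by 100 is needed when summing.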

Matthew_Bombyk · November 14, 2019, 3:20pm

Hi Ron, glad you were able to find a workaround.

For future reference, you don’t need to divide the weights by 100. The ipumsr package will do this for you automatically. I think this was your problem. I just downloaded an extract using the 2013-17 5-year ACS sample, and PERWT summed to the correct state and national totals.
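To show the arithmetic, here is a small sketch in Python with made-up numbers (not real ACS weights). It assumes the reader has already applied the two implied decimals, as described above, so a second division by 100 shrinks every total by a factor of 100.

```python
# Made-up raw values as they might be stored in the data file (2 implied decimals).
raw_weights = [12300, 4550, 10000]

# What a reader that handles implied decimals would hand back: already scaled.
scaled_weights = [w / 100 for w in raw_weights]

# Correct total: sum the weights as delivered.
correct_total = sum(scaled_weights)

# Mistake: dividing by 100 a second time on top of the automatic scaling.
double_scaled_total = sum(w / 100 for w in scaled_weights)

print(correct_total, double_scaled_total)
```

A quick sanity check is to compare the summed weights against a known population figure: a total that is off by roughly a factor of 100 usually points to the weights having been scaled twice (or not at all).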

ron.campbell · November 14, 2019, 5:26pm

Thanks. I hadn’t realized that ipumsr could do that. I relied on the old “divide by 100” rule that I’ve used in the past. This is a nice time saver.
