In IPUMS CPS, for my variables I’m choosing: HOURWAGE, AGE, RACE, EARNWT, MARST, STATEFIP (Once I troubleshoot this, I will eventually choose more variables). I’m choosing 2020 for my sample. When creating my extract, my data structure is rectangular with CSV format. When loading my extract into R, I use this code:
ddi ← read_ipums_ddi(“cps_00035.xml”)
data ← read_ipums_micro(ddi)
After doing this however, I keep ending up with 200K+ observations and 0 variables, and I get the error message: Version:1.0 StartHTML:0000000107 EndHTML:0000001754 StartFragment:0000000127 EndFragment:0000001736
Warning: 270540 parsing failures. row col expected actual file 1 – 3 columns 1 columns ‘./cps_00035.csv’ 2 – 3 columns 1 columns ‘./cps_00035.csv’ 3 – 3 columns 1 columns ‘./cps_00035.csv’ 4 – 3 columns 1 columns ‘./cps_00035.csv’ 5 – 3 columns 1 columns ‘./cps_00035.csv’ … … … … … See problems(…) for more details. Warning message: The following named parsers don’t match the column names: YEAR, SERIAL, MONTH, HWTFINL, CPSID, ASECFLAG, ASECWTH, STATEFIP, PERNUM, WTFINL, CPSIDP, ASECWT, AGE, SEX, RACE, MARST, EARNWT, HOURWAGE
Help! What am I doing wrong?