Does anyone have sample code for using svydesign function in R?

madoolsy · July 17, 2018, 8:35pm

In the “survey” vignette for R, it shows that you’re supposed to enter ID, weights, data set name, and fpc. In the example it shows the following:

dclus1 <- svydesign(id = ~dnum, weights = ~pw, data = apiclus1, fpc = ~fpc)

FPC is supposed to be the population size, but I do not have this variable in my dataset. What variables should I be using for id and fpc? The variables in my dataset are: year, datanum, serial, HHWT, STATEFIP, GQ, PERNUM, PERWT, AGE, HISPAN, HISPAND, RACAMIND, RACASIAN, RACBLK, RACPACIS, RACWHT, RACOTHER, POVERTY.

Does anyone have sample R code for weighting IPUMS USA data? I would like to get estimates and standard errors.

gfellis · July 18, 2018, 4:51pm

*** Edited on 8/2/21 to fix an error in the code.

Hi there,

The fpc argument is not required, so you can leave it empty. (The FPC argument can give extra precision when you know that the sample design has sampled a significant portion of a particular group. This isn’t the case for IPUMS USA, so the estimates would be indistinguishable with or without the FPC and the data doesn’t have the necessary information to use it. See the help for svydesign or Thomas Lumley’s book “Complex Surveys: A guide to Analysis Using R” for more details.)

Based on the variables you’ve listed, I believe you will need to revise your extract to add the CLUSTER and STRATA variables, and then the following code should give you estimates using the person weights. If you are interested in weighting at the household level, you’ll need to use HHWT instead of PERWT.

library(ipumsr)

ddi <- read_ipums_ddi(“usa_00019.xml”)

data <- read_ipums_micro(ddi)

survey package instructions (Person Weights)

library(survey)

svy <- svydesign(~CLUSTER, weights = ~PERWT, strata = ~STRATA, data = data, nest = TRUE, check.strata = FALSE)

svymean(~HISPAN, svy)

srvyr package instructions (Person Weights)

library(srvyr)

svy <- as_survey(data, ids = CLUSTER, weights = PERWT, strata = STRATA, nest = TRUE)

summarize(svy, HISPAN = survey_mean(HISPAN))

Topic		Replies	Views
What R package is needed for IPUMS USA for weighting? Is it "survey"? Do you have sample code? USA	3	2705	July 18, 2018
Know of any tutorials for using the R survey package with ACS PUMS? Having problems weighting data.	4	1212	May 10, 2017
How to use Stata svyset on IPUMS CPS data correctly? CPS	4	2395	September 3, 2025
Survey design in Ipums International INTERNATIONAL	2	41	May 8, 2026
Strange values form sryvr package USA	3	1006	August 16, 2021

Does anyone have sample code for using svydesign function in R?

survey package instructions (Person Weights)

srvyr package instructions (Person Weights)

Related topics