Dear IPUMS users,
I am planning to use census data from Malawi to investigate migration patterns. I also downloaded the 2008 Malawi emigration supplemental file.
According to the description, the ‘sample’ and ‘serial’ variables should identify an observation and allow the two datasets to be merged.
However, I noticed that the main census dataset has a ‘person’ variable which reports the number of persons in a given HH.
Since it is not possible to merge the data using the hierarchical structure of IPUMS data when both household- and person-level variables are selected (the merge reports that the identifier is unique neither in the census nor in the supplemental dataset), I thought I could append the external migration dataset instead.
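To illustrate the problem, here is a minimal sketch of the uniqueness check on ‘sample’ and ‘serial’. The records are made-up toy values, not actual IPUMS data, and the helper name `duplicated_keys` is only for illustration.

```python
from collections import Counter

# Toy records with made-up values (NOT actual IPUMS data).
# Each row is one person; several persons share a household key.
census = [
    {"sample": 1, "serial": 1000},
    {"sample": 1, "serial": 1000},
    {"sample": 1, "serial": 2000},
]
emigrants = [
    {"sample": 1, "serial": 1000},
    {"sample": 1, "serial": 1000},
    {"sample": 1, "serial": 2000},
]

def duplicated_keys(rows, key_fields):
    """Return the key combinations that occur on more than one row."""
    counts = Counter(tuple(row[f] for f in key_fields) for row in rows)
    return {key: n for key, n in counts.items() if n > 1}

# A non-empty result on both sides means the key identifies households,
# not persons, so a one-to-one merge on it cannot work.
census_dups = duplicated_keys(census, ("sample", "serial"))
emigrant_dups = duplicated_keys(emigrants, ("sample", "serial"))
print(census_dups, emigrant_dups)
```

In this toy case both files repeat the same household key, which is the situation the merge complains about.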
However, I already have records in the dataset for each member of the household, as reported by the ‘person’ variable. Appending the supplemental file would add further household members.
What I would like to know is: should these additional persons be added to the total household size? Or do the persons in the household already include those reported in the supplemental migration dataset, so that appending would duplicate them? In the latter case, is there any way to match the people who migrated, as recorded in the supplemental data file, with those in the main census data?