I am trying to look at migration trends between 2000 and 2010 and when I merged the two samples for the respective years, Curiously observations appear as follows (see Image 1)
the serial and pernum identifies the dataset uniquely. The above screen shot is just a quick browse of households with serial number 1000. But there are about 2.3 crore observations.
id2010a_di~5- district 5 years ago asked in 2010
id2000a_re~r district 5 yaers ago asked in 2000
Question is- What does the “.” under the variable id2010a_di~5 really mean? For every individual who has a value for variable id2000a_re~r there are no corresponding values-represented by"."- for id2010a_di~5 and vice versa. It cannot be the case that there has not been any migration from 2000 to 2010. So I am missing some link. Is it the interpretation of the “.”?
I used count if (id2010a_dist5!=id2000a_reg5yr) & id2010a_dist5!=. & id2000a_reg5yr!=.
and it said 0. Basically I expected that this would give me number of people who migrated between 2000 and 2010.