Is empstat in 1910 and 1930 derived from the occupational/gainful employment questions used to create labforce? If this is true, why are there some individuals who are employed by empstat but considered “not in the labor force” by labforce ? In principle, should the use of empstat provide us indpendent information about those considered out of the labor force by labforce?
My understanding of the creation of labforce prior to 1940 is that it was based off reporting of a “gainful occupation.” In contrast, empstat in 1910 and 1930 asked if the respondent worked on the day of the census (or last working day). From the enumerator instructions displayed on IPUMS’ website, it looks as though this question concerning work was only asked to the subset of individuals who report a gainful occupation - both sets of instructions state to only ask the question of those reporting an occupation. However, when you crosstab these two variables in the microdata, there are some employed/unemployed persons in empstat who report being out of the labor force in labforce. In addition, in 1930 there are a few people considered “not in the labor force” by empstat but considered “in the labor force” by labforce .
Is the discrepancy due my misunderstanding of the data? Or is it due to some cleaning issue such as illegible occupations? Or is this enumerator error?