To calculate the civilian labor force, we want to include all employed and unemployed individuals aged 16 to 65, excluding inactive individuals and those in the armed forces. Our code therefore reads as follows:
EMPSTATD %in% c(100:120, 140:240) & AGE %in% 16:65 & !OCCISCO %in% c(10, 97:99)
As the code above shows, we use both EMPSTATD and OCCISCO to exclude individuals in the armed forces from our sample. In doing so, however, we assume that both variables have similar coverage. Is this a fair assumption, or should we include only one variable (either EMPSTATD or OCCISCO) in our code? If so, which variable would be preferable for our purposes?
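One way to probe the coverage assumption ourselves would be to cross-tabulate the two armed-forces flags and look at where they disagree. The sketch below uses a made-up toy data frame rather than a real extract. It assumes that the EMPSTATD codes in the 130s (the range skipped by our filter) mark the armed forces and that OCCISCO code 10 does the same. Both assumptions should be checked against the codebook for the samples in question.

```r
# Toy data standing in for a real extract; all values are invented for illustration.
toy <- data.frame(
  EMPSTATD = c(110, 110, 130, 130, 200, 120),
  OCCISCO  = c(5, 10, 10, 99, 99, 3)
)

# Assumption: EMPSTATD codes 130-139 flag the armed forces
# (this is the range our filter leaves out between 120 and 140).
armed_empstatd <- toy$EMPSTATD %in% 130:139

# Assumption: OCCISCO code 10 flags armed forces occupations.
armed_occisco <- toy$OCCISCO %in% 10

# 2x2 table: the off-diagonal cells count cases flagged by only one variable.
coverage <- table(EMPSTATD_armed = armed_empstatd, OCCISCO_armed = armed_occisco)
print(coverage)
```

Run per sample (for example, split by country and year), large off-diagonal counts would suggest that the two variables do not cover the armed forces equally, while near-empty off-diagonal cells would suggest that either variable alone gives much the same result.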