I’m doing a longitudinal analysis from 1970 to 2016 with compositional variables, such as percent Hispanic at the county level. For the Hispanic indicator, the time series table recommends NT24 for 1970, which includes the following four categories:
C11001: Any of five Spanish categories of the question on “origin or descent”
C11002: Puerto-Rican birth or parentage
C11003: Spanish language
C11004: Not of “Spanish language” but of Spanish surname (in 5 Southwestern states only)
Since I am trying to calculate percent Hispanic for each county, is it correct to sum C11001 through C1104 as the numerator and to use the total population count from NT126 as the denominator? Or, would you only use C11001 and C11002 in the numerator?