I generated a time series report for CV4. Persons by Hispanic or Latino Origin [2] by Race [5*]. It looks like it pulled table P5. GROUP QUARTERS POPULATION BY MAJOR GROUP QUARTERS TYPE from the PL data rather than P5. HISPANIC OR LATINO ORIGIN BY RACE from the DHC. Anyone else run into this issue? Any solutions?
Thanks for reporting this issue. As it happens, another user recently reported a problem with the 2020 data in time series table AE7, Persons by Hispanic or Latino Origin [2] by Race [6*], which includes several of the same time series as CV4. I figured out just a week ago what you found to be the case: that the 2020 data comes from table P5 in the PL94-171 dataset but should come from P5 in the DHC.
We’re planning to work on a solution soon. (Our metadata that defines the tables is correct in this case, but our process that builds the data is not operating on that metadata correctly, so we’ll need to do some development to determine where the error is occurring in our code and correct it.)
Until we’ve released corrected time series, you could get the correct data from the corresponding source table (P5) from the 2020 DHC, or, if you’d be OK using 2010 census geography for your application, you could use our geographically standardized time series (which are “Standardized to 2010”). It appears that this problem affects only our nominally integrated time series and not our geographically standardized time series.
Thanks! FWIW, it looks like it only affects white alone, Non Hispanic/Latino and Black/AA alone, Non Hispanic/Latino. I ended up grabbing the data directly from the census API and mushed it all together. At some point, I’ll check back and rerun it without the extra side quest to mesh data sources. Cheers!