Hello,
I am working with linked historical census data and would like to estimate the total number of children ever born for men and women aged 40 and older.
I realize that this is not directly observed for all persons/years. My question is whether there is any recommended or reasonably defensible way to approximate this quantity when a direct fertility variable is unavailable.
One possible strategy I have considered is linking an individual across censuses and then taking the maximum number of co-resident own children ever observed in that person’s household across waves. But I am not sure this is appropriate, for several reasons:
-
Older children may already have left the parental household by the next census.
-
If births are widely spaced, siblings may never appear together in the same household at any one census.
-
Children themselves can sometimes be linked across censuses, but not all children link successfully. When a child observed in two censuses cannot be linked, it becomes difficult to know whether this is the same child observed twice or two different children.
-
If one child disappears between censuses, it is hard to distinguish between leaving home, death, and linkage failure.
So my main question is: when CHBORN is not available, is there any best practice in IPUMS linked historical census data for approximating completed fertility or total children ever born? For example, have people used a defensible lower-bound/upper-bound approach, an own-child-style reconstruction, or some other method?
Thank you very much.