Probabilistic data linkage: generating a reproductive histories dataset from states’ vital records data

Dib, Basma Nihad (2021) Probabilistic data linkage: generating a reproductive histories dataset from states’ vital records data. Master Essay, University of Pittsburgh.

PDF (Updated Version, 796kB)
Restricted to University of Pittsburgh users only until 6 January 2024.


Datasets that follow pregnancy histories over time are lacking. In this pilot study, we used Pennsylvania’s fetal death and birth records to generate a longitudinal maternal dataset by linking records for the same mother to each other. We explored how to best achieve this linkage when lacking a unique record identifier. We demonstrated how Stata’s existing probabilistic matching tools can use nonunique identifiers to facilitate record linkage. To validate the effectiveness of this probabilistic linkage, we compared its results to the linkage results generated from deterministically linking the records using social security numbers. Compared to the deterministic linkage, the probabilistic linkage had a sensitivity of 94.3% and a positive predictive value of 96.7%. Our pilot study can serve as a guide for researchers in other states to generate longitudinal maternal datasets from their states’ vital records. Such longitudinal datasets can be a valuable resource for conducting epidemiologic analyses in the field of maternal and child health and answering research questions that relate to the period between pregnancies. Results from these studies can be used to improve health outcomes of mothers and children.
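
The validation step described above, which scores the probabilistic links against the deterministic (social security number) links, can be sketched in a few lines. This is only an illustration in Python, not the essay's Stata workflow: the function name `linkage_metrics` and the record IDs are hypothetical, and the toy numbers are not the study's results. It assumes each linkage is expressed as a collection of unordered record-ID pairs and treats the deterministic linkage as the reference standard.

```python
def linkage_metrics(reference_pairs, candidate_pairs):
    """Score candidate links against a reference linkage.

    Each argument is an iterable of (record_id_a, record_id_b) pairs.
    Pairs are treated as unordered, so (1, 2) and (2, 1) are the same link.
    """
    ref = {frozenset(p) for p in reference_pairs}
    cand = {frozenset(p) for p in candidate_pairs}

    true_pos = len(ref & cand)    # links found by both methods
    false_neg = len(ref - cand)   # reference links the candidate missed
    false_pos = len(cand - ref)   # candidate links absent from the reference

    # Sensitivity: share of reference links that the candidate recovered.
    sensitivity = true_pos / (true_pos + false_neg) if ref else float("nan")
    # Positive predictive value: share of candidate links that are correct.
    ppv = true_pos / (true_pos + false_pos) if cand else float("nan")

    return {
        "true_pos": true_pos,
        "false_neg": false_neg,
        "false_pos": false_pos,
        "sensitivity": sensitivity,
        "ppv": ppv,
    }


# Hypothetical toy data: record IDs are invented for illustration only.
deterministic = [(1, 2), (3, 4), (5, 6), (7, 8)]    # reference (SSN-based) links
probabilistic = [(2, 1), (3, 4), (5, 6), (9, 10)]   # links from nonunique identifiers

metrics = linkage_metrics(deterministic, probabilistic)
print(metrics)
```

In this toy example three of the four reference links are recovered and one probabilistic link is spurious, so both sensitivity and positive predictive value come out to 0.75. Treating the deterministic linkage as the reference standard means any errors in the social security numbers themselves would be counted against the probabilistic method.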



Item Type: Other Thesis, Dissertation, or Long Paper (Master Essay)
Status: Unpublished
Creator: Dib, Basma Nihad (email: bnd18@pitt.edu; Pitt username: bnd18)
Committee Chair: Bodnar, Lisa M. (email: bodnar@edc.pitt.edu; Pitt username: bodnar; ORCID: UNSPECIFIED)
Committee Member: Youk, Ada (email: ayouk@pitt.edu; Pitt username: ayouk; ORCID: UNSPECIFIED)
Committee Member: Parisi, Sara M. (email: smp101@pitt.edu; Pitt username: smp101; ORCID: UNSPECIFIED)
Date: 17 December 2021
Date Type: Completion
Access Restriction: 2 year -- Restrict access to University of Pittsburgh for a period of 2 years.
Number of Pages: 39
Institution: University of Pittsburgh
Schools and Programs: Graduate School of Public Health > Epidemiology
Degree: MPH - Master of Public Health
Thesis Type: Master Essay
Refereed: Yes
Date Deposited: 06 Jan 2022 14:39
Last Modified: 06 Jan 2022 14:39

