Title | A natural language processing approach to detect inconsistencies in death investigation notes attributing suicide circumstances. |
Publication Type | Journal Article |
Year of Publication | 2024 |
Authors | Wang S, Zhou Y, Han Z, Tao C, Xiao Y, Ding Y, Ghosh J, Peng Y |
Journal | Commun Med (Lond) |
Volume | 4 |
Issue | 1 |
Pagination | 199 |
Date Published | 2024 Oct 14 |
ISSN | 2730-664X |
Abstract | BACKGROUND: Data accuracy is essential for scientific research and policy development. The National Violent Death Reporting System (NVDRS) data is widely used for discovering the patterns and causing factors of death. Recent studies suggested the annotation inconsistencies within the NVDRS and the potential impact on erroneous suicide-circumstance attributions. METHODS: We present an empirical Natural Language Processing (NLP) approach to detect annotation inconsistencies and adopt a cross-validation-like paradigm to identify possible label errors. We analyzed 267,804 suicide death incidents between 2003 and 2020 from the NVDRS. We measured annotation inconsistency by the degree of changes in the F-1 score. RESULTS: Our results show that incorporating the target state's data into training the suicide-circumstance classifier brings an increase of 5.4% to the F-1 score on the target state's test set and a decrease of 1.1% on other states' test set. CONCLUSIONS: To conclude, we present an NLP framework to detect the annotation inconsistencies, show the effectiveness of identifying and rectifying possible label errors, and eventually propose an improvement solution to improve the coding consistency of human annotators. |
DOI | 10.1038/s43856-024-00631-7 |
Alternate Journal | Commun Med (Lond) |
PubMed ID | 39397053 |
PubMed Central ID | PMC11471859 |
Grant List | P30DA040500 / / U.S. Department of Health & Human Services | NIH | National Institute on Drug Abuse (NIDA) / OT2OD032581 / / U.S. Department of Health & Human Services | National Institutes of Health (NIH) / OT2 OD032581 / OD / NIH HHS / United States 1R01AI130460 / / U.S. Department of Health & Human Services | National Institutes of Health (NIH) / RF1AG072799 / / U.S. Department of Health & Human Services | National Institutes of Health (NIH) / R01 AI130460 / AI / NIAID NIH HHS / United States RF1 AG072799 / AG / NIA NIH HHS / United States P30 DA040500 / DA / NIDA NIH HHS / United States |