Deep learning with noisy labels in medical prediction problems: a scoping review.

Submitted by yip4002 on August 1, 2024 - 2:47am

Title	Deep learning with noisy labels in medical prediction problems: a scoping review.
Publication Type	Journal Article
Year of Publication	2024
Authors	Wei Y, Deng Y, Sun C, Lin M, Jiang H, Peng Y
Journal	J Am Med Inform Assoc
Volume	31
Issue	7
Pagination	1596-1607
Date Published	2024 Jun 20
ISSN	1527-974X
Keywords	Biomedical Research, Deep Learning, Humans
Abstract	OBJECTIVES: Medical research faces substantial challenges from noisy labels attributed to factors like inter-expert variability and machine-extracted labels. Despite this, the adoption of label noise management remains limited, and label noise is largely ignored. To this end, there is a critical need to conduct a scoping review focusing on the problem space. This scoping review aims to comprehensively review label noise management in deep learning-based medical prediction problems, which includes label noise detection, label noise handling, and evaluation. Research involving label uncertainty is also included. METHODS: Our scoping review follows the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We searched 4 databases, including PubMed, IEEE Xplore, Google Scholar, and Semantic Scholar. Our search terms include "noisy label AND medical/healthcare/clinical," "uncertainty AND medical/healthcare/clinical," and "noise AND medical/healthcare/clinical." RESULTS: A total of 60 papers met inclusion criteria between 2016 and 2023. A series of practical questions in medical research are investigated. These include the sources of label noise, the impact of label noise, the detection of label noise, label noise handling techniques, and their evaluation. Categorization of both label noise detection methods and handling techniques are provided. DISCUSSION: From a methodological perspective, we observe that the medical community has been up to date with the broader deep-learning community, given that most techniques have been evaluated on medical data. We recommend considering label noise as a standard element in medical research, even if it is not dedicated to handling noisy labels. Initial experiments can start with easy-to-implement methods, such as noise-robust loss functions, weighting, and curriculum learning.
DOI	10.1093/jamia/ocae108
Alternate Journal	J Am Med Inform Assoc
PubMed ID	38814164
PubMed Central ID	PMC11187424
Grant List	R01 LM014306 / LM / NLM NIH HHS / United States R01LM014306 / LM / NLM NIH HHS / United States 2145640 / / National Science Foundation /