ML-Net: multi-label classification of biomedical texts with deep neural networks

Submitted by yip4002 on November 24, 2020 - 9:29pm

Title	ML-Net: multi-label classification of biomedical texts with deep neural networks
Publication Type	Journal Article
Year of Publication	2019
Authors	Du J, Chen Q, Peng Y, Xiang Y, Tao C, Lu Z
Journal	J Am Med Inform Assoc
Volume	26
Issue	11
Pagination	1279-1285
Date Published	2019 11 01
ISSN	1527-974X
Abstract	OBJECTIVE: In multi-label text classification, each textual document is assigned 1 or more labels. As an important task that has broad applications in biomedicine, a number of different computational methods have been proposed. Many of these methods, however, have only modest accuracy or efficiency and limited success in practical use. We propose ML-Net, a novel end-to-end deep learning framework, for multi-label classification of biomedical texts. MATERIALS AND METHODS: ML-Net combines a label prediction network with an automated label count prediction mechanism to provide an optimal set of labels. This is accomplished by leveraging both the predicted confidence score of each label and the deep contextual information (modeled by ELMo) in the target document. We evaluate ML-Net on 3 independent corpora in 2 text genres: biomedical literature and clinical notes. For evaluation, we use example-based measures, such as precision, recall, and the F measure. We also compare ML-Net with several competitive machine learning and deep learning baseline models. RESULTS: Our benchmarking results show that ML-Net compares favorably to state-of-the-art methods in multi-label classification of biomedical text. ML-Net is also shown to be robust when evaluated on different text genres in biomedicine. CONCLUSION: ML-Net is able to accuractely represent biomedical document context and dynamically estimate the label count in a more systematic and accurate manner. Unlike traditional machine learning methods, ML-Net does not require human effort for feature engineering and is a highly efficient and scalable approach to tasks with a large set of labels, so there is no need to build individual classifiers for each separate label.
DOI	10.1093/jamia/ocz085
Alternate Journal	J Am Med Inform Assoc
PubMed ID	31233120
Grant List	R01 LM010681 / LM / NLM NIH HHS / United States R01 LM011829 / LM / NLM NIH HHS / United States