iSimp: A sentence simplification system for biomedicail text

TitleiSimp: A sentence simplification system for biomedicail text
Publication TypeConference Proceedings
Year of Conference2012
AuthorsPeng Y, Tudor CO, Torii M, Wu CH, Vijay-Shanker K.
Conference NameIEEE International Conference on Bioinformatics and Biomedicine
Pagination211-216
Date Published10/2012
PublisherIEEE
Conference LocationPhiladelphia, PA, USA
ISBN Number978-1-4673-2560-8
Abstract

Text mining applications using natural language processing are often confronted with long and complicated sentences. This is observed particularly in the abstracts of scientific articles where authors summarize, in few sentences, the various facts described throughout the manuscript. Being rich in novel and important information, the abstract has been the primary target of biomedicai text mining applications. In this work, we aim to simplify complex sentences in abstracts of biomedicai text so that they can be readily processed by text mining applications. We focus on syntactic constructs that are frequently encountered in the biomedicai literature, such as coordinations, relative clauses, and appositions, with emphasis on their boundary detection. Our approach yielded good detection performance (average F-measure between 86.5% and 92.7%), and aided in improving biomedicai text mining applications, RLIMS-P and Rank Pref .

DOI10.1109/BIBM.2012.6392671