Publications

Journal Article
Lin M, Hou B, Mishra S, Yao T, Huo Y, Yang Q, Wang F, Shih G, Peng Y. 2023. Enhancing thoracic disease detection using chest X-rays from PubMed Central Open Access. Comput Biol Med. 159:106962.
Idnay B, Xu Z, Adams WG, Adibuzzaman M, Anderson NR, Bahroos N, Bell DS, Bumgardner C, Campion T, Castro M et al. 2025. Environment scan of generative AI infrastructure for clinical and translational science. Npj Health Syst. 2(1):4.
Xu Z, Lin M, Zhou Y, Xu Z, Orlow SJ, Meehan SA, Flamm A, Moshiri AS, Peng Y. 2026. Establishing dermatopathology encyclopedia DermpathNet with Artificial Intelligence-Based Workflow. Sci Data. 13(1)
Lin M, Xiao Y, Hou B, Wanyan T, Sharma MManoj, Wang Z, Wang F, Van Tassel S, Peng Y. 2023. Evaluate underdiagnosis and overdiagnosis bias of deep learning model on primary open-angle glaucoma diagnosis in under-served populations. AMIA Jt Summits Transl Sci Proc. 2023:370-377.
Sun Z, Ong H, Kennedy P, Tang L, Chen S, Elias J, Lucas E, Shih G, Peng Y. 2023. Evaluating GPT4 on Impressions Generation in Radiology Reports. Radiology. 307(5):e231259.
Zhou Y, Ong H, Kennedy P, Wu CC, Kazam J, Hentel K, Flanders A, Shih G, Peng Y. 2024. Evaluating GPT-V4 (GPT-4 with Vision) on Detection of Radiologic Findings on Chest Radiographs. Radiology. 311(2):e233270.
Tang L, Sun Z, Idnay B, Nestor JG, Soroush A, Elias PA, Xu Z, Ding Y, Durrett G, Rousseau JF et al. 2023. Evaluating large language models on medical evidence summarization. NPJ Digit Med. 6(1):158.
Kang T, Sun Y, Kim JHyun, Ta C, Perotte A, Schiffer K, Wu M, Zhao Y, Moustafa-Fahmy N, Peng Y et al. 2023. EvidenceMap: a three-level knowledge representation for medical evidence computation and comprehension. J Am Med Inform Assoc.
Zhou Y, Newbury AM, Zhang G, Idnay BRoss, Liu H, Weng C, Peng Y. 2025. EvidenceOutcomes: A Dataset of Clinical Trial Publications with Clinically Meaningful Outcomes. Stud Health Technol Inform. 329:723-727.
Flanders AE, Wang X, Wu CC, Kitamura FC, Shih G, Mongan J, Peng Y. 2025. The Evolution of Radiology Image Annotation in the Era of Large Language Models. Radiol Artif Intell. 7(4):e240631.
Yang R, Li H, Wong MYu Heng, Ke Y, Li X, Yu K, Liao J, Liew JChong Kai, Nair SVinod, Ong JChiat Ling et al. 2026. The evolving landscape of large language models and non-large language models in health care. Npj Health Syst. 3
Gautam N, Elhusseiny A, Mansour M, Mehta JL, Andersen OS, Peng Y, Al'Aref SJ. 2025. Exploring the feasibility of using artificial intelligence to simulate the placebo arm of randomized clinical trials. Postgrad Med J.
Peng Y, Rios A, Kavuluru R, Lu Z. 2018. Extracting chemical-protein relations with ensembles of SVM and deep learning models. Database (Oxford). 2018
Bai Z, Xu Z, Sun C, Zang C, H Bunnell T, Sinfield C, Rutter J, Martinez AThomas, L Bailey C, Weiner M et al. 2025. Extracting post-acute sequelae of SARS-CoV-2 infection symptoms from clinical notes via hybrid natural language processing. Npj Health Syst. 2
Moukheiber D, Mahindre S, Moukheiber L, Moukheiber M, Wang S, Ma C, Shih G, Peng Y, Gao M. 2022. Few-Shot Learning Geometric Ensemble for Multi-label Classification of Chest X-Rays. Data Augment Label Imperfections (2022). 13567:112-122.
Wang Z, Cao L, Jin Q, Chan J, Wan N, Afzali B, Cho H-J, Choi C-I, Emamverdi M, Gill MK et al. 2025. A foundation model for human-AI collaboration in medical literature mining. Nat Commun. 16(1):8361.
Tam TYu Chow, Sivarajkumar S, Kapoor S, Stolyar AV, Polanska K, McCarthy KR, Osterhoudt H, Wu X, Visweswaran S, Fu S et al. 2024. A framework for human evaluation of large language models in healthcare derived from literature review. NPJ Digit Med. 7(1):258.
Peng Y, Malin BA, Rousseau JF, Wang Y, Xu Z, Xu X, Weng C, Bian J. 2025. From GPT to DeepSeek: Significant gaps remain in realizing AI in healthcare. J Biomed Inform. 163:104791.
Peng Y, Torii M, Wu CH, Vijay-Shanker K. 2014. A generalizable NLP framework for fast development of pattern-based biomedical relation extraction systems. BMC Bioinformatics. 15:285.
Paul A, Shen TC, Lee S, Balachandar N, Peng Y, Lu Z, Summers RM. 2021. Generalized Zero-shot Chest X-ray Diagnosis through Trait-Guided Multi-view Semantic Embedding with Self-training. IEEE Trans Med Imaging.
Sun C, Teichman K, Zhou Y, Critelli B, Nauheim D, Keir G, Wang X, Zhong J, Flanders AE, Shih G et al. 2025. Generative Large Language Models Trained for Detecting Errors in Radiology Reports. Radiology. 315(2):e242575.
Wang S, Zhu Y, Lee S, Elton DC, Shen TC, Tang Y, Peng Y, Lu Z, Summers RM. 2022. Global-Local attention network with multi-task uncertainty loss for abnormal lymph node detection in MR images. Med Image Anal. 77:102345.
Holste G, Lin M, Zhou R, Wang F, Liu L, Yan Q, Van Tassel SH, Kovacs K, Chew EY, Lu Z et al. 2024. Harnessing the power of longitudinal medical imaging for eye disease prognosis using Transformer-based sequence modeling. NPJ Digit Med. 7(1):216.
Jin Q, Chen F, Zhou Y, Xu Z, Cheung JM, Chen R, Summers RM, Rousseau JF, Ni P, Landsman MJ et al. 2024. Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine. NPJ Digit Med. 7(1):190.
Holste G, Jiang Z, Jaiswal A, Hanna M, Minkowitz S, Legasto AC, Escalon JG, Steinberger S, Bittman M, Shen TC et al. 2023. How Does Pruning Impact Long-Tailed Multi-label Medical Image Classifiers? Med Image Comput Comput Assist Interv. 14224:663-673.