| Title | A multi-stage large language model framework for extracting suicide-related social determinants of health. |
| Publication Type | Journal Article |
| Year of Publication | 2025 |
| Authors | Wang S, Wei Y, Ma H, Lovitt M, Deng K, Meng Y, Xu Z, Zhang J, Xiao Y, Ding Y, Xu X, Ghosh J, Peng Y |
| Journal | Commun Med (Lond) |
| Volume | 5 |
| Issue | 1 |
| Pagination | 404 |
| Date Published | 2025 Sep 29 |
| ISSN | 2730-664X |
| Abstract | BACKGROUND: Understanding social determinants of health (SDoH) factors contributing to suicide incidents is crucial for early intervention and prevention. However, data-driven approaches to this goal face challenges such as long-tailed factor distributions, analyzing pivotal stressors preceding suicide incidents, and limited model explainability. METHODS: We present a multi-stage large language model framework to enhance SDoH factor extraction from unstructured text. Our approach was compared to other state-of-the-art language models (i.e., pre-trained BioBERT and GPT-3.5-turbo) and reasoning models (i.e., DeepSeek-R1). We also evaluated how the model's explanations help people annotate SDoH factors more quickly and accurately. The analysis included both automated comparisons and a pilot user study. RESULTS: We show that our proposed framework demonstrates performance boosts in the overarching task of extracting SDoH factors and in the finer-grained tasks of retrieving relevant context. Additionally, we show that fine-tuning a smaller, task-specific model achieves comparable or better performance with reduced inference costs. The multi-stage design not only enhances extraction but also provides intermediate explanations, improving model explainability. CONCLUSIONS: Our approach improves both the accuracy and transparency of extracting suicide-related SDoH from unstructured texts. These advancements have the potential to support early identification of individuals at risk and inform more effective prevention strategies. |
| DOI | 10.1038/s43856-025-01114-z |
| Alternate Journal | Commun Med (Lond) |
| PubMed ID | 41023090 |
| PubMed Central ID | PMC12480878 |
