| Title | Less Likely Brainstorming: Using Language Models to Generate Alternative Hypotheses. |
| Publication Type | Journal Article |
| Year of Publication | 2023 |
| Authors | Tang L, Peng Y, Wang Y, Ding Y, Durrett G, Rousseau JF |
| Journal | Proc Conf Assoc Comput Linguist Meet |
| Volume | 2023 |
| Pagination | 12532-12555 |
| Date Published | 2023 Jul |
| ISSN | 0736-587X |
| Abstract | A human decision-maker benefits the most from an AI assistant that corrects for their biases. For problems such as generating interpretation of a radiology report given findings, a system predicting only highly likely outcomes may be less useful, where such outcomes are already obvious to the user. To alleviate biases in human decision-making, it is worth considering a broad differential diagnosis, going beyond the most likely options. We introduce a new task, "less likely brainstorming," that asks a model to generate outputs that humans think are relevant but less likely to happen. We explore the task in two settings: a brain MRI interpretation generation setting and an everyday commonsense reasoning setting. We found that a baseline approach of training with less likely hypotheses as targets generates outputs that humans evaluate as either likely or irrelevant nearly half of the time; standard MLE training is not effective. To tackle this problem, we propose a controlled text generation method that uses a novel contrastive learning strategy to encourage models to differentiate between generating likely and less likely outputs according to humans. We compare our method with several state-of-the-art controlled text generation models via automatic and human evaluations and show that our models' capability of generating less likely outputs is improved. |
| DOI | 10.18653/v1/2023.findings-acl.794 |
| Alternate Journal | Proc Conf Assoc Comput Linguist Meet |
| PubMed ID | 37701928 |
| PubMed Central ID | PMC10494958 |
| Grant List | R00 LM013001 / LM / NLM NIH HHS / United States R01 LM014306 / LM / NLM NIH HHS / United States |
