Title | Less Likely Brainstorming: Using Language Models to Generate Alternative Hypotheses. |
Publication Type | Journal Article |
Year of Publication | 2023 |
Authors | Tang L, Peng Y, Wang Y, Ding Y, Durrett G, Rousseau JF |
Journal | Proc Conf Assoc Comput Linguist Meet |
Volume | 2023 |
Pagination | 12532-12555 |
Date Published | 2023 Jul |
ISSN | 0736-587X |
Abstract | A human decision-maker benefits the most from an AI assistant that corrects for their biases. For problems such as generating interpretation of a radiology report given findings, a system predicting only highly likely outcomes may be less useful, where such outcomes are already obvious to the user. To alleviate biases in human decision-making, it is worth considering a broad differential diagnosis, going beyond the most likely options. We introduce a new task, "less likely brainstorming," that asks a model to generate outputs that humans think are relevant but less likely to happen. We explore the task in two settings: a brain MRI interpretation generation setting and an everyday commonsense reasoning setting. We found that a baseline approach of training with less likely hypotheses as targets generates outputs that humans evaluate as either likely or irrelevant nearly half of the time; standard MLE training is not effective. To tackle this problem, we propose a controlled text generation method that uses a novel contrastive learning strategy to encourage models to differentiate between generating likely and less likely outputs according to humans. We compare our method with several state-of-the-art controlled text generation models via automatic and human evaluations and show that our models' capability of generating less likely outputs is improved. |
DOI | 10.18653/v1/2023.findings-acl.794 |
Alternate Journal | Proc Conf Assoc Comput Linguist Meet |
PubMed ID | 37701928 |
PubMed Central ID | PMC10494958 |
Grant List | R00 LM013001 / LM / NLM NIH HHS / United States R01 LM014306 / LM / NLM NIH HHS / United States |