AMBIGQA: Answering Ambiguous Open-domain Questions

被引:0
|
作者
Min, Sewon [1 ,2 ]
Michael, Julian [1 ]
Hajishirzi, Hannaneh [1 ,3 ]
Zettlemoyer, Luke [1 ,2 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Facebook AI Res, Menlo Pk, CA 94025 USA
[3] Allen Inst Artificial Intelligence, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ambiguity is inherent to open-domain question answering; especially when exploring new topics, it can be difficult to ask questions that have a single, unambiguous answer. In this paper, we introduce AMBIGQA, a new open-domain question answering task which involves finding every plausible answer, and then rewriting the question for each one to resolve the ambiguity. To study this task, we construct AMBIGNQ, a dataset covering 14,042 questions from NQ- OPEN, an existing opendomain QA benchmark. We find that over half of the questions in NQ-OPEN are ambiguous, with diverse sources of ambiguity such as event and entity references. We also present strong baseline models for AMBIGQA which we show benefit from weakly supervised learning that incorporates NQ-OPEN, strongly suggesting our new task and data will support significant future research effort. Our data and baselines are available at https://nlp.cs.washington.edu/ambigqa.
引用
收藏
页码:5783 / 5797
页数:15
相关论文
共 50 条
  • [31] A New Approach For Open-Domain Question Answering System
    Alturani, Ibrahim Mahmoud Ibrahim
    Bin Hamzah, Mohd Pouzi
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (06): : 100 - 103
  • [32] Dense Passage Retrieval for Open-Domain Question Answering
    Karpukhin, Vladimir
    Oguz, Barlas
    Min, Sewon
    Lewis, Patrick
    Wu, Ledell
    Edunov, Sergey
    Chen, Danqi
    Yih, Wen Tau
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6769 - 6781
  • [33] A dataset and baselines for sequential open-domain question answering
    Elgohary, Ahmed
    Zhao, Chen
    Boyd-Graber, Jordan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 1077 - 1083
  • [34] Using clustering approaches to open-domain question answering
    Wu, Youzheng
    Kashioka, Hideki
    Zhao, Jun
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 506 - +
  • [35] CREPE: Open-Domain Question Answering with False Presuppositions
    Yu, Xinyan Velocity
    Min, Sewon
    Zettlemoyer, Luke
    Hajishirzi, Hannaneh
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10457 - 10480
  • [36] Open-Domain Non-factoid Question Answering
    Khvalchik, Maria
    Kulkarni, Anagha
    TEXT, SPEECH, AND DIALOGUE, TSD 2017, 2017, 10415 : 290 - 298
  • [37] Hybrid Hierarchical Retrieval for Open-Domain Question Answering
    Arivazhagan, Manoj Ghuhan
    Li, Lan
    Qi, Peng
    Chen, Xinchi
    Wang, William Yang
    Huang, Zhiheng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10680 - 10689
  • [38] Visual Explanation for Open-Domain Question Answering With BERT
    Shao, Zekai
    Sun, Shuran
    Zhao, Yuheng
    Wang, Siyuan
    Wei, Zhongyu
    Gui, Tao
    Turkay, Cagatay
    Chen, Siming
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 3779 - 3797
  • [39] Learning Transferable Features for Open-Domain Question Answering
    Zuin, Gianlucca
    Chaimowicz, Luiz
    Veloso, Adriano
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [40] Reading Wikipedia to Answer Open-Domain Questions
    Chen, Danqi
    Fisch, Adam
    Weston, Jason
    Bordes, Antoine
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879