QED: A Framework and Dataset for Explanations in Question Answering

Cited by: 14
Authors
Lamm, Matthew [1, 4]
Palomaki, Jennimaria [2]
Alberti, Chris [2]
Andor, Daniel [2]
Choi, Eunsol [3, 4]
Soares, Livio Baldini [2]
Collins, Michael [2]
Affiliations
[1] Stanford Univ, Dept Linguist, Stanford, CA 94305 USA
[2] Google Res, Mountain View, CA USA
[3] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
[4] Google, Mountain View, CA 94043 USA
DOI
10.1162/tacl_a_00398
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A question answering system that, in addition to providing an answer, provides an explanation of the reasoning that leads to that answer has potential advantages in terms of debuggability, extensibility, and trust. To this end, we propose QED, a linguistically informed, extensible framework for explanations in question answering. A QED explanation specifies the relationship between a question and answer according to formal semantic notions such as referential equality, sentencehood, and entailment. We describe and publicly release an expert-annotated dataset of QED explanations built upon a subset of the Google Natural Questions dataset, and report baseline models on two tasks: post hoc explanation generation given an answer, and joint question answering and explanation generation. In the joint setting, a promising result suggests that training on a relatively small amount of QED data can improve question answering. In addition to describing the formal, language-theoretic motivations for the QED approach, we describe a large user study showing that the presence of QED explanations significantly improves the ability of untrained raters to spot errors made by a strong neural QA baseline.
Pages: 790-806
Page count: 17