MA-MRC: A Multi-answer Machine Reading Comprehension Dataset

被引:0
|
作者
Yue, Zhiang [1 ]
Liu, Jingping [2 ]
Zhang, Cong [3 ]
Wang, Chao [4 ]
Jiang, Haiyun [5 ]
Zhang, Yue [2 ]
Tian, Xianyang [2 ]
Cen, Zhedong [2 ]
Xiao, Yanghua [1 ]
Ruan, Tong [2 ]
机构
[1] Fudan Univ, Shanghai, Peoples R China
[2] East China Univ Sci & Technol, Shanghai, Peoples R China
[3] AECC Sichuan Gas Turbine Estab, Mianyang, Sichuan, Peoples R China
[4] Shanghai Univ, Shanghai, Peoples R China
[5] Tencent AI Lab, Shenzhen, Peoples R China
关键词
Machine Reading Comprehension; Multiple Answer; Knowledge Graph;
D O I
10.1145/3539618.3592015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine reading comprehension (MRC) is an essential task for many question-answering applications. However, existing MRC datasets mainly focus on data with single answer and overlook multiple answers, which are common in the real world. In this paper, we aim to construct an MRC dataset with both data of single answer and multiple answers. To achieve this purpose, we design a novel pipeline method: data collection, data cleaning, question generation and test set annotation. Based on these procedures, we construct a high-quality multi-answer MRC dataset (MA-MRC) with 129K question-answer-context samples. We implement a sequence of baselines and carry out extensive experiments on MA-MRC. According to the experimental results, MA-MRC is a challenging dataset, which can facilitate the future research on the multi-answer MRC task(1).
引用
收藏
页码:2144 / 2148
页数:5
相关论文
共 50 条
  • [31] TibetanQA2.0: Dataset with Unanswerable Questions for Tibetan Machine Reading Comprehension
    Zhengcuo Dan
    Yuan Sun
    Data Intelligence, 2024, 6 (04) : 1158 - 1167
  • [32] MRC-PASCL: A Few-Shot Machine Reading Comprehension Approach via Post-Training and Answer Span-Oriented Contrastive Learning
    Li, Ren
    Xiao, Qiao
    Yang, Jianxi
    Zhang, Luyi
    Chen, Yu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 4838 - 4849
  • [33] Question answering model based on machine reading comprehension with knowledge enhancement and answer verification
    Yang, Ziming
    Sun, Yuxia
    Kuang, Qingxuan
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2022, 34 (12):
  • [34] Integrate Candidate Answer Extraction with Re-Ranking for Chinese Machine Reading Comprehension
    Zeng, Junjie
    Sun, Xiaoya
    Zhang, Qi
    Li, Xinmeng
    ENTROPY, 2021, 23 (03) : 1 - 19
  • [35] SQuAD-SRC: A Dataset for Multi-Accent Spoken Reading Comprehension
    Tang, Yixuan
    Tung, Anthony K. H.
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 5206 - 5214
  • [36] Developing Dataset of Japanese Slot Filling Quizzes Designed for Evaluation of Machine Reading Comprehension
    Watarai, Takuto
    Tsuchiya, Masatoshi
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6895 - 6901
  • [37] DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications
    He, Wei
    Liu, Kai
    Liu, Jing
    Lyu, Yajuan
    Zhao, Shiqi
    Xiao, Xinyan
    Liu, Yuan
    Wang, Yizhong
    Wu, Hua
    She, Qiaoqiao
    Liu, Xuan
    Wu, Tian
    Wang, Haifeng
    MACHINE READING FOR QUESTION ANSWERING, 2018, : 37 - 46
  • [38] FinBERT–MRC: Financial Named Entity Recognition Using BERT Under the Machine Reading Comprehension Paradigm
    Yuzhe Zhang
    Hong Zhang
    Neural Processing Letters, 2023, 55 : 7393 - 7413
  • [39] MRC4BioER: Joint extraction of biomedical entities and relations in the machine reading comprehension framework
    Sun, Cong
    Yang, Zhihao
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 125
  • [40] Robustness-Eva-MRC: Assessing and analyzing the robustness of neural models in extractive machine reading comprehension
    Fang, Jingliang
    Xu, Hua
    Wu, Zhijing
    Gao, Kai
    Che, Xiaoyin
    Hui, Haotian
    INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20