Enhancing yes/no question answering with weak supervision via extractive question answering

被引:1
|
作者
Dimitriadis, Dimitris [1 ]
Tsoumakas, Grigorios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Sch Informat, Thessaloniki 54124, Greece
关键词
Question answering; Yes/no question answering; Extractive question answering; Transformers;
D O I
10.1007/s10489-023-04751-w
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The effectiveness of natural language processing models relies on various factors, including the architecture, number of parameters, data used during training, and the tasks they were trained on. Recent studies indicate that models pre-trained on large corpora and fine-tuned on task-specific datasets, covering multiple tasks, can generate remarkable results across various benchmarks. We propose a new approach based on a straightforward hypothesis: improving model performance on a target task by considering other artificial tasks defined on the same training dataset. By doing so, the model can gain further insights into the training dataset and attain a greater understanding, improving efficiency on the target task. This approach differs from others that consider multiple pre-existing tasks on different datasets. We validate this hypothesis by focusing on the problem of answering yes/no questions and introducing a multi-task model that outputs a span of the reference text, serving as evidence for answering the question. The task of span extraction is an artificial one, designed to benefit the performance of the model answering yes/no questions. We acquire weak supervision for these spans, by using a pre-trained extractive question answering model, dispensing the need for costly human annotation. Our experiments, using modern transformer-based language models, demonstrate that this method outperforms the standard approach of training models to answer yes/no questions. Although the primary objective was to enhance the performance of the model in answering yes/no questions, it was discovered that span texts are a significant source of information. These spans, derived from the question reference texts, provided valuable insights for the users to better comprehend the answers to the questions. The model's improved accuracy in answering yes/no questions, coupled with the supplementary information provided by the span texts, led to a more comprehensive and informative user experience.
引用
收藏
页码:27560 / 27570
页数:11
相关论文
共 50 条
  • [31] Question Rewriting for Conversational Question Answering
    Vakulenko, Svitlana
    Longpre, Shayne
    Tu, Zhucheng
    Anantha, Raviteja
    WSDM '21: PROCEEDINGS OF THE 14TH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2021, : 355 - 363
  • [32] Extractive-Boolean Question Answering For Scientific Fact Checking
    Rakotoson, Loic
    Letaillieur, Charles
    Massip, Sylvain
    Laleye, Frejus A. A.
    1ST ACM INTERNATIONAL WORKSHOP ON MULTIMEDIA AI AGAINST DISINFORMATION, MAD 2022, 2022, : 27 - 34
  • [33] Artificial fine-tuning tasks for yes/no question answering
    Dimitriadis, Dimitris
    Tsoumakas, Grigorios
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (01) : 73 - 95
  • [34] Unsupervised Question Decomposition for Question Answering
    Perez, Ethan
    Lewis, Patrick
    Yih, Wen-tau
    Cho, Kyunghyun
    Kiela, Douwe
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8864 - 8880
  • [35] Question Modifiers in Visual Question Answering
    Britton, William
    Sarkhel, Somdeb
    Venugopal, Deepak
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1472 - 1479
  • [36] Embodied Question Answering
    Das, Abhishek
    Datta, Samyak
    Gkioxari, Georgia
    Lee, Stefan
    Parikh, Devi
    Batra, Dhruv
    PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 2135 - 2144
  • [37] Question answering in Spanish
    Vicedo, JL
    Izquierdo, R
    Llopis, F
    Muñoz, R
    COMPARATIVE EVALUATION OF MULTILINGUAL INFORMATION ACCESS SYSTEMS, 2003, 3237 : 541 - 548
  • [38] Personalized Question Answering
    Quarteroni, Silvia
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2010, 51 (01): : 97 - 123
  • [39] ANSWERING THE CUBAN QUESTION
    COLE, L
    NATION, 1983, 237 (07) : 194 - 194