Pseudo-labeling with transformers for improving Question Answering systems

被引:1
|
作者
Kuligowska, Karolina [1 ]
Kowalczuk, Bartlomiej [1 ]
机构
[1] Univ Warsaw, Fac Econ Sci, Dluga St 44-50, PL-00241 Warsaw, Poland
关键词
Natural Language Processing; Question Answering systems; pseudo-labeling; neural networks; transfer learning; knowledge distillation;
D O I
10.1016/j.procs.2021.08.119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Advances in neural networks contributed to the fast development of Natural Language Processing systems. As a result, Question Answering systems have evolved and can classify and answer questions in an intuitive yet communicative way. However, the lack of large volumes of labeled data prevents large-scale training and development of Question Answering systems, confirming the need for further research. This paper aims to handle this real-world problem of lack of labeled datasets by applying a pseudolabeling technique relying on a neural network transformer model DistiIBERT. In order to evaluate our contribution, we examined the performance of a text classification transformer model that was fine-tuned on the data subject to prior pseudo-labeling. Research has shown the usefulness of the applied pseudo-labeling technique on a neural network text classification transformer model DistiIBERT. The results of our analysis indicated that the model with additional pseudo-labeled data achieved the best results among other compared neural network architectures. Based on that result, Question Answering systems may be directly improved by enriching their training steps with additional data acquired cost-effectively. (C) 2021 The Authors. Published by Elsevier B.V.
引用
收藏
页码:1162 / 1169
页数:8
相关论文
共 50 条
  • [31] slimIPL: Language-Model-Free Iterative Pseudo-Labeling
    Likhomanenko, Tatiana
    Xu, Qiantong
    Kahn, Jacob
    Synnaeve, Gabriel
    Collobert, Ronan
    INTERSPEECH 2021, 2021, : 741 - 745
  • [32] Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
    Higuchi, Yosuke
    Moritz, Niko
    Le Roux, Jonathan
    Hori, Takaaki
    INTERSPEECH 2021, 2021, : 726 - 730
  • [33] User Feedback for Improving Question Categorization in Web-Based Question Answering Systems
    Song, Wanpeng
    Liu Wenyin
    Gu, Naijie
    Quan, Xiaojun
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 148 - +
  • [34] Informative pseudo-labeling for graph neural networks with few labels
    Li, Yayong
    Yin, Jie
    Chen, Ling
    DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 37 (01) : 228 - 254
  • [35] UNCERTAINTY-AWARE PSEUDO-LABELING FOR SPOKEN LANGUAGE ASSESSMENT
    Lin, Binghuai
    Wang, Liyuan
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 885 - 891
  • [36] Spatial pseudo-labeling for semi-supervised facies classification
    Asghar, Saleem
    Choi, Junhwan
    Yoon, Daeung
    Byun, Joongmoo
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2020, 195
  • [37] Pseudo-Labeling for Small Lesion Detection on Diabetic Retinopathy Images
    Chen, Qilei
    Liu, Ping
    Ni, Jing
    Cao, Yu
    Liu, Benyuan
    Zhang, Honggang
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [38] Question Answering Chatbots for Biomedical Research using Transformers
    Xygi, Evdokia
    Andriopoulos, Andreas D.
    Koutsomitropoulos, Dimitrios A.
    2023 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION, ICAIIC, 2023, : 25 - 29
  • [39] A comparative study of language transformers for video question answering
    Yang, Zekun
    Garcia, Noa
    Chu, Chenhui
    Otani, Mayu
    Nakashima, Yuta
    Takemura, Haruo
    NEUROCOMPUTING, 2021, 445 : 121 - 133
  • [40] Cascade transformers with dynamic attention for video question answering
    Jiang, Yimin
    Yan, Tingfei
    Yao, Mingze
    Wang, Huibing
    Liu, Wenzhe
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 242