A Semi-automatic Data Generator for Query Answering

被引:0
|
作者
Angiulli, Fabrizio [1 ]
Del Prete, Alessandra [1 ]
Fassetti, Fabio [1 ]
Nistico, Simona [1 ]
机构
[1] Univ Calabria, DIMES Dept, Arcavacata Di Rende, Italy
关键词
D O I
10.1007/978-3-031-16564-1_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question Answering (QA) is a critical NLP task mainly based on deep learning models that allow users to answer questions in natural language and get a response. Since available general-purpose datasets are often not effective enough to suitably train a QA model, one of the main problems in this context is related to the availability of datasets which fit the considered context. Moreover, such datasets are generally in English, making QA system design in different languages difficult. To alleviate the above-depicted issues, in this work, we propose a framework which automatically generates a dataset for a given language and a given topic. To train our system in any language, an alternative way to evaluate the quality of the answers is needed, so we propose a novel unsupervised method. To test the proposed technique, we generate a dataset for the topic "computer science" and the language "Italian" and compare the performance of a QA system trained on available datasets and the built one.
引用
收藏
页码:106 / 114
页数:9
相关论文
共 50 条
  • [21] Semi-automatic Spine Segmentation Method of CT Data
    Mateusiak, Malgorzata
    Mikolajczyk, Krzysztof
    MECHATRONICS 2019: RECENT ADVANCES TOWARDS INDUSTRY 4.0, 2020, 1044 : 29 - 35
  • [22] Principles and methods for automatic and semi-automatic tissue segmentation in MRI data
    Wang, Lei
    Chitiboi, Teodora
    Meine, Hans
    Guenther, Matthias
    Hahn, Horst K.
    MAGNETIC RESONANCE MATERIALS IN PHYSICS BIOLOGY AND MEDICINE, 2016, 29 (02) : 95 - 110
  • [23] Data Mining Techniques for Semi-Automatic Signature Generation
    Tylman, Wojciech
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DEPENDABILITY OF COMPUTER SYSTEMS, 2009, : 210 - 217
  • [24] Semi-automatic Quality Control of Topographic Data Sets
    Helmholz, Petra
    Becker, Christian
    Breitkopf, Uwe
    Bueschenfeld, Torsten
    Busch, Andreas
    Braun, Carola
    Gruenreich, Dietmar
    Mueller, Soenke
    Ostermann, Joern
    Pahl, Martin
    Rottensteiner, Franz
    Vogt, Karsten
    Ziems, Marcel
    Heipke, Christian
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2012, 78 (09): : 959 - 972
  • [25] Principles and methods for automatic and semi-automatic tissue segmentation in MRI data
    Lei Wang
    Teodora Chitiboi
    Hans Meine
    Matthias Günther
    Horst K. Hahn
    Magnetic Resonance Materials in Physics, Biology and Medicine, 2016, 29 : 95 - 110
  • [26] Semi-automatic analysis of ultrasonic data on laminated plates
    Bertrand, Cédric
    Marrier, Philippe
    e-Journal of Nondestructive Testing, 2023, 28 (09):
  • [27] SEMI-AUTOMATIC SEGMENTATION OF SPEECH FOR OBTAINING SYNTHESIS DATA
    OLIVE, JP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S107 - S107
  • [28] SEMI-AUTOMATIC WELDING
    不详
    BRITISH WELDING JOURNAL, 1966, 13 (03): : 177 - &
  • [29] A Semantic Approach for Semi-Automatic Detection of Sensitive Data
    Akoka, Jacky
    Comyn-Wattiau, Isabelle
    Du Mouza, Cedric
    Fadili, Hammou
    Lammari, Nadira
    Metais, Elisabeth
    Cherfi, Samira
    INFORMATION RESOURCES MANAGEMENT JOURNAL, 2014, 27 (04) : 23 - 44
  • [30] SEMI-AUTOMATIC AND AUTOMATIC INSERTION.
    Greeninger, Marvin
    SME Technical Paper (Series) EE, 1980,