Evaluating the performance of multilingual models in answer extraction and question generation

被引:0
|
作者
Moreno-Cediel, Antonio [1 ]
del-Hoyo-Gabaldon, Jesus-Angel [1 ]
Garcia-Lopez, Eva [1 ]
Garcia-Cabot, Antonio [1 ]
de-Fitero-Dominguez, David [1 ]
机构
[1] Univ Alcala, Dept Ciencias Comp, Alcala De Henares 28805, Spain
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
D O I
10.1038/s41598-024-66472-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multiple-choice test generation is one of the most complex NLP problems, especially in languages other than English, where there is a lack of prior research. After a review of the literature, it has been verified that some methods like the usage of rule-based systems or primitive neural networks have led to the application of a recent architecture, the Transformer architecture, in the tasks of Answer Extraction (AE) and Question Generation (QG). Thereby, this study is centred in searching and developing better models for the AE and QG tasks in Spanish, using an answer-aware methodology. For this purpose, three multilingual models (mT5-base, mT0-base and BLOOMZ-560 M) have been fine-tuned using three different datasets: a translation to Spanish of the SQuAD dataset; SQAC, which is a dataset in Spanish; and their union (SQuAD + SQAC), which shows slightly better results. Regarding the models, the performance of mT5-base has been compared with that found in two newer models, mT0-base and BLOOMZ-560 M. These models were fine-tuned for multiple tasks in literature, including AE and QG, but, in general, the best results are obtained from the mT5 models trained in our study with the SQuAD + SQAC dataset. Nonetheless, some other good results are obtained from mT5 models trained only with the SQAC dataset. For their evaluation, the widely used BLEU1-4, METEOR and ROUGE-L metrics have been obtained, where mT5 outperforms some similar research works. Besides, CIDEr, SARI, GLEU, WER and the cosine similarity metrics have been calculated to present a benchmark within the AE and QG problems for future work.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] On the Generation of Medical Question-Answer Pairs
    Shen, Sheng
    Li, Yaliang
    Du, Nan
    Wu, Xian
    Xie, Yusheng
    Ge, Shen
    Yang, Tao
    Wang, Kai
    Liang, Xingzheng
    Fan, Wei
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8822 - 8829
  • [22] Epidemic Question Answering: question generation and entailment for Answer Nugget discovery
    Weinzierl, Maxwell A.
    Harabagiu, Sanda M.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (02) : 329 - 339
  • [23] Graph Guided Question Answer Generation for Procedural Question-Answering
    Pham, Hai X.
    Hadji, Isma
    Xu, Xinnuo
    Degutyte, Ziedune
    Rainey, Jay
    Kazakos, Evangelos
    Fazly, Afsaneh
    Tzimiropoulos, Georgios
    Martinez, Brais
    PROCEEDINGS OF THE 18TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 2501 - 2525
  • [24] Multimodal representative answer extraction in community question answering
    Li, Ming
    Ma, Yating
    Li, Ying
    Bai, Yixue
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [25] Context-Aware Answer Extraction in Question Answering
    Seonwoo, Yeon
    Kin, Ji-Hoon
    Ha, Jung -Woo
    Oh, Alice
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2418 - 2428
  • [26] Answer Extraction Algorithm of Chinese Question Answering System
    Tang, Zhao-xia
    INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ENGINEERING (ACSE 2014), 2014, : 130 - 133
  • [27] Improving the Robustness of QA Models to Challenge Sets with Variational Question-Answer Pair Generation
    Shinoda, Kazutoshi
    Sugawara, Saku
    Aizawa, Akiko
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2021, : 197 - 214
  • [28] Evaluating Retrieval-Augmented Generation Models for Financial Report Question and Answering
    Iaroshev, Ivan
    Pillai, Ramalingam
    Vaglietti, Leandro
    Hanne, Thomas
    APPLIED SCIENCES-BASEL, 2024, 14 (20):
  • [29] A Combined Approach Using Semantic Role Labelling and Word Sense Disambiguation for Question Generation and Answer Extraction
    Pillai, Lekshmi R.
    Veena, G.
    Gupta, Deepa
    2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRONICS, COMPUTERS AND COMMUNICATIONS (ICAECC), 2018,
  • [30] Evaluating Question generation models using QA systems and Semantic Textual Similarity
    Shaheer, Safwan
    Hossain, Ishmam
    Sarna, Sudipta Nandi
    Mehedi, Md Humaion Kabir
    Rasel, Annajiat Alim
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 431 - 435