Evaluating the performance of multilingual models in answer extraction and question generation

被引:0
|
作者
Moreno-Cediel, Antonio [1 ]
del-Hoyo-Gabaldon, Jesus-Angel [1 ]
Garcia-Lopez, Eva [1 ]
Garcia-Cabot, Antonio [1 ]
de-Fitero-Dominguez, David [1 ]
机构
[1] Univ Alcala, Dept Ciencias Comp, Alcala De Henares 28805, Spain
来源
SCIENTIFIC REPORTS | 2024年 / 14卷 / 01期
关键词
D O I
10.1038/s41598-024-66472-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multiple-choice test generation is one of the most complex NLP problems, especially in languages other than English, where there is a lack of prior research. After a review of the literature, it has been verified that some methods like the usage of rule-based systems or primitive neural networks have led to the application of a recent architecture, the Transformer architecture, in the tasks of Answer Extraction (AE) and Question Generation (QG). Thereby, this study is centred in searching and developing better models for the AE and QG tasks in Spanish, using an answer-aware methodology. For this purpose, three multilingual models (mT5-base, mT0-base and BLOOMZ-560 M) have been fine-tuned using three different datasets: a translation to Spanish of the SQuAD dataset; SQAC, which is a dataset in Spanish; and their union (SQuAD + SQAC), which shows slightly better results. Regarding the models, the performance of mT5-base has been compared with that found in two newer models, mT0-base and BLOOMZ-560 M. These models were fine-tuned for multiple tasks in literature, including AE and QG, but, in general, the best results are obtained from the mT5 models trained in our study with the SQuAD + SQAC dataset. Nonetheless, some other good results are obtained from mT5 models trained only with the SQAC dataset. For their evaluation, the widely used BLEU1-4, METEOR and ROUGE-L metrics have been obtained, where mT5 outperforms some similar research works. Besides, CIDEr, SARI, GLEU, WER and the cosine similarity metrics have been calculated to present a benchmark within the AE and QG problems for future work.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] A Practical Toolkit for Multilingual Question and Answer Generation
    Ushio, Asahi
    Alva-Manchego, Fernando
    Camacho-Collados, Jose
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-DEMO 2023, VOL 3, 2023, : 86 - 94
  • [2] Probabilistic Models for Answer-Ranking in Multilingual Question-Answering
    Ko, Jeongwoo
    Si, Luo
    Nyberg, Eric
    Mitamura, Teruko
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2010, 28 (03)
  • [3] Evaluating Rewards for Question Generation Models
    Hosking, Tom
    Riedel, Sebastian
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2278 - 2283
  • [4] Sentence extraction with topic modeling for question–answer pair generation
    Chung-Hsien Wu
    Chao-Hong Liu
    Po-Hsun Su
    Soft Computing, 2015, 19 : 39 - 46
  • [5] Sentence extraction with topic modeling for question-answer pair generation
    Wu, Chung-Hsien
    Liu, Chao-Hong
    Su, Po-Hsun
    SOFT COMPUTING, 2015, 19 (01) : 39 - 46
  • [6] Models: do they answer the question?
    Saini, Sameer D.
    Rubenstein, Joel H.
    GASTROINTESTINAL ENDOSCOPY, 2008, 68 (05) : 937 - 939
  • [7] Neural Models for Key Phrase Extraction and Question Generation
    Subramanian, Sandeep
    Wang, Tong
    Yuan, Xingdi
    Zhang, Saizheng
    Bengio, Yoshua
    Trischler, Adam
    MACHINE READING FOR QUESTION ANSWERING, 2018, : 78 - 88
  • [8] Question recommendation and answer extraction in question answering community
    Xianfeng, Yang
    Pengfei, Liu
    International Journal of Database Theory and Application, 2016, 9 (01): : 35 - 44
  • [9] Evaluating Multilingual Question Answering Systems at CLEF
    Forner, Pamela
    Giampiccolo, Danilo
    Magnini, Bernardo
    Penas, Anselmo
    Rodrigo, Alvaro
    Sutcliffe, Richard
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2774 - 2781
  • [10] Multilingual question-answer system applied to conversational agents
    Siblini, Wissam
    Pasqual, Charlotte
    Lavielle, Axel
    Cauchois, Cyril
    Extraction et Gestion des Connaissances, EGC 2020, 2020, : 333 - 340