Evaluating Question generation models using QA systems and Semantic Textual Similarity

Cited by: 1
Authors
Shaheer, Safwan [1]
Hossain, Ishmam [1]
Sarna, Sudipta Nandi [1]
Mehedi, Md Humaion Kabir [1]
Rasel, Annajiat Alim [1]
Affiliations
[1] Brac Univ, Dept Comp Sci & Engn CSE, Sch Data & Sci SDS, 66 Mohakhali, Dhaka 1212, Bangladesh
Keywords
Question Generation; Semantic Textual Similarity; Question Answering; BLEU;
DOI
10.1109/CCWC57344.2023.10099244
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Generating questions from conversational context is a difficult problem. A widely used technique for generating quality questions with fine-tuned models relies on a suitable answer and the context, usually the passage. In conversational settings, however, the generated questions are not of the highest quality because they lack contextual elements, mainly because coreferences to entities are left unresolved. Furthermore, most evaluation techniques for generated questions do not use powerful question-answering systems to judge the answerability of the questions. The most prevalent metric for judging machine-generated text against the human gold standard, BLEU, does not factor in whether a question-answering system would be able to answer the question; it focuses mostly on the number of matching substrings (n-grams). Various question generation models following a generalized encoder-decoder architecture were evaluated using semantic textual similarity for both the generated questions and the generated answers. Although a larger parameter count usually leads to better performance, our experiment showed that this is not always the case, at least when a large amount of context is missing.
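The abstract's point about n-gram matching can be illustrated with a small sketch. This is not the authors' code: the helper names and the three example questions are hypothetical, and the function computes only the clipped n-gram precision that BLEU is built on, not the full metric (no brevity penalty, no averaging over n-gram orders).

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against one reference.

    Each candidate n-gram counts as matched at most as many times
    as it occurs in the reference.
    """
    cand = Counter(ngrams(candidate.lower().split(), n))
    ref = Counter(ngrams(reference.lower().split(), n))
    total = sum(cand.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref[gram]) for gram, count in cand.items())
    return matched / total


# Hypothetical examples: a gold question, a paraphrase that asks the
# same thing, and a near-copy that asks something different.
reference = "who wrote the novel"
paraphrase = "which author penned the book"
distractor = "who burned the novel"

paraphrase_p1 = clipped_precision(paraphrase, reference, 1)
distractor_p1 = clipped_precision(distractor, reference, 1)

print(f"paraphrase unigram precision: {paraphrase_p1:.2f}")
print(f"distractor unigram precision: {distractor_p1:.2f}")
```

In this toy case the paraphrase shares only one word with the reference and scores lower than the near-copy, even though only the paraphrase would lead a question-answering system to the same answer. That gap is the kind of mismatch that answerability-based and semantic-similarity-based evaluation aims to expose.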
Pages: 431-435
Page count: 5