Evaluating Question generation models using QA systems and Semantic Textual Similarity

Cited by: 1
Authors
Shaheer, Safwan [1]
Hossain, Ishmam [1]
Sarna, Sudipta Nandi [1]
Mehedi, Md Humaion Kabir [1]
Rasel, Annajiat Alim [1]
Affiliation
[1] Brac Univ, Dept Comp Sci & Engn CSE, Sch Data & Sci SDS, 66 Mohakhali, Dhaka 1212, Bangladesh
Keywords
Question Generation; Semantic Textual Similarity; Question Answering; BLEU
DOI
10.1109/CCWC57344.2023.10099244
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Question generation based on conversational context is a difficult problem. A widely used technique for generating quality questions with fine-tuned models relies on a suitable answer and the context, usually the passage. In conversational settings, however, the generated questions are of lower quality because they lack the contextual element, chiefly owing to the absence of co-reference resolution for the entity. Furthermore, most evaluation techniques for generated questions do not use powerful question-answering systems to judge whether the generated questions are answerable. The most prevalent metric for judging machine-generated text against the human gold standard, BLEU, does not factor in whether a question-answering system would be able to answer the question; it focuses mostly on the number of matching substrings (n-grams). Various question generation models following a generalized encoder-decoder architecture were evaluated using semantic textual similarity for both the generated questions and the generated answers. Although a larger parameter count usually leads to better performance, our experiment showed that this is not always the case, at least when a massive amount of context is missing.
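The abstract's point about BLEU can be illustrated with a minimal sketch of clipped n-gram precision, the overlap statistic that BLEU is built on. This is not the authors' evaluation code; the function names and the three example questions are hypothetical, and the full BLEU score additionally combines several n-gram orders with a brevity penalty.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate against one reference.

    Each candidate n-gram is credited at most as many times as it
    occurs in the reference (the "clipping" step used by BLEU).
    """
    cand = Counter(ngrams(candidate.lower().split(), n))
    ref = Counter(ngrams(reference.lower().split(), n))
    total = sum(cand.values())
    if total == 0:
        return 0.0
    clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
    return clipped / total


# Hypothetical example questions (not taken from the paper).
reference = "where was marie curie born"
paraphrase = "in which city was she born"   # answerable given context
scrambled = "born curie marie was where"    # same words, not a question

for name, text in [("paraphrase", paraphrase), ("scrambled", scrambled)]:
    p1 = modified_precision(text, reference, 1)
    p2 = modified_precision(text, reference, 2)
    print(f"{name}: unigram={p1:.2f} bigram={p2:.2f}")
```

In this toy case the scrambled word list gets a perfect unigram score, while the paraphrase that replaces the entity with a pronoun scores low on both orders, even though a question-answering system with access to the conversation could plausibly answer it. Overlap counts alone say nothing about answerability, which is the gap the abstract describes.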
Pages: 431-435
Page count: 5