Evaluating Question generation models using QA systems and Semantic Textual Similarity

被引:1
|
作者
Shaheer, Safwan [1 ]
Hossain, Ishmam [1 ]
Sarna, Sudipta Nandi [1 ]
Mehedi, Md Humaion Kabir [1 ]
Rasel, Annajiat Alim [1 ]
机构
[1] Brac Univ, Dept Comp Sci & Engn CSE, Sch Data & Sci SDS, 66 Mohakhali, Dhaka 1212, Bangladesh
关键词
Question Generation; Semantic Textual Similarity; Question Answering; BLEU;
D O I
10.1109/CCWC57344.2023.10099244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question generation based on conversational context is a difficult problem to solve. A widely used technique for generating quality questions using fine-tuned models relies on a suitable answer and the context, usually the passage. But when it comes to conversational settings, the questions generated are not of the highest quality as they lack the contextual element in the question, especially due to the lack of co-reference resolution of the entity. Furthermore, in most of the evaluation techniques for generating questions, there seems to be a lack of utilizing powerful question-answering systems to judge the answerability of the questions generated. The most prevalent metric used for judging machine-generated text against the human gold standard, BLUE, unfortunately doesn't factor in whether a question answering system would be able to answer the question, but instead focuses mostly on the number of substrings that match against each other. Various question generation models following a generalized encoder-decoder architecture were evaluated using semantic textual similarity for both the generated questions and the generated answers. Although higher parameters in a model usually lend to better performance, our experiment displayed that such is not always the case, at least when there is a massive amount of context missing.
引用
收藏
页码:431 / 435
页数:5
相关论文
共 50 条
  • [31] Using Ontology for Measuring Semantic Similarity for Question Answering System
    Ramprasath, Muthukrishnan
    Hariharan, Shanmugasundaram
    2012 IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2012, : 218 - 223
  • [32] Taming large language models to implement diagnosis and evaluating the generation of LLMs at the semantic similarity level in acupuncture and moxibustion
    Li, Shusheng
    Tan, Wenjun
    Zhang, Changshuai
    Li, Jiale
    Ren, Haiyan
    Guo, Yanliang
    Jia, Jing
    Liu, Yangyang
    Pan, Xingfang
    Guo, Jing
    Meng, Wei
    He, Zhaoshui
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [33] Semantic Textual Similarity on Brazilian Portuguese: An approach based on language-mixture models
    Silva, A.
    Lozkins, A.
    Bertoldi, L. R.
    Rigo, S.
    Bure, V. M.
    VESTNIK SANKT-PETERBURGSKOGO UNIVERSITETA SERIYA 10 PRIKLADNAYA MATEMATIKA INFORMATIKA PROTSESSY UPRAVLENIYA, 2019, 15 (02): : 235 - 244
  • [34] Measurement of Semantic Textual Similarity in Clinical Texts: Comparison of Transformer-Based Models
    Yang, Xi
    He, Xing
    Zhang, Hansi
    Ma, Yinghan
    Bian, Jiang
    Wu, Yonghui
    JMIR MEDICAL INFORMATICS, 2020, 8 (11)
  • [35] SimLex-999: Evaluating Semantic Models With (Genuine) Similarity Estimation
    Hill, Felix
    Reichart, Roi
    Korhonen, Anna
    COMPUTATIONAL LINGUISTICS, 2015, 41 (04) : 665 - 695
  • [36] Portuguese Language Models and Word Embeddings: Evaluating on Semantic Similarity Tasks
    Rodrigues, Ruan Chaves
    Rodrigues, Jessica
    Quinta de Castro, Pedro Vitor
    Felipe da Silva, Nadia Felix
    Soares, Anderson
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 239 - 248
  • [37] Product Recommendations Using Textual Similarity Based Learning Models
    Shrivastava, Rahul
    Sisodia, Dilip Singh
    2019 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI - 2019), 2019,
  • [38] Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression
    Majumder, Goutam
    Pakray, Partha
    Das, Ranjita
    Pinto, David
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7322 - 7349
  • [39] Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression
    Goutam Majumder
    Partha Pakray
    Ranjita Das
    David Pinto
    Applied Intelligence, 2021, 51 : 7322 - 7349
  • [40] Semantic textual similarity for modern standard and dialectal Arabic using transfer learning
    Sulaiman, Mansour Al
    Moussa, Abdullah M.
    Abdou, Sherif
    Elgibreen, Hebah
    Faisal, Mohammed
    Rashwan, Mohsen
    PLOS ONE, 2022, 17 (08):