AlphaIntellect at SemEval-2024 Task 6: Detection of Hallucinations in Generated Text

Cited by: 0
Authors
Choudhury, Sohan [1]
Saha, Priyam [2]
Ray, Subharthi [2]
Das, Shankha Shubhra [2]
Das, Dipankar [2]
Affiliations
[1] KIIT, Bhubaneswar, India
[2] Jadavpur Univ, Kolkata, India
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
One major issue in natural language generation (NLG) models is detecting hallucinations (semantically inaccurate outputs). This study investigates a hallucination detection system designed for three distinct NLG tasks: definition modeling, paraphrase generation, and machine translation. The system uses SentenceTransformer models to produce sentence embeddings and similarity scores, and feedforward neural networks for classification. Although the system shows good results on the SemEval-2024 benchmark, there is still room for improvement. Promising directions for improving performance include multi-task learning methods, strategies for handling out-of-domain data and minimizing bias, and more sophisticated architectures.
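The pipeline the abstract describes (sentence embeddings, a similarity score, and a feedforward classifier) can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_embed` is a hypothetical hashed bag-of-words stand-in for a SentenceTransformer encoder, the feature layout (both embeddings plus their cosine similarity) is an assumption, and the network weights are random and untrained, so the output probability carries no meaning beyond showing the data flow.

```python
import hashlib
import math
import random


def toy_embed(sentence, dim=16):
    """Hypothetical stand-in for a sentence encoder: hashed bag-of-words, L2-normalized."""
    vec = [0.0] * dim
    for token in sentence.lower().split():
        digest = hashlib.md5(token.encode("utf-8")).digest()
        vec[digest[0] % dim] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def features(hypothesis, reference):
    """Assumed feature layout: both embeddings followed by their similarity score."""
    h, r = toy_embed(hypothesis), toy_embed(reference)
    return h + r + [cosine(h, r)]


class FeedForward:
    """One hidden ReLU layer and a sigmoid output; weights are random, not trained."""

    def __init__(self, n_in, n_hidden=8, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def predict_proba(self, x):
        hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x)) + b)
                  for row, b in zip(self.w1, self.b1)]
        logit = sum(w * h for w, h in zip(self.w2, hidden)) + self.b2
        return 1.0 / (1.0 + math.exp(-logit))


# Illustrative definition-modeling pair (made-up example sentences).
x = features("a small domesticated feline",
             "a cat is a small domesticated animal")
model = FeedForward(n_in=len(x))
p = model.predict_proba(x)  # score from the untrained network
```

In a real system the stand-in encoder would be replaced by a pretrained sentence encoder and the classifier would be trained on labeled hallucination data; the sketch only shows how the three components fit together.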
Pages: 952-958
Number of pages: 7