AlphaIntellect at SemEval-2024 Task 6: Detection of Hallucinations in Generated Text

Authors
Choudhury, Sohan [1]
Saha, Priyam [2]
Ray, Subharthi [2]
Das, Shankha Shubhra [2]
Das, Dipankar [2]
Affiliations
[1] KIIT, Bhubaneswar, India
[2] Jadavpur Univ, Kolkata, India
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A major challenge for natural language generation (NLG) models is detecting hallucinations, that is, semantically inaccurate outputs. This study investigates a hallucination detection system designed for three distinct NLG tasks: definition modeling, paraphrase generation, and machine translation. The system uses SentenceTransformer models to produce sentence embeddings and similarity scores, and feedforward neural networks for classification. Although the system shows good results on the SemEval-2024 benchmark, there is still room for improvement. Promising directions include multi-task learning methods, strategies for handling out-of-domain data and minimizing bias, and more sophisticated architectures.
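The pipeline described in the abstract (sentence embeddings, a similarity score, then a feedforward classifier) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: the feature construction (cosine similarity plus element-wise absolute difference), the single hidden layer, its size, and the untrained random weights. The toy vectors stand in for embeddings that, in the described system, would come from a SentenceTransformer model.

```python
import math
import random


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def build_features(reference_emb, output_emb):
    """Hypothetical feature vector: similarity score followed by |reference - output|."""
    similarity = cosine_similarity(reference_emb, output_emb)
    differences = [abs(a - b) for a, b in zip(reference_emb, output_emb)]
    return [similarity] + differences


class FeedForwardClassifier:
    """One-hidden-layer network with ReLU and a sigmoid output (weights are untrained)."""

    def __init__(self, n_inputs, n_hidden=4, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def predict_proba(self, features):
        """Return a score in (0, 1); after training it would estimate P(hallucination)."""
        hidden = [max(0.0, sum(w * x for w, x in zip(row, features)) + b)
                  for row, b in zip(self.w1, self.b1)]
        logit = sum(w * h for w, h in zip(self.w2, hidden)) + self.b2
        return 1.0 / (1.0 + math.exp(-logit))


# Toy 3-dimensional "embeddings" standing in for real sentence embeddings.
reference_emb = [0.2, 0.7, 0.1]
output_emb = [0.1, 0.6, 0.4]
features = build_features(reference_emb, output_emb)
classifier = FeedForwardClassifier(n_inputs=len(features))
score = classifier.predict_proba(features)
```

Because the weights are random, `score` is not meaningful until the network has been trained on labeled examples; the sketch only shows how the pieces connect.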
Pages: 952 - 958
Page count: 7