SeqXGPT: Sentence-Level AI-Generated Text Detection

被引:0
|
作者
Wang, Pengyu
Li, Linyang
Ren, Ke
Jiang, Botian
Zhang, Dong
Qiu, Xipeng [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Widely applied large language models (LLMs) can generate human-like content, raising concerns about the abuse of LLMs. Therefore, it is important to build strong AI-generated text (AIGT) detectors. Current works only consider document-level AIGT detection, therefore, in this paper, we first introduce a sentence-level detection challenge by synthesizing a dataset that contains documents that are polished with LLMs, that is, the documents contain sentences written by humans and sentences modified by LLMs. Then we propose Sequence X (Check) GPT, a novel method that utilizes log probability lists from white-box LLMs as features for sentence-level AIGT detection. These features are composed like waves in speech processing and cannot be studied by LLMs. Therefore, we build SeqXGPT based on convolution and self-attention networks. We test it in both sentence and document-level detection challenges. Experimental results show that previous methods struggle in solving sentence-level AIGT detection, while our method not only significantly surpasses baseline methods in both sentence and document-level detection challenges but also exhibits strong generalization capabilities.(1)
引用
收藏
页码:1144 / 1156
页数:13
相关论文
共 50 条
  • [11] Sentence-level heuristic tree search for long text generation
    Chen, Zheng
    Liu, Zhejun
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (02) : 3153 - 3167
  • [12] Navigating the Landscape of AI-Generated Text Detection: Issues and Solutions for Upholding Academic Integrity
    Gupta, Varun
    Gupta, Chetna
    COMPUTER, 2024, 57 (11) : 118 - 123
  • [13] The Imitation Game revisited: A comprehensive survey on recent advances in AI-generated text detection
    Yang, Zhiwei
    Feng, Zhengjie
    Huo, Rongxin
    Lin, Huiru
    Zheng, Hanghan
    Nie, Ruichi
    Chen, Hongrui
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [14] Text Semantic Communication Systems with Sentence-Level Semantic Fidelity
    Tang, Bing
    Li, Qiang
    Huang, Likun
    Yin, Yiran
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [15] Sentence-level heuristic tree search for long text generation
    Zheng Chen
    Zhejun Liu
    Complex & Intelligent Systems, 2024, 10 : 3153 - 3167
  • [16] Towards Detection of AI-Generated Texts and Misinformation
    Najee-Ullah, Ahmad
    Landeros, Luis
    Balytskyi, Yaroslav
    Chang, Sang-Yoon
    SOCIO-TECHNICAL ASPECTS IN SECURITY, STAST 2021, 2022, 13176 : 194 - 205
  • [17] AI-generated or AI touch-up? Identifying AI contribution in text data
    Hashemi, Ahmad
    Shi, Wei
    Corriveau, Jean-Pierre
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [18] Parentheses insertion based sentence-level text adversarial attack
    Li, Ang
    Yang, Xinghao
    Liu, Baodi
    Chen, Honglong
    Tao, Dapeng
    Liu, Weifeng
    MULTIMEDIA SYSTEMS, 2025, 31 (02)
  • [19] Detection of AI-Generated Emails - A Case Study
    Gryka, Pawel
    Gradon, Kacper
    Kozlowski, Marek
    Kutyla, Milosz
    Janicki, Artur
    19TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY, ARES 2024, 2024,
  • [20] A Modified Fuzzy Relational Clustering Approach for Sentence-Level Text
    Al-Amin, Sikder Tahsin
    Hasan, Mahade
    Hashem, M. M. A.
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2015, : 566 - 570