Who Wrote this Code? Watermarking for Code Generation

被引:0
|
作者
Lee, Taehyun [1 ]
Hong, Seokhee [1 ,3 ]
Ahn, Jaewoo [1 ]
Hong, Ilgee [1 ,4 ]
Lee, Hwaran [2 ]
Yun, Sangdoo [1 ,2 ]
Shin, Jamin [2 ]
Kim, Gunhee [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] NAVER AI Lab, Grenoble, France
[3] LG AI Res, Seoul, South Korea
[4] Georgia Inst Technol, Atlanta, GA USA
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since the remarkable generation performance of large language models raised ethical and legal concerns, approaches to detect machine-generated text by embedding watermarks are being developed. However, we discover that the existing works fail to function appropriately in code generation tasks due to the task's nature of having low entropy. Extending a logit-modifying watermark method, we propose Selective WatErmarking via Entropy Thresholding ( SWEET), which enhances detection ability and mitigates code quality degeneration by removing low-entropy segments at generating and detecting watermarks. Our experiments show that SWEET significantly improves code quality preservation while outperforming all baselines, including post-hoc detection methods, in detecting machine-generated code text. Our code is available in https://github.com/hongcheki/sweet-watermark.
引用
收藏
页码:4890 / 4911
页数:22
相关论文
共 50 条
  • [41] QR Code Watermarking Algorithm based on Wavelet Transform
    Panyavaraporn, Jantana
    Horkaew, Paramate
    Wongtrairat, Wannaree
    2013 13TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT): COMMUNICATION AND INFORMATION TECHNOLOGY FOR NEW LIFE STYLE BEYOND THE CLOUD, 2013, : 791 - 796
  • [42] Digital watermarking:: Spreading code versus channel coding
    Samee, Muhammad Kashif
    Goetze, Juergen
    Ruan, Shanq-Jang
    Pai, Yu-Ting
    2008 3RD INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS, CONTROL AND SIGNAL PROCESSING, VOLS 1-3, 2008, : 1409 - +
  • [43] A blind audio watermarking algorithm based on convolutional code
    Xu Da-Wen
    Wang Rang-Ding
    Bao Ji-Long
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 2551 - +
  • [44] Digital watermarking with improved SMS applied for QR code
    Pan, Jeng-Shyang
    Sun, Xiao-Xue
    Chu, Shu-Chuan
    Abraham, Ajith
    Yan, Bin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 97
  • [45] Digital audio watermarking based on code division multiplexing
    Jia, Jun
    Wang, Shuo-Zhong
    Zhang, Xin-Peng
    Shanghai Jiaotong Daxue Xuebao/Journal of Shanghai Jiaotong University, 2004, 38 (12): : 2073 - 2077
  • [46] Integration of Watermarking and QR Code for Authentication of Data Center
    Pramkeaw, Patiyuth
    Ganokratanaa, Thittaporn
    Phatchuay, Siriruang
    2016 12TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2016, : 669 - 672
  • [47] Application of LDPC Code Decoding Algorithm in Digital Watermarking
    Wang, Zhongxun
    Bao, Zhankai
    Meng, Lingzeng
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (05) : 393 - 402
  • [48] DWT and QR Code Based Watermarking for Document DRM
    Cardamone, Nicolo
    d'Amore, Fabrizio
    DIGITAL FORENSICS AND WATERMARKING, IWDW 2018, 2019, 11378 : 137 - 150
  • [49] An Improved Digital Watermarking Technology Based on QR Code
    Zhang, Weijun
    Meng, Xuetian
    PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1004 - 1007
  • [50] A New Fragile Watermarking based on Distributed Hamming Code
    Rasouli, Faeze
    Taheri, Mohammad
    2021 26TH INTERNATIONAL COMPUTER CONFERENCE, COMPUTER SOCIETY OF IRAN (CSICC), 2021,