Evaluation of Large Language Models on Code Obfuscation (Student Abstract)

被引:0
|
作者
Swindle, Adrian [1 ]
McNealy, Derrick [2 ]
Krishnan, Giri [3 ]
Ramyaa, Ramyaa [4 ]
机构
[1] St Louis Univ, St Louis, MO 63103 USA
[2] Univ Southern Mississippi, Hattiesburg, MS USA
[3] Univ Calif San Diego, San Diego, CA USA
[4] New Mexico Inst Min & Technol, Socorro, NM USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Obfuscation intends to decrease interpretability of code and identification of code behavior. Large Language Models (LLMs) have been proposed for code synthesis and code analysis. This paper attempts to understand how well LLMs can analyse code and identify code behavior. Specifically, this paper systematically evaluates several LLMs' capabilities to detect obfuscated code and identify behavior across a variety of obfuscation techniques with varying levels of complexity. LLMs proved to be better at detecting obfuscations that changed identifiers, even to misleading ones, compared to obfuscations involving code insertions (unused variables, as well as variables that replace constants with expressions that evaluate to those constants). Hardest to detect were obfuscations that layered multiple simple transformations. For these, only 20-40% of the LLMs' responses were correct. Adding misleading documentation was also successful in misleading LLMs. We provide all our code to replicate results at https://github.com/SwindleA/LLMCodeObfuscation. Overall, our results suggest a gap in LLMs' ability to understand code.
引用
收藏
页码:23664 / 23666
页数:3
相关论文
共 50 条
  • [1] Large Language Models for Code Obfuscation Evaluation of the Obfuscation Capabilities of OpenAI's GPT-3.5 on C Source Code
    Kochberger, Patrick
    Gramberger, Maximilian
    Schrittwieser, Sebastian
    Lawitschka, Caroline
    Weippl, Edgar R.
    PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE ON SECURITY AND CRYPTOGRAPHY, SECRYPT 2023, 2023, : 7 - 19
  • [2] ChatGPT-Generated Code Assignment Detection Using Perplexity of Large Language Models (Student Abstract)
    Xu, Zhenyu
    Xu, Ruoyu
    Sheng, Victor S.
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23688 - 23689
  • [3] Large Language Models as Planning Domain Generators (Student Abstract)
    Oswald, James
    Srinivas, Kavitha
    Kokel, Harsha
    Lee, Junkyu
    Katz, Michael
    Sohrabi, Shirin
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23604 - 23605
  • [4] Automated Assessment of Fidelity and Interpretability: An Evaluation Framework for Large Language Models' Explanations (Student Abstract)
    Kuo, Mu-Tien
    Hsueh, Chih-Chung
    Tsai, Richard Tzong-Han
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23554 - 23555
  • [5] Control code obfuscation by abstract interpretation
    Preda, MD
    Giacobazzi, R
    SEFM 2005: THIRD IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND FORMAL METHODS, PROCEEDINGS, 2005, : 301 - 310
  • [6] Automated Natural Language Explanation of Deep Visual Neurons with Large Models (Student Abstract)
    Zhao, Chenxu
    Qian, Wei
    Shi, Yucheng
    Huai, Mengdi
    Liu, Ninghao
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23712 - 23713
  • [7] Semantic-based code obfuscation by abstract interpretation
    Preda, MD
    Giacobazzi, R
    AUTOMATA, LANGUAGES AND PROGRAMMING, PROCEEDINGS, 2005, 3580 : 1325 - 1336
  • [8] Code Obfuscation Against Abstract Model Checking Attacks
    Bruni, Roberto
    Giacobazzi, Roberto
    Gori, Roberta
    VERIFICATION, MODEL CHECKING, AND ABSTRACT INTERPRETATION (VMCAI 2018), 2018, 10747 : 94 - 115
  • [9] Semantics-based code obfuscation by abstract interpretation
    Dalla Preda, Mila
    Giacobazzi, Roberto
    JOURNAL OF COMPUTER SECURITY, 2009, 17 (06) : 855 - 908
  • [10] Large Language Models Are Not Strong Abstract Reasoners
    Gendron, Gael
    Bao, Qiming
    Witbrock, Michael
    Dobbie, Gillian
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 6270 - 6278