ConFunc: Enhanced Binary Function-Level Representation through Contrastive Learning

被引:0
|
作者
Li, Longfei [1 ]
Yin, Xiaokang [2 ]
Li, Xiao [2 ]
Zhu, Xiaoya [2 ]
Liu, Shengli [2 ]
机构
[1] Zhengzhou Univ, Zhengzhou, Peoples R China
[2] Informat Engn Univ, Zhengzhou, Peoples R China
关键词
binary code similarity detection; machine learning; contrastive learning; function embeddings;
D O I
10.1109/TrustCom60117.2023.00169
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Binary code similarity detection (BCSD) has numerous applications, including malware detection, vulnerability search, plagiarism detection, and patch identification. Recent studies have demonstrated that with the rapid progress of machine learning (ML) techniques, various BCSD approaches based on machine learning have exhibited stronger performance than traditional methods. However, current ML-based BCSD approaches tend to ignore the issue of training samples, and most ML-based BCSD approaches are based on supervised learning, which is suffered from the labelling difficulties. To mitigate these issues, we propose ConFunc: a function-level binary code similarity detection framework based on contrastive learning. Performance evaluation shows that ConFunc enhances the Mean Reciprocal Rank (MRR) and Recall rates (Recall@1) of baseline models by fully harnessing the potential of the data. Additionally, ConFunc demonstrates stronger performance in scenarios with scarce data, achieving the baseline model's performance on the entire dataset using only 10% of the complete dataset. In real-world patch identification and vulnerability search tasks, ConFunc consistently outperforms other baseline models in MRR and Recall@10.
引用
收藏
页码:1241 / 1248
页数:8
相关论文
共 50 条
  • [1] ATM Cell Scheduling and learning by function-level evolvable hardware
    Wang, Qingchun
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1686 - 1688
  • [2] Clustering Enhanced Multiplex Graph Contrastive Representation Learning
    Yuan, Ruiwen
    Tang, Yongqiang
    Wu, Yajing
    Zhang, Wensheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1341 - 1355
  • [3] Function-Level Code Obfuscation Detection Through Self-Attention-Guided Multi-Representation Fusion
    Tian, Zhenzhou
    He, Ruikang
    Zhao, Hongliang
    Chen, Lingwei
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2024, 34 (04) : 651 - 673
  • [4] Learning Robust Representation Through Graph Adversarial Contrastive Learning
    Guo, Jiayan
    Li, Shangyang
    Zhao, Yue
    Zhang, Yan
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2022, 13245 LNCS : 682 - 697
  • [5] Learning Robust Representation Through Graph Adversarial Contrastive Learning
    Guo, Jiayan
    Li, Shangyang
    Zhao, Yue
    Zhang, Yan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 682 - 697
  • [6] Representation-enhanced APT Detection Using Contrastive Learning
    Zhou, Fengxi
    Chang, Baoming
    Wen, Yu
    Meng, Dan
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 1 - 9
  • [7] Diffusion Model-Enhanced Contrastive Learning for Graph Representation
    Dai, Qi
    Song, Yumeng
    Gu, Yu
    Li, Fangfang
    Li, Xiaohua
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT VI, DASFAA 2024, 2024, 14855 : 332 - 341
  • [8] Function-level Vulnerability Detection Through Fusing Multi-Modal Knowledge
    Ni, Chao
    Guo, Xinrong
    Zhu, Yan
    Xu, Xiaodan
    Yang, Xiaohu
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 1911 - 1918
  • [9] Liquidity takers behavior representation through a contrastive learning approach
    Ruan, Ruihua
    Bacry, Emmanuel
    Muzy, Jean-Francois
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON AI IN FINANCE, ICAIF 2023, 2023, : 601 - 609
  • [10] Community-Enhanced Contrastive Siamese Networks for Graph Representation Learning
    Li, Yafang
    Wang, Wenbo
    Ma, Guixiang
    Zu, Baokai
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2023, 2023, 14117 : 300 - 314