HEBCS: A High-Efficiency Binary Code Search Method

Cited by: 2
Authors
Sun, Xiangjie [1, 2]
Wei, Qiang [2]
Du, Jiang [2]
Wang, Yisen [2]
Affiliations
[1] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450002, Peoples R China
[2] PLA Informat Engn Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
Keywords
binary code search; binary code similarity; locality-sensitive hash; software analysis
DOI
10.3390/electronics12163464
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Binary code search is the task of finding code in a code database that is similar to a given piece of code. It is widely used in scenarios such as vulnerability queries and code defect analysis. Many existing methods employ advanced machine learning models for similarity analysis, but their lack of interpretability and low efficiency on large-scale function sets remain challenges. To address these issues, we propose a high-efficiency binary code search method called HEBCS. It uses an interpretable approach to extract function-level features and transforms each feature into a locality-sensitive hash representation. The hashes of these features are then combined to form the hash of the function. By leveraging the pigeonhole principle, HEBCS stores and retrieves functions efficiently, keeping execution efficiency high even on large-scale data. We compare HEBCS with a classic method and a state-of-the-art method and show that HEBCS achieves significantly higher search efficiency while maintaining comparable accuracy, recall, and F1-score. In real-world vulnerability query applications, HEBCS demonstrated promising results. Its effectiveness in large-scale binary function searches suggests significant potential for practical applications.
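The abstract does not spell out how the pigeonhole principle is applied, so the following is only a minimal sketch of one common way to use it for hash retrieval, not the HEBCS implementation. It assumes each function is already represented by a 64-bit hash and that two functions count as similar when their hashes differ in at most 3 bits. Splitting each hash into 4 segments then guarantees that any two hashes within that distance agree exactly on at least one segment, so a lookup only needs to inspect the buckets for the query's own segments. The names HASH_BITS, MAX_DISTANCE, SegmentIndex, and the sample hash values are all hypothetical.

```python
from collections import defaultdict

# Assumed parameters for illustration; the paper's actual values are not given here.
HASH_BITS = 64
MAX_DISTANCE = 3                      # maximum Hamming distance treated as "similar"
NUM_SEGMENTS = MAX_DISTANCE + 1       # pigeonhole: r differing bits cannot touch r+1 segments
SEGMENT_BITS = HASH_BITS // NUM_SEGMENTS


def split_segments(h):
    """Split a 64-bit hash into NUM_SEGMENTS equal-width integer segments."""
    mask = (1 << SEGMENT_BITS) - 1
    return [(h >> (i * SEGMENT_BITS)) & mask for i in range(NUM_SEGMENTS)]


def hamming(a, b):
    """Number of bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


class SegmentIndex:
    """Hypothetical index: one exact-match table per segment position."""

    def __init__(self):
        self.tables = [defaultdict(set) for _ in range(NUM_SEGMENTS)]
        self.hashes = {}

    def add(self, name, h):
        self.hashes[name] = h
        for table, seg in zip(self.tables, split_segments(h)):
            table[seg].add(name)

    def query(self, h):
        # Gather candidates that share at least one segment with the query,
        # then verify each candidate with the full Hamming distance.
        candidates = set()
        for table, seg in zip(self.tables, split_segments(h)):
            candidates |= table.get(seg, set())
        return sorted(
            n for n in candidates if hamming(self.hashes[n], h) <= MAX_DISTANCE
        )


index = SegmentIndex()
index.add("func_a", 0x0123456789ABCDEF)
index.add("func_b", 0x0123456789ABCDEF ^ 0b111)   # differs from func_a in 3 bits
index.add("func_c", 0xFFFF0000FFFF0000)           # unrelated hash
matches = index.query(0x0123456789ABCDEF)
print(matches)
```

Under these assumptions, a query touches only a handful of buckets instead of comparing against every stored hash, which is the kind of saving the abstract attributes to the pigeonhole principle. The final Hamming check is needed because sharing one segment is necessary for similarity but not sufficient.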
Pages: 21