HEBCS: A High-Efficiency Binary Code Search Method

Cited by: 2
Authors
Sun, Xiangjie [1, 2]
Wei, Qiang [2]
Du, Jiang [2]
Wang, Yisen [2]
Affiliations
[1] Zhengzhou Univ, Sch Cyber Sci & Engn, Zhengzhou 450002, Peoples R China
[2] PLA Informat Engn Univ, Sch Cyber Sci & Engn, Zhengzhou 450001, Peoples R China
Keywords
binary code search; binary code similarity; locality-sensitive hash; software analysis
DOI
10.3390/electronics12163464
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Binary code search finds code in a code database that is similar to a given piece of code. It is widely applied in scenarios such as vulnerability queries and code defect analysis. Many existing methods employ advanced machine learning models for similarity analysis, but their lack of interpretability and their low efficiency on large-scale function sets remain challenges. To address these issues, we propose a high-efficiency binary code search method called HEBCS. It uses an interpretable approach to extract function-level features and transforms each feature into a locality-sensitive hash representation. The hashes of these features are then combined to form the hash of the function. By leveraging the pigeonhole principle, HEBCS stores and retrieves functions efficiently, maintaining high execution efficiency even on large-scale data. We compare HEBCS with a classic method and a state-of-the-art method and show that HEBCS achieves significantly higher search efficiency while maintaining comparable accuracy, recall, and F1-score. In real-world vulnerability query applications, HEBCS demonstrated promising results. Its effectiveness in large-scale binary function searches suggests significant potential for practical applications.
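The abstract does not give implementation details, so the following is only a minimal sketch of how a pigeonhole-based lookup over function hashes might work, not the authors' implementation. It assumes a 64-bit function hash split into 4 segments; `HASH_BITS`, `NUM_SEGMENTS`, `PigeonholeIndex`, and the function names are all hypothetical. The idea it illustrates: if two hashes differ in at most r bits and each hash is split into more than r segments, at least one segment must match exactly, so candidates can be fetched by exact segment lookup and then verified by Hamming distance.

```python
from collections import defaultdict

# Assumed parameters for illustration only; the abstract does not specify them.
HASH_BITS = 64
NUM_SEGMENTS = 4  # each segment is HASH_BITS // NUM_SEGMENTS = 16 bits wide


def hamming(a: int, b: int) -> int:
    """Number of bit positions in which two hashes differ."""
    return bin(a ^ b).count("1")


def segments(h: int) -> list[int]:
    """Split a hash into NUM_SEGMENTS equal-width segments, lowest bits first."""
    width = HASH_BITS // NUM_SEGMENTS
    mask = (1 << width) - 1
    return [(h >> (i * width)) & mask for i in range(NUM_SEGMENTS)]


class PigeonholeIndex:
    """Hypothetical index: one exact-match table per hash segment."""

    def __init__(self) -> None:
        self.tables = [defaultdict(set) for _ in range(NUM_SEGMENTS)]
        self.hashes: dict[str, int] = {}

    def add(self, name: str, h: int) -> None:
        """Store a function hash under each of its segment values."""
        self.hashes[name] = h
        for table, seg in zip(self.tables, segments(h)):
            table[seg].add(name)

    def query(self, h: int, max_distance: int = NUM_SEGMENTS - 1) -> list[str]:
        """Return stored names whose hash is within max_distance bits of h."""
        if max_distance >= NUM_SEGMENTS:
            # The pigeonhole guarantee only holds for fewer differing bits
            # than segments.
            raise ValueError("max_distance must be smaller than NUM_SEGMENTS")
        candidates: set[str] = set()
        for table, seg in zip(self.tables, segments(h)):
            candidates |= table.get(seg, set())
        # Verify candidates, since one shared segment does not imply closeness.
        return sorted(
            name for name in candidates
            if hamming(self.hashes[name], h) <= max_distance
        )


index = PigeonholeIndex()
base = 0x0123456789ABCDEF
index.add("func_a", base)
index.add("func_b", base ^ (1 | 1 << 16 | 1 << 32))  # 3 bits differ
index.add("func_c", base ^ ((1 << HASH_BITS) - 1))   # all 64 bits differ
matches = index.query(base)
```

In this sketch a query touches only the buckets that share a segment with the query hash rather than scanning every stored function, which is the kind of saving the pigeonhole principle makes possible on large collections.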
Pages: 21