Topology-Aware Hashing for Effective Control Flow Graph Similarity Analysis

被引:4
|
作者
Li, Yuping [1 ]
Jang, Jiyong [2 ]
Ou, Xinming [3 ]
机构
[1] Pinterest, San Francisco, CA 94107 USA
[2] IBM Res, Yorktown Hts, NY USA
[3] Univ S Florida, Tampa, FL 33620 USA
基金
美国国家科学基金会;
关键词
CFG comparison; Binary similarity; Malware analysis;
D O I
10.1007/978-3-030-37228-6_14
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Control Flow Graph (CFG) similarity analysis is an essential technique for a variety of security analysis tasks, including malware detection and malware clustering. Even though various algorithms have been developed, existing CFG similarity analysis methods still suffer from limited efficiency, accuracy, and usability. In this paper, we propose a novel fuzzy hashing scheme called topology-aware hashing (TAH) for effective and efficient CFG similarity analysis. Given the CFGs constructed from program binaries, we extract blended n-gram graphical features of the CFGs, encode the graphical features into numeric vectors (called graph signatures), and then measure the graph similarity by comparing the graph signatures. We further employ a fuzzy hashing technique to convert the numeric graph signatures into smaller fixed-size fuzzy hash signatures for efficient similarity calculation. Our comprehensive evaluation demonstrates that TAH is more effective and efficient compared to existing CFG comparison techniques. To demonstrate the applicability of TAH to real-world security analysis tasks, we develop a binary similarity analysis tool based on TAH, and show that it outperforms existing similarity analysis tools while conducting malware clustering.
引用
收藏
页码:278 / 298
页数:21
相关论文
共 50 条
  • [21] TOPOLOGY-AWARE JOINT GRAPH FILTER AND EDGE WEIGHT IDENTIFICATION FOR NETWORK PROCESSES
    Natali, Alberto
    Coutino, Mario
    Leus, Geert
    PROCEEDINGS OF THE 2020 IEEE 30TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2020,
  • [22] Brain multigraph prediction using topology-aware adversarial graph neural network
    Bessadok, Alaa
    Mahjoub, Mohamed Ali
    Rekik, Islem
    MEDICAL IMAGE ANALYSIS, 2021, 72
  • [23] TaReT: Temporal knowledge graph reasoning based on topology-aware dynamic relation graph and temporal fusion
    Ma, Jiangtao
    Li, Kunlin
    Zhang, Fan
    Wang, Yanjun
    Luo, Xiangyang
    Li, Chenliang
    Qiao, Yaqiong
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (06)
  • [24] MTS-Net: An enriched topology-aware architecture for molecular graph representation learning
    Yang, Fan
    Zhou, Qing
    Su, Renbin
    Xiong, Weihong
    Journal of Intelligent and Fuzzy Systems, 2024, 47 (1-2): : 99 - 110
  • [25] The Interaction Graph Auto-encoder Network Based on Topology-aware for Transferable Recommendation
    Yu, Ruiyun
    Yang, Kang
    Guo, Bingyang
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 2403 - 2412
  • [26] Handling Missing Sensors in Topology-Aware IoT Applications with Gated Graph Neural Network
    Liu, Shengzhong
    Yao, Shuochao
    Huang, Yifei
    Liu, Dongxin
    Shao, Huajie
    Zhao, Yiran
    Li, Jinyang
    Wang, Tianshi
    Wang, Ruijie
    Yang, Chaoqi
    Abdelzaher, Tarek
    PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT, 2020, 4 (03):
  • [27] Design and Analysis of the Gateway-level Topology Map in Topology-aware ALM Systems
    Cui, Jianqun
    Xiong, Naixue
    Wu, Libing
    Jia, Keming
    Gao, Kuan
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 4409 - 4414
  • [28] Topology-aware Piecewise Linearization of the AC Power Flow through Generative Modeling
    Cho, Young-Ho
    Zhu, Hao
    2023 NORTH AMERICAN POWER SYMPOSIUM, NAPS, 2023,
  • [29] Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning
    Yu, Sixing
    Mazaheri, Arya
    Jannesari, Ali
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [30] PSF toolkit: an R package for pathway curation and topology-aware analysis
    Hakobyan, Siras
    Stepanyan, Ani
    Nersisyan, Lilit
    Binder, Hans
    Arakelyan, Arsen
    FRONTIERS IN GENETICS, 2023, 14