Multi-perspective API call sequence behavior analysis and fusion for malware classification

被引:1
|
作者
Wu, Peng [1 ,2 ]
Gao, Mohan [1 ]
Sun, Fuhui [3 ]
Wang, Xiaoyan [3 ]
Pan, Li [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai Key Lab Integrated Adm Technol Informat S, Shanghai 200240, Peoples R China
[3] Informat Technol Serv Ctr Peoples Court, Beijing 100745, Peoples R China
基金
中国国家自然科学基金;
关键词
Malware detection; Contrastive learning; Graph neural network;
D O I
10.1016/j.cose.2024.104177
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing variety of malicious software, i.e., malware, has caused great damage and economic loss to computer systems. The API call sequence of malware reflects its dynamic behavior during execution, which is difficult to disguise. Therefore, API call sequence can serve as a robust feature for the detection and classification of malware. The statistical analysis presented in this paper reveals two distinct characteristics within the API call sequences of different malware: (1) the API existence feature caused by frequent calls to the APIs with some special functions, and (2) the API transition feature caused by frequent calls to some special API subsequence patterns. Based on these two characteristics, this paper proposes MINES, a Multi-perspective apI call sequeNce bEhavior fuSion malware classification Method. Specifically, the API existence features from different perspectives are described by two graphs that model diverse rich and complex existence relationships between APIs, and we adopt the graph contrastive learning framework to extract the consistent shared API existence feature from two graphs. Similarly, the API transition features of different hops are described by the multi-order transition probability matrices. By treat each order as a channel, a CNN-based contrastive learning framework is adopted to extract the API transition feature. Finally, the two kinds of extracted features are fused to classify malware. Experiments on five datasets demonstrate the superiority of MINES over various state-of-the-arts by a large margin.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] MINES: Multi-perspective API Call Sequence Behavior Fusion Malware Classification
    Gao, Mohan
    Wu, Peng
    Pan, Li
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2024, PT IV, 2024, 14853 : 210 - 220
  • [2] A Multi-Perspective malware detection approach through behavioral fusion of API call sequence
    Amer, Eslam
    Zelinka, Ivan
    El-Sappagh, Shaker
    COMPUTERS & SECURITY, 2021, 110
  • [3] Dynamic API call sequence visualisation for malware classification
    Tang, Mingdong
    Qian, Quan
    IET INFORMATION SECURITY, 2019, 13 (04) : 367 - 377
  • [4] Improvement of malware detection and classification using API call sequence alignment and visualization
    Hyunjoo Kim
    Jonghyun Kim
    Youngsoo Kim
    Ikkyun Kim
    Kuinam J. Kim
    Hyuncheol Kim
    Cluster Computing, 2019, 22 : 921 - 929
  • [5] Improvement of malware detection and classification using API call sequence alignment and visualization
    Kim, Hyunjoo
    Kim, Jonghyun
    Kim, Youngsoo
    Kim, Ikkyun
    Kim, Kuinam J.
    Kim, Hyuncheol
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 1): : 921 - 929
  • [6] Dynamic Malware Analysis Based on API Sequence Semantic Fusion
    Zhang, Sanfeng
    Wu, Jiahao
    Zhang, Mengzhe
    Yang, Wang
    APPLIED SCIENCES-BASEL, 2023, 13 (11):
  • [7] A Novel Approach to Detect Malware Based on API Call Sequence Analysis
    Ki, Youngjoon
    Kim, Eunjin
    Kim, Huy Kang
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [8] Evading API Call Sequence Based Malware Classifiers
    Fadadu, Fenil
    Handa, Anand
    Kumar, Nitesh
    Shukla, Sandeep Kumar
    INFORMATION AND COMMUNICATIONS SECURITY (ICICS 2019), 2020, 11999 : 18 - 33
  • [9] Malware Classification Using Dynamically Extracted API Call Embeddings
    Aggarwal, Sahil
    Di Troia, Fabio
    APPLIED SCIENCES-BASEL, 2024, 14 (13):
  • [10] Features Engineering for Malware Family Classification Based API Call
    Daeef, Ammar Yahya
    Al-Naji, Ali
    Chahl, Javaan
    COMPUTERS, 2022, 11 (11)