DENDROID: A text mining approach to analyzing and classifying code structures in Android malware families

被引:155
|
作者
Suarez-Tangil, Guillermo [1 ]
Tapiador, Juan E. [1 ]
Pens-Lopez, Pedro [1 ]
Blasco, Jorge [1 ]
机构
[1] Univ Carlos III Madrid, Dept Comp Sci, Comp Secur COSEC Lab, Madrid 28911, Spain
关键词
Malware analysis; Software similarity and classification; Text mining; Information retrieval; Smartphones; Android OS;
D O I
10.1016/j.eswa.2013.07.106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid proliferation of smartphones over the last few years has come hand in hand with and impressive growth in the number and sophistication of malicious apps targetting smartphone users. The availability of reuse-oriented development methodologies and automated malware production tools makes exceedingly easy to produce new specimens. As a result, market operators and malware analysts are increasingly overwhelmed by the amount of newly discovered samples that must be analyzed. This situation has stimulated research in intelligent instruments to automate parts of the malware analysis process. In this paper, we introduce DENDROID, a system based on text mining and information retrieval techniques for this task. Our approach is motivated by a statistical analysis of the code structures found in a dataset of ANDROID OS malware families, which reveals some parallelisms with classical problems in those domains. We then adapt the standard Vector Space Model and reformulate the modelling process followed in text mining applications. This enables us to measure similarity between malware samples, which is then used to automatically classify them into families. We also investigate the application of hierarchical clustering over the feature vectors obtained for each malware family. The resulting dendo-grams resemble the so-called phylogenetic trees for biological species, allowing us to conjecture about evolutionary relationships among families. Our experimental results suggest that the approach is remarkably accurate and deals efficiently with large databases of malware instances. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1104 / 1117
页数:14
相关论文
共 42 条
  • [1] Classifying Android Malware through Subgraph Mining
    Martinelli, Fabio
    Saracino, Andrea
    Sgandurra, Daniele
    DATA PRIVACY MANAGEMENT AND AUTONOMOUS SPONTANEOUS SECURITY, DPM 2013, 2014, 8247 : 268 - 283
  • [2] Lexical Mining of Malicious URLs for Classifying Android Malware
    Wang, Shanshan
    Yan, Qiben
    Chen, Zhenxiang
    Wang, Lin
    Spolaor, Riccardo
    Yang, Bo
    Conti, Mauro
    SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2018, PT I, 2018, 254 : 248 - 263
  • [3] Detecting Android Malware and Classifying Its Families in Large-scale Datasets
    Sun, Bo
    Takahashi, Takeshi
    Ban, Tao
    Inoue, Daisuke
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2022, 13 (02)
  • [4] CANDYMAN: Classifying Android malware families by modelling dynamic traces with Markov chains
    Martin, Alejandro
    Rodriguez-Fernandez, Victor
    Camacho, David
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 74 : 121 - 133
  • [5] A text mining approach to analyzing Annals literature
    Moro, Sergio
    Rita, Paulo
    Cortez, Paulo
    ANNALS OF TOURISM RESEARCH, 2017, 66 : 208 - 210
  • [6] Visualising Static Features and Classifying Android Malware Using a Convolutional Neural Network Approach
    Kiraz, Omer
    Dogru, Ibrahim Alper
    APPLIED SCIENCES-BASEL, 2024, 14 (11):
  • [7] DroidMD: An efficient and scalable Android malware detection approach at source code level
    Akram J.
    Mumtaz M.
    Jabeen G.
    Luo P.
    International Journal of Information and Computer Security, 2021, 15 (2-3) : 299 - 321
  • [8] MADRAS-NET: A deep learning approach for detecting and classifying android malware using Linknet
    Wang, Yi
    Jia, Shanshan
    Measurement: Sensors, 2024, 33
  • [9] Android Botnet Detection An Integrated Source Code Mining Approach
    Alothman, Basil
    Rattadilok, Prapa
    2017 12TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2017, : 111 - 115
  • [10] A hybrid pattern based text mining approach for malware detection using DBScan
    Aashima Malhotra
    Karan Bajaj
    CSI Transactions on ICT, 2016, 4 (2-4) : 141 - 149