A Universal Malicious Documents Static Detection Framework Based on Feature Generalization

被引:9
|
作者
Lu, Xiaofeng [1 ]
Wang, Fei [1 ]
Jiang, Cheng [1 ]
Lio, Pietro [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Cyberspace Secur, Beijing 100876, Peoples R China
[2] Univ Cambridge, Comp Lab, Cambridge CB3 0FD, England
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 24期
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
malicious document detection; static detection; feature generalization; machine learning;
D O I
10.3390/app112412134
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this study, Portable Document Format (PDF), Word, Excel, Rich Test format (RTF) and image documents are taken as the research objects to study a static and fast method by which to detect malicious documents. Malicious PDF and Word document features are abstracted and extended, which can be used to detect other types of documents. A universal static detection framework for malicious documents based on feature generalization is then proposed. The generalized features include specification check errors, the structure path, code keywords, and the number of objects. The proposed method is verified on two datasets, and is compared with Kaspersky, NOD32, and McAfee antivirus software. The experimental results demonstrate that the proposed method achieves good performance in terms of the detection accuracy, runtime, and scalability. The average F1-score of all types of documents is found to be 0.99, and the average detection time of a document is 0.5926 s, which is at the same level as the compared antivirus software.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] PUMD: a PU learning-based malicious domain detection framework
    Zhaoshan Fan
    Qing Wang
    Haoran Jiao
    Junrong Liu
    Zelin Cui
    Song Liu
    Yuling Liu
    Cybersecurity, 5
  • [42] PUMD: a PU learning-based malicious domain detection framework
    Fan, Zhaoshan
    Wang, Qing
    Jiao, Haoran
    Liu, Junrong
    Cui, Zelin
    Liu, Song
    Liu, Yuling
    CYBERSECURITY, 2022, 5 (01)
  • [43] MalFinder: An Ensemble Learning-based Framework For Malicious Traffic Detection
    Rong, Candong
    Gou, Gaopeng
    Cui, Mingxin
    Xiong, Gang
    Li, Zhen
    Guo, Li
    2020 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2020, : 588 - 594
  • [44] FedURL: A BERT-based Federated Malicious URL Detection Framework
    Wu, Songyang
    Shen, Bochen
    Zheng, Hao
    Sun, Wenqi
    Sun, Qi
    Wang, Shuang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IX, ICIC 2024, 2024, 14870 : 122 - 131
  • [45] Malicious Domain Names Detection Algorithm Based on Lexical Analysis and Feature Quantification
    Zhao, Hong
    Chang, Zhaobin
    Wang, Weijie
    Zeng, Xiangyan
    IEEE ACCESS, 2019, 7 : 128990 - 128999
  • [46] Automated Model Learning for Accurate Detection of Malicious Digital Documents
    Scofield, Daniel
    Miles, Craig
    Kuhn, Stephen
    DIGITAL THREATS: RESEARCH AND PRACTICE, 2020, 1 (03):
  • [47] Ensemble Learning for Detection of Malicious Content Embedded in PDF Documents
    Nath, Hiran V.
    Mehtre, B. M.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [48] A Review of Static Detection Methods for Android Malicious Application
    Pan J.
    Cui Z.
    Lin G.
    Chen X.
    Zheng L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (08): : 1875 - 1894
  • [49] An ensemble framework for interpretable malicious code detection
    Cheng, Jieren
    Zheng, Jiachen
    Yu, Xiaomei
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10100 - 10117
  • [50] ClickKeeper: A Framework of Malicious Click Prevention and Detection
    Yan, Jihong
    Li, You
    2013 INTERNATIONAL CONFERENCE ON SOCIAL SCIENCES RESEARCH (SSR 2013), PT 1, 2013, 1 : 339 - 344