Big Data Framework for Zero-Day Malware Detection

被引:16
|
作者
Gupta, Deepak [1 ]
Rani, Rinkle [1 ]
机构
[1] Thapar Univ, Dept Comp Sci & Engn, Patiala, Punjab, India
关键词
Apache Spark; big data; machine learning; malware detection; MLlib;
D O I
10.1080/01969722.2018.1429835
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Malware has already been recognized as one of the most dominant cyber threats on the Internet today. It is growing exponentially in terms of volume, variety, and velocity, and thus overwhelms the traditional approaches used for malware detection and classification. Moreover, with the advent of Internet of Things, there is a huge growth in the volume of digital devices and in such scenario, malicious binaries are bound to grow even faster making it a big data problem. To analyze and detect unknown malware on a large scale, security analysts need to make use of machine learning algorithms along with big data technologies. These technologies help them to deal with current threat landscape consisting of complex and large flux of malicious binaries. This paper proposes the design of a scalable architecture built on the top of Apache Spark which uses its scalable machine learning library (MLlib) for detecting zero-day malware. The proposed platform is tested and evaluated on a dataset comprising of 0.2 million files consisting of 0.05 million clean files and 0.15 million malicious binaries covering a large number of malware families over a period of 7 years starting from 2010.
引用
收藏
页码:103 / 121
页数:19
相关论文
共 50 条
  • [41] Zero-day attack detection: a systematic literature review
    Ahmad, Rasheed
    Alsmadi, Izzat
    Alhamdani, Wasim
    Tawalbeh, Lo'ai
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (10) : 10733 - 10811
  • [42] SimCSE for Encrypted Traffic Detection and Zero-Day Attack Detection
    Bar, Rotem
    Hajaj, Chen
    IEEE ACCESS, 2022, 10 : 56952 - 56960
  • [43] A Survey on Zero-Day Polymorphic Worm Detection Techniques
    Kaur, Ratinder
    Singh, Maninder
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2014, 16 (03): : 1520 - 1549
  • [44] An active learning framework using deep Q-network for zero-day attack detection
    Wu, Yali
    Hu, Yanghu
    Wang, Junhu
    Feng, Mengqi
    Dong, Ang
    Yang, Yanxi
    COMPUTERS & SECURITY, 2024, 139
  • [45] Deep Anomaly Detection Framework Utilizing Federated Learning for Electricity Theft Zero-Day Cyberattacks
    Alshehri, Ali
    Badr, Mahmoud M.
    Baza, Mohamed
    Alshahrani, Hani
    SENSORS, 2024, 24 (10)
  • [46] VisualPhishNet: Zero-Day PhishingWebsite Detection by Visual Similarity
    Abdelnabi, Sahar
    Krombholz, Katharina
    Fritz, Mario
    CCS '20: PROCEEDINGS OF THE 2020 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2020, : 1681 - 1698
  • [47] Zero-day attack detection: a systematic literature review
    Rasheed Ahmad
    Izzat Alsmadi
    Wasim Alhamdani
    Lo’ai Tawalbeh
    Artificial Intelligence Review, 2023, 56 : 10733 - 10811
  • [48] ZeVigilante: Detecting Zero-Day Malware Using Machine Learning and Sandboxing Analysis Techniques
    Alhaidari, Fahd
    Shaib, Nouran Abu
    Alsafi, Maram
    Alharbi, Haneen
    Alawami, Majd
    Aljindan, Reem
    Rahman, Atta-ur
    Zagrouba, Rachid
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [49] A Consensus Framework for Reliability and Mitigation of Zero-Day Attacks in IoT
    Sharma, Vishal
    Lee, Kyungroul
    Kwon, Soonhyun
    Kim, Jiyoon
    Park, Hyungjoon
    Yim, Kangbin
    Lee, Sun-Young
    SECURITY AND COMMUNICATION NETWORKS, 2017,
  • [50] Can Machine/Deep Learning Classifiers Detect Zero-Day Malware with High Accuracy?
    Abri, Faranak
    Siami-Namini, Sima
    Khanghah, Mandi Adl
    Soltani, Fahimch Mirza
    Namin, Akbar Siami
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3252 - 3259