Data-driven Curation, Learning and Analysis for Inferring Evolving loT Botnets in the Wild

被引:3
|
作者
Pour, Morteza Safaei [1 ]
Mangino, Antonio [1 ]
Friday, Kurt [1 ]
Rathbun, Matthias [1 ]
Bou-Harb, Elias [1 ]
Iqbal, Farkhund [2 ]
Shaban, Khaled [3 ]
Erradi, Abdelkarim [3 ]
机构
[1] Florida Atlantic Univ, Cyber Threat Intelligence Lab, Boca Raton, FL 33431 USA
[2] Zayed Univ, Coll Technol Innovat, Dubai, U Arab Emirates
[3] Qatar Univ, Dept Comp Sci & Engn, Doha, Qatar
基金
美国国家科学基金会;
关键词
Internet-of-Things; IoT botnets; network security; network telescopes; Internet measurements; deep learning; INTERNET;
D O I
10.1145/3339252.3339272
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The insecurity of the Internet-of-Things (IoT) paradigm continues to wreak havoc in consumer and critical infrastructure realms. Several challenges impede addressing IoT security at large, including, the lack of IoT-centric data that can be collected, analyzed and correlated, due to the highly heterogeneous nature of such devices and their widespread deployments in Internet-wide environments. To this end, this paper explores macroscopic, passive empirical data to shed light on this evolving threat phenomena. This not only aims at classifying and inferring Internet-scale compromised IoT devices by solely observing such one-way network traffic, but also endeavors to uncover, track and report on orchestrated "in the wild" IoT botnets. Initially, to prepare the effective utilization of such data, a novel probabilistic model is designed and developed to cleanse such traffic from noise samples (i.e., misconfiguration traffic). Subsequently, several shallow and deep learning models are evaluated to ultimately design and develop a multi-window convolution neural network trained on active and passive measurements to accurately identify compromised IoT devices. Consequently, to infer orchestrated and unsolicited activities that have been generated by well-coordinated IoT botnets, hierarchical agglomerative clustering is deployed by scrutinizing a set of innovative and efficient network feature sets. By analyzing 3.6 TB of recent darknet traffic, the proposed approach uncovers a momentous 440,000 compromised IoT devices and generates evidence -based artifacts related to 350 IoT botnets. While some of these detected botnets refer to previously documented campaigns such as the Hide and Seek, Ha j ime and Fbot, other events illustrate evolving threats such as those with cryptojacking capabilities and those that are targeting industrial control system communication and control services.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Charting the landscape of data-driven learning using a bibliometric analysis
    Dong, Jihua
    Zhao, Yanan
    Buckingham, Louisa
    RECALL, 2023, 35 (03) : 339 - 355
  • [22] Risks in Work-Integrated Learning: A Data-Driven Analysis
    Xu, Xiao
    EDUCATION SCIENCES, 2025, 15 (01):
  • [23] PADDLE: Performance Analysis using a Data-driven Learning Environment
    Thiagarajan, Jayaraman J.
    Anirudh, Rushil
    Kailkhura, Bhavya
    Jain, Nikhil
    Islam, Tanzima
    Bhatele, Abhinav
    Yeom, Jae-Seung
    Gamblin, Todd
    2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 784 - 793
  • [24] Data-Driven Nonlinear Modal Analysis: A Deep Learning Approach
    Li, Shanwu
    Yang, Yongchao
    NONLINEAR STRUCTURES & SYSTEMS, VOL 1, 2023, : 229 - 231
  • [25] Learning event-triggered control based on evolving data-driven fuzzy granular models
    Cordovil Jr, Luiz A. Q.
    Coutinho, Pedro H. S.
    Bessa, Iury
    Peixoto, Marcia L. C.
    Palhares, Reinaldo Martinez
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2805 - 2827
  • [26] Data-Driven Predictive Maintenance in Evolving Environments: A Comparison Between Machine Learning and Deep Learning for Novelty Detection
    Del Buono, Francesco
    Calabrese, Francesca
    Baraldi, Andrea
    Paganelli, Matteo
    Regattieri, Alberto
    SUSTAINABLE DESIGN AND MANUFACTURING, KES-SDM 2021, 2022, 262 : 109 - 119
  • [27] Optimization for Data-Driven Learning and Control
    Khan, Usman A.
    Bajwa, Waheed U.
    Nedic, Angelia
    Rabbat, Michael G.
    Sayed, Ali H.
    PROCEEDINGS OF THE IEEE, 2020, 108 (11) : 1863 - 1868
  • [28] Data-Driven Control and Learning Systems
    Hou, Zhongsheng
    Gao, Huijun
    Lewis, Frank L.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (05) : 4070 - 4075
  • [29] Data-driven learning: The perpetual challenge
    Johns, T
    TEACHING AND LEARNING BY DOING CORPUS ANALYSIS, 2002, (42): : 107 - 117