SHARKFIN: Spatio-temporal mining of software adoption and penetration

被引:1
|
作者
Papalexakis, Evangelos E. [1 ]
Dumitras, Tudor [2 ]
Chau, Duen Horng [3 ]
Prakash, B. Aditya [4 ]
Faloutsos, Christos [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Maryland, Dept ECE, College Pk, MD 20742 USA
[3] Georgia Tech, Sch Computat Sci & Engn, Atlanta, GA USA
[4] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
关键词
Malware propagation; Internet security; Data analysis;
D O I
10.1007/s13278-014-0240-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
How does malware propagate? Does it form spikes over time? Does it resemble the propagation pattern of benign files, such as software patches? Does it spread uniformly over countries? How long does it take for a URL that distributes malware to be detected and shut down? In this work, we answer these questions by analyzing patterns from 22 million malicious (and benign) files, found on 1.6 million hosts worldwide during the month of June 2011. We conduct this study using the WINE database available at Symantec Research Labs. Additionally, we explore the research questions raised by sampling on such large databases of executables; the importance of studying the implications of sampling is twofold: First, sampling is a means of reducing the size of the database hence making it more accessible to researchers; second, because every such data collection can be perceived as a sample of the real world. We discover the SHARKFIN temporal propagation pattern of executable files, the GEOSPLIT pattern in the geographical spread of machines that report executables to Symantec's servers, the Periodic Power Law (PPL) distribution of the lifetime of URLs, and we show how to efficiently extrapolate crucial properties of the data from a small sample. We further investigate the propagation pattern of benign and malicious executables, unveiling latent structures in the way these files spread. To the best of our knowledge, our work represents the largest study of propagation patterns of executables.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [31] Mining spatio-temporal patterns in object mobility databases
    Verhein, Florian
    Chawla, Sanjay
    DATA MINING AND KNOWLEDGE DISCOVERY, 2008, 16 (01) : 5 - 38
  • [32] Towards a framework for mining and analysing spatio-temporal datasets
    Bertolotto, M.
    Di Martino, S.
    Ferrucci, F.
    Kechadi, T.
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2007, 21 (08) : 895 - 906
  • [33] Spatio-Temporal Data Mining: A Survey of Problems and Methods
    Atluri, Gowtham
    Karpatne, Anuj
    Kumar, Vipin
    ACM COMPUTING SURVEYS, 2018, 51 (04)
  • [34] Spatio-temporal data mining in ecological and veterinary epidemiology
    Moustakas, Aristides
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2017, 31 (04) : 829 - 834
  • [35] Spatio-Temporal Data Mining for Typhoon Image Collection
    Asanobu Kitamoto
    Journal of Intelligent Information Systems, 2002, 19 : 25 - 41
  • [36] Spatio-Temporal Data Mining for Aviation Delay Prediction
    Zhang, Kai
    Jiang, Yushan
    Liu, Dahai
    Song, Houbing
    2020 IEEE 39TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2020,
  • [37] Spatio-Temporal Frequent Itemset Mining on Web Data
    Aggarwal, Apeksha
    Toshniwal, Durga
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 1160 - 1165
  • [38] Spatio-temporal Data Mining for Maritime Situational Awareness
    Arguedas, Virginia Fernandez
    Mazzarella, Fabio
    Vespe, Michele
    OCEANS 2015 - GENOVA, 2015,
  • [39] Mining Spatio-Temporal Metadata for Satellite Images Interpretation
    Ettabaa, K. Saheb
    Farah, I. R.
    Ahmed, M. B.
    Solaiman, B.
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 736 - +
  • [40] STS: Complex Spatio-Temporal Sequence Mining in Flickr
    Zhou, Chunjie
    Meng, Xiaofeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, 2011, 6587 : 208 - 223