Toward Scalable Internet Traffic Measurement and Analysis with Hadoop

被引:1
|
作者
Lee, Yeonhee [1 ]
Lee, Youngseok [1 ]
机构
[1] Chungnam Natl Univ, Dept Comp Engn, Daejon, South Korea
关键词
Hadoop; Hive; MapReduce; NetFlow; pcap; packet; traffic measurement; analysis;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Internet traffic measurement and analysis has long been used to characterize network usage and user behaviors, but faces the problem of scalability under the explosive growth of Internet traffic and high-speed access. Scalable Internet traffic measurement and analysis is difficult because a large data set requires matching computing and storage resources. Hadoop, an open-source computing platform of MapReduce and a distributed file system, has become a popular infrastructure for massive data analytics because it facilitates scalable data processing and storage services on a distributed computing system consisting of commodity hardware. In this paper, we present a Hadoop-based traffic monitoring system that performs IP, TCP, HTTP, and NetFlow analysis of multi-terabytes of Internet traffic in a scalable manner. From experiments with a 200-node testbed, we achieved 14 Gbps throughput for 5 TB files with IP and HTTP-layer analysis MapReduce jobs. We also explain the performance issues related with traffic analysis MapReduce jobs.
引用
收藏
页码:6 / 13
页数:8
相关论文
共 50 条
  • [21] Offline traffic analysis system based on Hadoop
    QIAO Yuanyuan
    LEI Zhenming
    YUAN Lun
    GUO Minjie
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2013, 20 (05) : 97 - 103
  • [22] Practical scalable image analysis and indexing using Hadoop
    Jonathon S. Hare
    Sina Samangooei
    Paul H. Lewis
    Multimedia Tools and Applications, 2014, 71 : 1215 - 1248
  • [23] Practical scalable image analysis and indexing using Hadoop
    Hare, Jonathon S.
    Samangooei, Sina
    Lewis, Paul H.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 71 (03) : 1215 - 1248
  • [24] Galaxy plus Hadoop: Toward a Collaborative and Scalable Image Processing Toolbox in Cloud
    Chen, Shiping
    Bednarz, Tomasz
    Szul, Piotr
    Wang, Dadong
    Arzhaeva, Yulia
    Burdett, Neil
    Khassapov, Alex
    Zic, John
    Nepal, Surya
    Gurevey, Tim
    Taylor, John
    SERVICE-ORIENTED COMPUTING - ICSOC 2013 WORKSHOPS, 2014, 8377 : 339 - 351
  • [25] The traffic measurement and the empirical studies for the Internet
    Kushida, T
    GLOBECOM 98: IEEE GLOBECOM 1998 - CONFERENCE RECORD, VOLS 1-6: THE BRIDGE TO GLOBAL INTEGRATION, 1998, : 1142 - 1147
  • [26] Stochastic sampling for Internet traffic measurement
    Wolf, Tilman
    Cai, Yan
    Kelly, Patrick
    Gong, Weibo
    2007 IEEE GLOBAL INTERNET SYMPOSIUM, 2007, : 31 - 36
  • [27] Internet measurement: Infrastructure, traffic and applications
    Fitz-Gerald, Stuart J.
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2007, 27 (05) : 375 - 376
  • [28] Traffic measurement and the empirical studies for the Internet
    Kushida, Takayuki
    Conference Record / IEEE Global Telecommunications Conference, 1998, 2 : 1142 - 1147
  • [29] Measurement and interpretation of voice traffic on the Internet
    Maxemchuk, NF
    Lo, S
    ICC'97: 1997 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS - TOWARDS THE KNOWLEDGE MILLENNIUM, CONFERENCE RECORD - VOLS 1-3, 1997, : 500 - 507
  • [30] Toward a scalable visualization system for network traffic monitoring
    Le Malecot, Erwan
    Kohara, Masayoshi
    Hori, Yoshiaki
    Sakurai, Kouichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (05): : 1300 - 1310