Identifying High Cardinality Internet Hosts

被引:46
|
作者
Cao, Jin [1 ]
Jin, Yu [2 ]
Chen, Aiyou [1 ]
Bu, Tian [1 ]
Zhang, Zhi-Li [2 ]
机构
[1] Alcatel Lucent, Bell Labs, Murray Hill, NJ USA
[2] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/INFCOM.2009.5061990
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet host cardinality, defined as the number of distinct peers that an Internet host communicates with, is an important metric for profiling Internet hosts. Some example applications include behavior based network intrusion detection, p2p hosts identification, and server identification. However, due to the tremendous number of hosts in the Internet and high speed links, tracking the exact cardinality of each host is not feasible due to the limited memory and computation resource. Existing approaches on host cardinality counting have primarily focused on hosts of extremely high cardinatities. These methods do not work well with hosts of moderately large cardinalities that are needed for certain host behavior profiling such as detection of p2p hosts or port scanners. In this paper, we propose an online sampling approach for identifying hosts whose cardinality exceeds some moderate prescribed threshold, e.g. 50, or within specific ranges. The main advantage of our approach is that it can filter out the majority of low cardinality hosts while preserving the hosts of interest, and hence minimize the memory resources wasted by tracking irrelevant hosts. Our approach consists of three components: 1) two-phase filtering for eliminating low cardinality hosts, 2) thresholded bitmap for counting cardinatities, and 3) bias correction. Through both theoretical analysis and experiments using real Internet traces, we demonstrate that our approach requires much less memory than existing approaches do whereas yields more accurate estimates.
引用
收藏
页码:810 / +
页数:2
相关论文
共 50 条
  • [1] Identifying High-Cardinality Hosts from Network-wide Traffic Measurements
    Liu, Yang
    Chen, Wenji
    Guan, Yong
    2013 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2013, : 287 - 295
  • [2] Identifying High-Cardinality Hosts from Network-Wide Traffic Measurements
    Liu, Yang
    Chen, Wenji
    Guan, Yong
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2016, 13 (05) : 547 - 558
  • [3] Monotonicity of the minimum cardinality of an identifying code in the hypercube
    Moncel, J
    DISCRETE APPLIED MATHEMATICS, 2006, 154 (06) : 898 - 899
  • [4] Mycoplasmas - Identifying hosts for a stealth pathogen
    Clark, J
    VETERINARY JOURNAL, 2005, 170 (03): : 273 - 274
  • [5] SNM Global hosts Internet Symposium
    不详
    JOURNAL OF NUCLEAR MEDICINE, 2001, 42 (05) : 22N - 22N
  • [6] Hosts dive into PDAs, multimedia and the internet
    Online Review, 1993, 17 (05):
  • [7] Internet reaches 93 million hosts
    Rutkowski, T
    IEEE INTERNET COMPUTING, 2000, 4 (06) : 11 - 11
  • [8] Robust Statistical Geolocation of Internet Hosts
    Wang, Zheng
    Mark, Brian L.
    2015 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2015,
  • [9] Identifying active resident hosts of VFR visitors
    Griffin, Tom
    Guttentag, Daniel
    INTERNATIONAL JOURNAL OF TOURISM RESEARCH, 2020, 22 (05) : 627 - 636
  • [10] Identifying malicious hosts involved in periodic communications
    Apruzzese, Giovanni
    Marchetti, Mirco
    Colajanni, Michele
    Zoccoli, Gabriele Gambigliani
    Guido, Alessandro
    2017 IEEE 16TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2017, : 11 - 18