Identifying High Cardinality Internet Hosts

被引:46
|
作者
Cao, Jin [1 ]
Jin, Yu [2 ]
Chen, Aiyou [1 ]
Bu, Tian [1 ]
Zhang, Zhi-Li [2 ]
机构
[1] Alcatel Lucent, Bell Labs, Murray Hill, NJ USA
[2] Univ Minnesota, Dept Comp Sci, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/INFCOM.2009.5061990
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Internet host cardinality, defined as the number of distinct peers that an Internet host communicates with, is an important metric for profiling Internet hosts. Some example applications include behavior based network intrusion detection, p2p hosts identification, and server identification. However, due to the tremendous number of hosts in the Internet and high speed links, tracking the exact cardinality of each host is not feasible due to the limited memory and computation resource. Existing approaches on host cardinality counting have primarily focused on hosts of extremely high cardinatities. These methods do not work well with hosts of moderately large cardinalities that are needed for certain host behavior profiling such as detection of p2p hosts or port scanners. In this paper, we propose an online sampling approach for identifying hosts whose cardinality exceeds some moderate prescribed threshold, e.g. 50, or within specific ranges. The main advantage of our approach is that it can filter out the majority of low cardinality hosts while preserving the hosts of interest, and hence minimize the memory resources wasted by tracking irrelevant hosts. Our approach consists of three components: 1) two-phase filtering for eliminating low cardinality hosts, 2) thresholded bitmap for counting cardinatities, and 3) bias correction. Through both theoretical analysis and experiments using real Internet traces, we demonstrate that our approach requires much less memory than existing approaches do whereas yields more accurate estimates.
引用
收藏
页码:810 / +
页数:2
相关论文
共 50 条
  • [31] Fundamental effects of clustering on the euclidean embedding of internet hosts
    Lee, Sanghwan
    Zhang, Zhi-Li
    Saliu, Sambit
    Saha, Debanjan
    Srinivasan, Mukund
    NETWORKING 2007: AD HOC AND SENSOR NETWORKS, WIRELESS NETWORKS, NEXT GENERATION INTERNET, PROCEEDINGS, 2007, 4479 : 890 - +
  • [32] Two-tier geographic location of Internet hosts
    Gueye, B
    Ziviani, A
    Fdida, S
    de Rezende, JF
    Duarte, OCMB
    HIGH SPEED NETWORKS AND MULTIMEDIA COMMUNICATIONS, PROCEEDINGS, 2004, 3079 : 730 - 739
  • [33] Approximate Cardinality Estimation (ACE) in large-scale Internet of Things deployments
    Cao, Qing
    Feng, Yunhe
    Lu, Zheng
    Qi, Hairong
    Tolbert, Leon M.
    Wan, Lipeng
    Wang, Zhibo
    Zhou, Wenjun
    AD HOC NETWORKS, 2017, 66 : 52 - 63
  • [34] Leveraging buffering delay estimation for geolocation of Internet hosts
    Gueye, Bamba
    Uhlig, Steve
    Ziviani, Artur
    Fdida, Serge
    NETWORKING 2006: NETWORKING TECHNOLOGIES, SERVICES, AND PROTOCOLS; PERFORMANCE OF COMPUTER AND COMMUNICATION NETWORKS; MOBILE AND WIRELESS COMMUNICATIONS SYSTEMS, 2006, 3976 : 319 - 330
  • [35] Abnormal hosts monitor for city wide core network by real time super points cardinality estimation
    Xu, Jie
    Ding, Wei
    Hu, Xiaoyan
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 627 - 634
  • [36] A screen for identifying maladaptive internet use
    Chow S.L.
    Leung G.M.
    Ng C.
    Yu E.
    International Journal of Mental Health and Addiction, 2009, 7 (2) : 324 - 332
  • [37] Identifying locations for targeted advertising on the Internet
    Bhatnagar, A
    Papatla, P
    INTERNATIONAL JOURNAL OF ELECTRONIC COMMERCE, 2001, 5 (03) : 23 - 44
  • [38] Identifying and analysis the core structure of the internet
    Xie, Yuancheng
    Zhang, Zhaoxin
    Gao, Haoyang
    Li, Ning
    Li, Jialu
    COMPUTING, 2025, 107 (03)
  • [39] Identifying critical autonomous systems in the Internet
    Nur, Abdullah Yasin
    Tozal, Mehmet Engin
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (10): : 4965 - 4985
  • [40] LZR: Identifying Unexpected Internet Services
    Izhikevich, Liz
    Teixeira, Renata
    Durumeric, Zakir
    PROCEEDINGS OF THE 30TH USENIX SECURITY SYMPOSIUM, 2021, : 3111 - 3128