Summarizing two-dimensional data with skyline-based statistical descriptors

被引:0
|
作者
Cormode, Graham [1 ]
Korn, Flip [1 ]
Muthukrishnan, S. [2 ]
Srivastava, Divesh [1 ]
机构
[1] AT&T Labs Res, Austin, TX USA
[2] Rutgers State Univ, New Brunswick, NJ USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Much real data consists of more than one dimension, such as financial transactions (eg, price x volume) and IP network flows (eg, duration x num-Bytes), and capture relationships between the variables. For a single dimension, quantiles are intuitive and robust descriptors. Processing and analyzing such data, particularly in data warehouse or data streaming settings, requires similarly robust and informative statistical descriptors that go beyond one-dimension. Applying quantile methods to summarize a multidimensional distribution along only singleton attributes ignores the rich dependence amongst the variables. In this paper, we present new skyline-based statistical descriptors for capturing the distributions over pairs of dimensions. They generalize the notion of quantiles in the individual dimensions, and also incorporate properties of the joint distribution. We introduce phi-quantours and alpha-radials, which are skyline points over subsets of the data, and propose (phi, alpha)-quantiles, found from the union of these skylines, as statistical descriptors of two-dimensional distributions. We present efficient online algorithms for tracking (phi, alpha)-quantiles on two-dimensional streams using guaranteed small space. We identify the principal properties of the proposed descriptors and perform extensive experiments with synthetic and real IP traffic data to study the efficiency of our proposed algorithms.
引用
收藏
页码:42 / +
页数:3
相关论文
共 50 条
  • [1] Two-dimensional wavelet based statistical monitoring of image data
    Koosha, Mehdi
    Noorossana, Rassoul
    Ahmadi, Orod
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2022, 38 (07) : 3797 - 3815
  • [2] A Skyline-Based Decision Boundary Estimation Method for Binominal Classification in Big Data
    Kalyvas, Christos
    Maragoudakis, Manolis
    COMPUTATION, 2020, 8 (03)
  • [3] SKYLINE PROJECTIONS IN TWO-DIMENSIONAL NMR-SPECTROSCOPY
    BLUMICH, B
    ZIESSOW, D
    JOURNAL OF MAGNETIC RESONANCE, 1982, 49 (01) : 151 - 154
  • [4] TWO-DIMENSIONAL MAPPING OF SENSORY TEXTURE DESCRIPTORS
    Rohm, Harald
    Duerrschmid, Klaus
    Forker, Anne
    Jaros, Doris
    JOURNAL OF TEXTURE STUDIES, 2010, 41 (06) : 789 - 803
  • [5] A two-dimensional backward heat problem with statistical discrete data
    Nguyen Dang Minh
    Khanh To Duc
    Nguyen Huy Tuan
    Dang Duc Trong
    JOURNAL OF INVERSE AND ILL-POSED PROBLEMS, 2018, 26 (01): : 13 - 31
  • [6] A COMPLETE SET OF FOURIER DESCRIPTORS FOR TWO-DIMENSIONAL SHAPES
    CRIMMINS, TR
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1982, 12 (06): : 848 - 855
  • [7] STATISTICAL-ANALYSIS OF TWO-DIMENSIONAL PALEOCURRENT DATA - METHODS AND EXAMPLES
    FISHER, NI
    POWELL, CM
    AUSTRALIAN JOURNAL OF EARTH SCIENCES, 1989, 36 (01) : 91 - 107
  • [8] Magnetoelectricity in two-dimensional statistical mixtures
    Turik, A. V.
    Chernobabov, A. I.
    Rodinin, M. Yu
    Tolokol'nikov, E. A.
    PHYSICS OF THE SOLID STATE, 2009, 51 (07) : 1478 - 1481
  • [9] Statistical mechanics of two-dimensional foams
    Durand, M.
    EPL, 2010, 90 (06)
  • [10] THE STATISTICAL MECHANICS OF TWO-DIMENSIONAL VESICLES
    Fisher, Michael E.
    JOURNAL OF MATHEMATICAL CHEMISTRY, 1990, 4 (01) : 395 - 399