Z-Glyph: Visualizing outliers in multivariate data

Cited by: 32
Authors
Cao, Nan [1]
Lin, Yu-Ru [2]
Gotz, David [3]
Du, Fan [4]
Affiliations
[1] Tongji Univ, Shanghai, Peoples R China
[2] Univ Pittsburgh, Pittsburgh, PA USA
[3] Univ N Carolina, Chapel Hill, NC USA
[4] Univ Maryland, College Pk, MD 20742 USA
Funding
National Science Foundation (USA);
Keywords
Outlier detection; anomaly detection; information visualization; multidimensional data visualization; interactive visualization; intrusion; taxonomy; number
DOI
10.1177/1473871616686635
Chinese Library Classification
TP31 [Computer software];
Subject classification code
081202 ; 0835 ;
Abstract
Outlier analysis techniques are used extensively in many domains, such as intrusion detection. Today, even with the most advanced statistical learning techniques, human judgment still plays an important role in outlier analysis tasks because outlier examples are difficult to define and collect. This work seeks to tackle this problem by introducing a new visualization design, Z-Glyph, a family of glyphs designed to facilitate human judgment in outlier analysis of multivariate data. By employing a location-scale transformation, a Z-Glyph represents normal data using regular shapes (e.g. a straight line or a circle), so that abnormal data are revealed when they deviate from those regular shapes. An extensive controlled experiment and case studies based on real-world datasets indicate the superior performance of the Z-Glyph family compared with the baselines, suggesting that the proposed design is able to combine human perceptual features with statistical characterization. This study contributes to a more fundamental understanding of how to design visual representations for revealing outliers in multivariate data, which can be applied as a building block in many domain-specific anomaly detection applications.
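The abstract does not spell out the transformation, so the following is only a minimal sketch of one common location-scale transformation, the z-score, under the assumption that each dimension is centered on its mean and scaled by its sample standard deviation. The function names `z_scores` and `flag_outlier_dimensions` and the threshold of 3.0 are hypothetical and chosen for illustration; they are not taken from the paper. After the transform, a typical record maps to values near zero on every dimension (the "regular shape"), and large absolute values mark the dimensions where a record deviates.

```python
from statistics import mean, stdev


def z_scores(profile, reference):
    """Return one z-score per dimension of `profile`.

    `reference` is a list of rows describing the "normal" data; each
    dimension is centered on its mean and scaled by its sample
    standard deviation (an assumed choice of location and scale).
    """
    columns = list(zip(*reference))  # one tuple of values per dimension
    scores = []
    for value, column in zip(profile, columns):
        mu = mean(column)
        sigma = stdev(column)
        # A constant dimension has no spread, so report no deviation.
        scores.append((value - mu) / sigma if sigma > 0 else 0.0)
    return scores


def flag_outlier_dimensions(profile, reference, threshold=3.0):
    """Return indices of dimensions whose |z| exceeds `threshold` (hypothetical cutoff)."""
    return [i for i, z in enumerate(z_scores(profile, reference)) if abs(z) > threshold]


# Toy data invented for this example, not drawn from the paper.
reference = [(1.0, 10.0), (2.0, 12.0), (3.0, 14.0)]
record = (2.0, 20.0)
print(z_scores(record, reference))                 # 0.0 on dimension 0, 4.0 on dimension 1
print(flag_outlier_dimensions(record, reference))  # only dimension 1 is flagged
```

In a glyph built on such values, the zero baseline would be drawn as the straight line or circle, and each z-score as an offset from it.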
Pages: 22-40
Page count: 19
Related papers
50 in total
  • [31] StreamVisND: Visualizing Relationships in Streaming Multivariate Data
    Cheng, Shenghui
    Wang, Yue
    Zhang, Dan
    Jiang, Zhifang
    Mueller, Klaus
    2015 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY, 2015, : 191 - 192
  • [32] CoPlot: A tool for visualizing multivariate data in medicine
    Bravata, Dena M.
    Shojania, Kaveh G.
    Olkin, Ingram
    Raveh, Adi
    STATISTICS IN MEDICINE, 2008, 27 (12) : 2234 - 2247
  • [33] Towards Contextual Glyph Design: Visualizing Hearing Screenings
    Ramos, Barbara Nascimento
    Macas, Catarina
    Lourenco, Nuno
    Polisciuc, Evgheni
    2023 27TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION, IV, 2023, : 96 - 102
  • [34] UNION-INTERSECTION TESTING FOR OUTLIERS IN MULTIVARIATE NORMAL DATA
    CARONI, C
    PRESCOTT, P
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1995, 51 (2-4) : 185 - 196
  • [35] Incremental methods for detecting outliers from multivariate data stream
    Fong, S. (ccfong@umac.mo), 1600, Acta Press, Building B6, Suite 101, 2509 Dieppe Avenue S.W., Calgary, AB, T3E 7J9, Canada
  • [36] A comparative study of methods to handle outliers in multivariate data analysis
    Grentzelos, Christos
    Caroni, Chrysseis
    Barranco-Chamorro, Inmaculada
    COMPUTATIONAL AND MATHEMATICAL METHODS, 2021, 3 (03)
  • [37] COMPARISON OF DIFFERENT TECHNIQUES FOR DETECTION OF OUTLIERS IN CASE OF MULTIVARIATE DATA
    Iqbal, Muhammad Zafar
    Habib, Samra
    Khan, Muhammad Imran
    Kashif, Muhammad
    PAKISTAN JOURNAL OF AGRICULTURAL SCIENCES, 2020, 57 (03) : 865 - 869
  • [38] Detection of multivariate outliers in business survey data with incomplete information
    Todorov, Valentin
    Templ, Matthias
    Filzmoser, Peter
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2011, 5 (01) : 37 - 56
  • [39] Detecting outliers in multivariate data and visualization-R scripts
    Kim, Sung-Soo
    KOREAN JOURNAL OF APPLIED STATISTICS, 2018, 31 (04) : 517 - 528
  • [40] Eigenstructure-Based Angle for Detecting Outliers in Multivariate Data
    Aziz, Nazrina
    SAINS MALAYSIANA, 2014, 43 (12) : 1973 - 1977