Comparison of Multivariate Outlier Detection Methods for Nearly Elliptical Distributions

被引:3
|
作者
Wada, Kazumi [1 ]
Kawano, Mariko [1 ]
Tsubaki, Hiroe [1 ]
机构
[1] Natl Stat Ctr NSTAC, Shinjyuku Ku, 19-1 Wakamatsu Cho, Tokyo 1628668, Japan
关键词
robust estimation; location and scatter; MSD; BACON; Fast-MCD; NNVE; R;
D O I
10.17713/ajs.v49i2.872
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this paper, the performance of outlier detection methods has been evaluated with symmetrically distributed datasets. We choose four estimators, viz. modified Stahel-Donoho (MSD) estimators, blocked adaptive computationally efficient outlier nominators, minimum covariance determinant estimator obtained by a fast algorithm, and nearest-neighbour variance estimator, which are known for their good performance with elliptically distributed data, for practical applications in national survey data processing. We adopt the data model of multivariate skew-t distribution, of which only the direction of the main axis is skewed and contaminated with outliers following another probability distribution for evaluation. We conducted Monte Carlo simulation under the data distribution to compare the performance of outlier detection. We also explore the applicability of the selected methods for several accounting items in small and medium enterprise survey data. Accordingly, it was found that the MSD estimators are the most suitable.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [31] Multivariate outlier detection in exploration geochemistry
    Filzmoser, P
    Garrett, RG
    Reimann, C
    COMPUTERS & GEOSCIENCES, 2005, 31 (05) : 579 - 587
  • [32] SIMULTANEOUS MULTIVARIATE OUTLIER AND TREND DETECTION
    Pazdernik, Karl
    Stanfill, Bryan
    Bramer, Lisa
    MacPhee, Kellie J.
    2019 IEEE DATA SCIENCE WORKSHOP (DSW), 2019, : 93 - 99
  • [33] Discussion of "Multivariate functional outlier detection"
    Arribas-Gil, Ana
    Romo, Juan
    STATISTICAL METHODS AND APPLICATIONS, 2015, 24 (02): : 263 - 267
  • [34] Rejoinder to ‘multivariate functional outlier detection’
    Mia Hubert
    Peter Rousseeuw
    Pieter Segaert
    Statistical Methods & Applications, 2015, 24 : 269 - 277
  • [35] Some Perspectives on Multivariate Outlier Detection
    Cerioli, Andrea
    Atkinson, Anthony C.
    Riani, Marco
    NEW PERSPECTIVES IN STATISTICAL MODELING AND DATA ANALYSIS, 2011, : 231 - 238
  • [36] Error rates for multivariate outlier detection
    Cerioli, Andrea
    Farcomeni, Alessio
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2011, 55 (01) : 544 - 553
  • [37] Rejoinder to 'multivariate functional outlier detection'
    Hubert, Mia
    Rousseeuw, Peter
    Segaert, Pieter
    STATISTICAL METHODS AND APPLICATIONS, 2015, 24 (02): : 269 - 277
  • [38] Outlier detection in animal multivariate trajectories
    Navarro, Jorge
    de Diego, Isaac Martin
    Perez, Paula Carballo
    Ortega, Felipe
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 190
  • [39] A comparison of four different methods for outlier detection in bioequivalence studies
    Ramsay, T
    Elkum, N
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2005, 15 (01) : 43 - 52
  • [40] Multivariate unified skew-elliptical distributions.
    Arellano-Valle, Reinaldo B.
    Genton, Marc G.
    CHILEAN JOURNAL OF STATISTICS, 2010, 1 (01): : 17 - 33