Using eigenvalues of distance matrices for outlier detection

Times cited: 0
Author
Modarres, Reza [1]
Affiliation
[1] George Washington Univ, Dept Stat, Washington, DC 20052 USA
Keywords
Distance matrix; decomposition; eigenvalue; outlier; detection
DOI
10.3233/IDA-230048
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Distance or dissimilarity matrices are widely used in applications. We study the relationships between the eigenvalues of the distance matrices and outliers and show that outliers affect the pairwise distances and inflate the eigenvalues. We obtain the eigenvalues of a distance matrix that is affected by k outliers and compare them to the eigenvalues of a distance matrix with a constant structure. We show a discrepancy in the sizes of the eigenvalues of a distance matrix that is contaminated with outliers, present an algorithm and offer a new outlier detection method based on the eigenvalues of the distance matrix. We compare the new distance-based outlier technique with several existing methods under five distributions. The methods are applied to a study of public utility companies and gene expression data.
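The inflation effect described in the abstract can be illustrated with a small numerical sketch. This is not the paper's algorithm, which is not reproduced in this record; it only checks two facts under stated assumptions: a constant-structure distance matrix c(J - I) has eigenvalues c(n - 1) and -c, and moving one point far from the rest increases the largest eigenvalue of the Euclidean distance matrix. The helper names, sample size, and shift of 20 units are hypothetical choices made for illustration.

```python
import numpy as np


def distance_matrix(X):
    """Pairwise Euclidean distances between the rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def largest_eigenvalue(D):
    """Largest eigenvalue of a symmetric matrix (eigvalsh sorts ascending)."""
    return np.linalg.eigvalsh(D)[-1]


# Constant-structure reference: every off-diagonal distance equals c.
n, c = 6, 2.0
constant = c * (np.ones((n, n)) - np.eye(n))
constant_eigs = np.linalg.eigvalsh(constant)  # -c repeated n-1 times, then c*(n-1)

# Illustrative data (assumption): 50 standard normal points in 3 dimensions.
rng = np.random.default_rng(0)
clean = rng.normal(size=(50, 3))

# Contaminate by shifting one observation far from the bulk of the data.
contaminated = clean.copy()
contaminated[0] += 20.0

lam_clean = largest_eigenvalue(distance_matrix(clean))
lam_contaminated = largest_eigenvalue(distance_matrix(contaminated))

print(f"largest eigenvalue, clean data:        {lam_clean:.2f}")
print(f"largest eigenvalue, with one outlier:  {lam_contaminated:.2f}")
```

Shifting the point enlarges every distance in its row and column and leaves the others unchanged, so the largest eigenvalue of the nonnegative distance matrix cannot decrease. How the paper turns this discrepancy into a detection rule is not stated in the abstract.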
Pages: 871-889
Number of pages: 19
Related papers
50 in total
  • [31] Outlier detection in hyperspectral imagery using closest distance to center with ellipsoidal multivariate trimming
    Caulk, Ryan F.
    Reyes, Kevin B.
    Bauer, Kenneth W., Jr.
    JOURNAL OF DEFENSE MODELING AND SIMULATION-APPLICATIONS METHODOLOGY TECHNOLOGY-JDMS, 2012, 9 (02): : 163 - 172
  • [32] Using Distance-Based Outlier Detection Method to Handle the Abnormal Gateway in WSN
    Su, Wei
    Fu, Jingqi
    Wang, Haikuan
    ASIASIM 2012, PT II, 2012, 324 : 151 - 159
  • [33] OUTLIER DETECTION AND RELIABILITY OF ADJUSTMENT MODELS WITH SINGULAR COVARIANCE MATRICES
    WANG Jinling
    CHENG Yongqi
    TAO Benzao
    Geo-Spatial Information Science, 1998, (01) : 55 - 59
  • [34] ERDOF: outlier detection algorithm based on entropy weight distance and relative density outlier factor
    Zhang Z.
    Liu W.
    Zhang Y.
    Deng Y.
    Wei M.
    Tongxin Xuebao/Journal on Communications, 2021, 42 (09): : 133 - 143
  • [35] Classification of Graph Sequences Utilizing the Eigenvalues of the Distance Matrices and Hidden Markov Models
    Schmidt, Miriam
    Schwenker, Friedhelm
    GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, 2011, 6658 : 325 - 334
  • [36] Classification of graph sequences utilizing the eigenvalues of the distance matrices and hidden markov models
    Schmidt M.
    Schwenker F.
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2011, 6658 LNCS : 325 - 334
  • [37] R-NN curves: An intuitive approach to outlier detection using a distance based method
    Guha, Rajarshi
    Dutta, Debojyoti
    Jurs, Peter C.
    Chen, Ting
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (04) : 1713 - 1722
  • [38] A novel spatial outlier detection algorithm based on Mahalanobis distance
    Wen, Junhao
    Wu, Hongyan
    Wu, Zhongfu
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 3574 - 3577
  • [39] Multi-Tactic Distance-based Outlier Detection
    Cao, Lei
    Yan, Yizhou
    Kuhlman, Caitlin
    Wang, Qingyang
    Rundensteiner, Elke A.
    Eltabakh, Mohamed
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 959 - 970
  • [40] CORRECTING AND COMPLEMENTING FREEWAY TRAFFIC ACCIDENT DATA USING MAHALANOBIS DISTANCE BASED OUTLIER DETECTION
    Sun, Bin
    Cheng, Wei
    Bai, Guohua
    Goswami, Prashant
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2017, 24 (05): : 1597 - 1607