A predictive DEA model for outlier detection

Cited by: 8
Authors
Yang, Mingwen [1, 2]
Wan, Guohua [1]
Zheng, Eric [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Antai Coll Econ & Management, Shanghai 200030, Peoples R China
[2] Univ Texas Dallas, Naveen Jindal Sch Management, Richardson, TX 75080 USA
Keywords
predictive DEA; Bi-super DEA; outlier detection; simulation
DOI
10.1080/23270012.2014.889911
Chinese Library Classification number
F [Economics]
Discipline classification code
02
Abstract
Outlier detection is a key issue in data-driven analytics. In this paper, we propose Bi-super DEA, a super DEA-based method that constructs both efficient and inefficient frontiers for outlier detection. To evaluate its predictive performance, we develop a novel predictive DEA procedure, PDEA, which extends conventional DEA approaches, used primarily for in-sample efficiency estimation, to predict outputs out of sample. This enables us to compare the predictive performance of our approach against several popular outlier detection methods, including parametric robust regression from statistics and non-parametric k-means from data mining. We conduct comprehensive simulation experiments to examine the relative performance of these outlier detection methods under the influence of five factors: sample size, linearity of the production function, normality of the noise distribution, homogeneity of the data, and the level of random noise contaminating the data generating process (DGP). We find that, somewhat surprisingly, Bi-super CCR consistently outperforms Bi-super BCC in detecting outliers. Under the linearity, normality and homogeneity conditions, the parametric robust regression method works best. However, when the DGP violates these conditions, Bi-super DEA emerges as the better choice due to its distribution-free property. Our results shed light on the conditions under which each method excels or fails and provide users with practical guidelines on how to choose appropriate methods to detect outliers.
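To make the super DEA idea in the abstract concrete, the following is a minimal sketch of the standard input-oriented super-efficiency CCR model, in which each unit is scored against a frontier built from all the other units. It is not the paper's Bi-super DEA or PDEA procedure: the inefficient-frontier side and the out-of-sample prediction step are not reproduced, and the function name `super_efficiency_ccr` and the toy data are assumptions made for illustration only.

```python
import numpy as np
from scipy.optimize import linprog


def super_efficiency_ccr(X, Y):
    """Input-oriented super-efficiency CCR scores (illustrative sketch).

    X: (n, m) array of inputs, Y: (n, s) array of outputs, all positive.
    For unit o, solve: min theta
        s.t. sum_{j != o} lambda_j * x_j <= theta * x_o
             sum_{j != o} lambda_j * y_j >= y_o
             lambda_j >= 0
    A score well above 1 means the unit lies far beyond the frontier
    spanned by the other units, which flags it as a candidate outlier.
    """
    X = np.asarray(X, dtype=float)
    Y = np.asarray(Y, dtype=float)
    n, m = X.shape
    s = Y.shape[1]
    scores = np.full(n, np.nan)
    for o in range(n):
        others = [j for j in range(n) if j != o]
        Xr, Yr = X[others], Y[others]
        # Decision variables: [theta, lambda_1, ..., lambda_{n-1}]
        c = np.zeros(n)
        c[0] = 1.0
        # Input rows:  sum lambda_j x_ji - theta * x_oi <= 0
        A_in = np.hstack([-X[o][:, None], Xr.T])
        # Output rows: -sum lambda_j y_jr <= -y_or
        A_out = np.hstack([np.zeros((s, 1)), -Yr.T])
        res = linprog(
            c,
            A_ub=np.vstack([A_in, A_out]),
            b_ub=np.concatenate([np.zeros(m), -Y[o]]),
            bounds=[(0, None)] * n,
            method="highs",
        )
        if res.success:
            scores[o] = res.x[0]
    return scores


# Toy data (hypothetical): four units with equal input; the last one
# produces three times the output of the others.
X_demo = np.array([[1.0], [1.0], [1.0], [1.0]])
Y_demo = np.array([[1.0], [1.0], [1.0], [3.0]])
demo_scores = super_efficiency_ccr(X_demo, Y_demo)
```

In this toy case the fourth unit receives a score far above 1 while the others fall below 1, so a simple threshold on the score would flag it. How the threshold is chosen, and how the inefficient frontier is combined with it, is left to the paper.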
Pages: 20-41
Number of pages: 22
Related papers
50 in total
  • [1] Outlier detection in two-stage semiparametric DEA models
    Johnson, Andrew L.
    McGinnis, Leon F.
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2008, 187 (02) : 629 - 635
  • [2] Posterior Predictive Outlier Detection Using Sample Reweighting
    Zaslavsky, Alan M.
    Bradlow, Eric T.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2010, 19 (04) : 790 - 807
  • [3] Outlier Detection in a Circular Regression Model
    Rambli, Adzhar
    Yunus, Rossita Mohamad
    Mohamed, Ibrahim
    Hussin, Abdul Ghapor
    SAINS MALAYSIANA, 2015, 44 (07) : 1027 - 1032
  • [4] Detection Procedure for a Single Additive Outlier and Innovational Outlier in a Bilinear Model
    Zaharim, Azami
    Ahmad, Ibrahim
    Mohamed, Ibrahim
    Yahaya, Mohd Sahar
    PAKISTAN JOURNAL OF STATISTICS AND OPERATION RESEARCH, 2007, 3 (01) : 1 - 5
  • [5] Model predictive control for ARMAX processes with additive outlier noise
    Gao, Hui
    Tian, Ziwen
    MEASUREMENT & CONTROL, 2022, 55 (7-8) : 861 - 868
  • [6] OUTLIER DETECTION IN THE STATE-SPACE MODEL
    CHIB, S
    TIWARI, RC
    STATISTICS & PROBABILITY LETTERS, 1994, 20 (02) : 143 - 148
  • [7] An optimization model for Outlier detection in categorical data
    He, ZY
    Deng, SC
    Xu, XF
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 400 - 409
  • [8] Bayesian outlier detection in Capital Asset Pricing Model
    De Giuli, Maria Elena
    Maggi, Mario Alessandro
    Tarantola, Claudia
    STATISTICAL MODELLING, 2010, 10 (04) : 375 - 390
  • [9] Improvement on the Innovational Outlier Detection Procedure in a Bilinear Model
    Mohamed, I. B.
    Ismail, M. I.
    Yahya, M. S.
    Hussin, A. G.
    Mohamed, N.
    Zaharim, A.
    Zainol, M. S.
    SAINS MALAYSIANA, 2011, 40 (02) : 191 - 196
  • [10] Simplified outlier detection for improving the robustness of a fuzzy model
    Jin, Yali
    Cao, Weihua
    Wu, Min
    Yuan, Yan
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (04)