A predictive DEA model for outlier detection

Cited by: 8
Authors
Yang, Mingwen [1, 2]
Wan, Guohua [1]
Zheng, Eric [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Antai Coll Econ & Management, Shanghai 200030, Peoples R China
[2] Univ Texas Dallas, Naveen Jindal Sch Management, Richardson, TX 75080 USA
Keywords
predictive DEA; Bi-super DEA; outlier detection; simulation
DOI
10.1080/23270012.2014.889911
Chinese Library Classification (CLC) number
F [Economics]
Discipline classification code
02
Abstract
Outlier detection is a key issue in data-driven analytics. In this paper, we propose Bi-super DEA, a super DEA-based method that constructs both efficient and inefficient frontiers for outlier detection. To evaluate its predictive performance, we develop a novel predictive DEA procedure, PDEA, which extends conventional DEA approaches, primarily used for in-sample efficiency estimation, to predict outputs for out-of-sample observations. This enables us to compare the predictive performance of our approach against several popular outlier detection methods, including parametric robust regression from statistics and non-parametric k-means from data mining. We conduct comprehensive simulation experiments to examine the relative performance of these outlier detection methods under the influence of five factors: sample size, linearity of the production function, normality of the noise distribution, homogeneity of the data, and the level of random noise contaminating the data generating process (DGP). We find that, somewhat surprisingly, Bi-super CCR consistently outperforms Bi-super BCC in detecting outliers. Under the linearity, normality, and homogeneity conditions, the parametric robust regression method works best. However, when the DGP violates these conditions, Bi-super DEA emerges as the better choice due to its distribution-free property. Our results shed light on the conditions under which each method excels or fails and provide users with practical guidelines for choosing appropriate methods to detect outliers.
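As a rough illustration of the super-efficiency idea that the abstract builds on, the sketch below scores each decision-making unit (DMU) against a frontier formed by all the other units. It is not the authors' Bi-super DEA or PDEA procedure: it covers only the efficient-frontier side, and it assumes a single input and a single output with strictly positive values, in which case the input-oriented CCR linear program reduces to a ratio of productivities. The function names, the sample data, and the cutoff of 1.5 are all hypothetical choices made for the example.

```python
def super_efficiency_ccr_1x1(inputs, outputs):
    """Super-efficiency CCR scores for one input and one output.

    Under constant returns to scale, the score of unit o is its
    productivity (output / input) divided by the best productivity
    among all OTHER units. Scores above 1 mean the unit lies beyond
    the frontier spanned by its peers.
    Assumes strictly positive inputs and outputs (illustrative only).
    """
    ratios = [y / x for x, y in zip(inputs, outputs)]
    scores = []
    for o, r_o in enumerate(ratios):
        # Exclude unit o itself when building the reference frontier.
        best_peer = max(r for j, r in enumerate(ratios) if j != o)
        scores.append(r_o / best_peer)
    return scores


def flag_outliers(scores, threshold=1.5):
    """Return indices whose score exceeds a hypothetical cutoff."""
    return [i for i, s in enumerate(scores) if s > threshold]


# Hypothetical data: the fourth unit is far more productive than the rest.
inputs = [2.0, 4.0, 5.0, 2.0]
outputs = [2.0, 4.4, 4.5, 6.0]

scores = super_efficiency_ccr_1x1(inputs, outputs)
flagged = flag_outliers(scores)
print(scores)
print(flagged)
```

With several inputs or outputs the ratio shortcut no longer applies, and each score would instead come from solving a linear program per unit with that unit removed from the reference set.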
Pages: 20-41
Page count: 22