Exploiting domain knowledge to detect outliers

Cited by: 12
Authors
Angiulli, Fabrizio [1]
Fassetti, Fabio [1]
Affiliations
[1] Univ Calabria, DIMES Dept, I-87036 Cosenza, Italy
Keywords
Outlier detection; Unsupervised methods; Knowledge representation; Concept learning; Inference
DOI
10.1007/s10618-013-0310-5
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel definition of outlier that aims to embed available domain knowledge in the outlier discovery process. Specifically, given background knowledge encoded as a set of first-order rules, together with a set of positive and negative examples, our approach aims to single out the examples that show abnormal behavior. The proposed technique is unsupervised, since no examples of normal or abnormal behavior are given, although it has connections with supervised learning because it is based on induction from examples. We provide a notion of compliance of a set of facts with respect to background knowledge and a set of examples, which is exploited to detect the examples that prevent the induced hypothesis from generalizing better. By testing compliance with respect to both the direct and the dual concept, we are able to distinguish three kinds of abnormality: irregular, anomalous, and outlier observations. This allows us to provide a finer characterization of the anomaly at hand and to single out subtle forms of anomalies. Moreover, we can explain why an observation is abnormal, making the reason for its exceptionality intelligible. We present both exact and approximate algorithms for mining abnormalities. The approximate algorithms improve execution time while guaranteeing good accuracy. Finally, we discuss peculiarities of the novel approach, present examples of mined knowledge, analyze the scalability of the algorithms, and compare the approach with noise-handling mechanisms and some alternative approaches.
Pages: 519-568
Page count: 50
Related papers
50 in total
  • [1] Exploiting domain knowledge to detect outliers
    Fabrizio Angiulli
    Fabio Fassetti
    Data Mining and Knowledge Discovery, 2014, 28 : 519 - 568
  • [2] Exploiting Domain Knowledge for Object Discovery
    Collet, Alvaro
    Xiong, Bo
    Gurau, Corina
    Hebert, Martial
    Srinivasa, Siddhartha S.
    2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 2118 - 2125
  • [3] Exploiting domain knowledge for approximate diagnosis
    ten Teije, A
    van Harmelen, F
    IJCAI-97 - PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 AND 2, 1997, : 454 - 459
  • [4] EXPLOITING DOMAIN KNOWLEDGE IN IC CELL LAYOUT
    KIM, JH
    MCDERMOTT, J
    SIEWIOREK, DP
    IEEE DESIGN & TEST OF COMPUTERS, 1984, 1 (03): : 52 - 64
  • [5] Exploiting Domain Knowledge in Making Delegation Decisions
    Emele, Chukwuemeka David
    Norman, Timothy J.
    Sensoy, Murat
    Parsons, Simon
    AGENTS AND DATA MINING INTERACTION, 2012, 7103 : 117 - +
  • [6] Exploiting Saliency Filters and Domain knowledge for Saliency
    Zeng, Jianqin
    Chen, Wei
    Zhang, Guangzheng
    Guo, Kai
    PROGRESS IN MECHATRONICS AND INFORMATION TECHNOLOGY, PTS 1 AND 2, 2014, 462-463 : 410 - 415
  • [7] Exploiting Domain Knowledge to Forecast Heating Oil Consumption
    Corliss, George F.
    Sakauchi, Tsuginosuke
    Vitullo, Steven R.
    Brown, Ronald H.
    ADVANCES IN MATHEMATICAL AND COMPUTATIONAL METHODS: ADDRESSING MODERN CHALLENGES OF SCIENCE, TECHNOLOGY, AND SOCIETY, 2011, 1368
  • [8] A Statistical Model to Detect DRG Outliers
    Lin, Shuguang
    Rouse, Paul
    Wang, Ying-Ming
    Zhang, Fan
    IEEE ACCESS, 2022, 10 : 28717 - 28724
  • [9] Exploiting Domain Knowledge as Causal Independencies in Modeling Gestational Diabetes
    Mathur, Saurabh
    Karanam, Athresh
    Radivojac, Predrag
    Haas, David M.
    Kersting, Kristian
    Natarajan, Sriraam
    BIOCOMPUTING 2023, PSB 2023, 2023, : 359 - 370
  • [10] Exploiting Domain Knowledge by Automated Taxonomy Generation in Recommender Systems
    Li, Tao
    Anand, Sarabjot S.
    E-COMMERCE AND WEB TECHNOLOGIES, PROCEEDINGS, 2009, 5692 : 120 - 131