Model-based exception mining for object-relational data

被引:0
|
作者
Fatemeh Riahi
Oliver Schulte
机构
[1] Simon Fraser University,
来源
关键词
Outlier detection; Exception mining; Statistical-relational learning; Bayesian network; Likelihood ratio; Network data;
D O I
暂无
中图分类号
学科分类号
摘要
This paper develops model-based exception mining and outlier detection for the case of object-relational data. Object-relational data represent a complex heterogeneous network, which comprises objects of different types, links among these objects, also of different types, and attributes of these links. We follow the well-established exceptional model mining (EMM) framework, which has been previously applied for subgroup discovery in propositional data; our novel contribution is to develop EMM for relational data. EMM leverages machine learning models for exception mining: An object is exceptional to the extent that a model learned for the object data differs from a model learned for the general population. In relational data, EMM can therefore be used for detecting single outlier or exceptional objects. We combine EMM with state-of-the-art statistical-relational model discovery methods for constructing a graphical model (Bayesian network), that compactly represents probabilistic associations in the data. We investigate several outlierness metrics, based on the learned object-relational model, that quantify the extent to which the association pattern of a potential outlier object deviates from that of the whole population. Our method is validated on synthetic data sets and on real-world data sets about soccer and hockey matches, IMDb movies and mutagenic compounds. Compared to baseline methods, the EMM approach achieved the best detection accuracy when combined with a novel outlinerness metric. An empirical evaluation on soccer and movie data shows a strong correlation between our novel outlierness metric and success metrics: Individuals that our metric marks out as unusual tend to have unusual success.
引用
收藏
页码:681 / 722
页数:41
相关论文
共 50 条
  • [41] Performance Comparison Slowly Changing Dimensions using Model Relational and Object-Relational
    Urrutia Sepulveda, Angelica
    Cofre Loyola, Rodrigo
    Wilson Hernandez, Manuel
    2015 34TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2015,
  • [42] Interval sequences:: An object-relational approach to manage spatial data
    Kriegel, HP
    Pötke, M
    Seidl, T
    ADVANCES IN SPATIAL AND TEMPORAL DATABASES, PROCEEDINGS, 2001, 2121 : 481 - 501
  • [43] Knowledge based, data driven and object-relational workflow management for microarray processing pipeline
    Li, Xin
    CITSA 2007/CCCT 2007: INTERNATIONAL CONFERENCE ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS : INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL III, POST-CONFERENCE ISSUE, PROCEEDINGS, 2007, : 204 - 209
  • [44] Logical designs of object-relational databases
    Mok, WY
    CHALLENGES OF INFORMATION TECHNOLOGY MANAGEMENT IN THE 21ST CENTURY, 2000, : 900 - 901
  • [45] Modeling relationships in object-relational databases
    Soutou, C
    DATA & KNOWLEDGE ENGINEERING, 2001, 36 (01) : 79 - 107
  • [46] Implementation of object-relational DBMSs in a relational database course
    Wang, M
    PROCEEDINGS OF THE THIRTY-SECOND SIGCSE TECHNICAL SYMPOSIUM ON COMPUTER SCIENCE EDUCATION, 2001, 33 (01): : 367 - 370
  • [47] Object-Relational Implementation of Evidential Databases
    Bousnina, Fatma Ezzahra
    Elmi, Sayda
    Tobji, Mohamed Anis Bach
    Chebbah, Mouna
    HadjAli, Allel
    Ben Yaghlane, Boutheina
    2016 INTERNATIONAL CONFERENCE ON DIGITAL ECONOMY (ICDEC), 2016, : 80 - 87
  • [48] A Classification of Object-Relational Impedance Mismatch
    Ireland, Christopher
    Bowers, David
    Newton, Michael
    Waugh, Kevin
    2009 FIRST INTERNATIONAL CONFERENCE ON ADVANCES IN DATABASES, KNOWLEDGE, AND DATA APPLICATIONS, 2009, : 36 - 43
  • [49] XML content management based on object-relational database technology
    Surjanto, B
    Ritter, N
    Loeser, H
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL I, 2000, : 70 - 79
  • [50] Optimization Slowly Changing Dimensions of a Data Warehouse using Object-Relational
    Cofre Loyola, Rodrigo
    Urrutia Sepulveda, Angelica
    Wilson Hernandez, Manuel
    2015 34TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2015,