Model-based exception mining for object-relational data

被引:0
|
作者
Fatemeh Riahi
Oliver Schulte
机构
[1] Simon Fraser University,
来源
关键词
Outlier detection; Exception mining; Statistical-relational learning; Bayesian network; Likelihood ratio; Network data;
D O I
暂无
中图分类号
学科分类号
摘要
This paper develops model-based exception mining and outlier detection for the case of object-relational data. Object-relational data represent a complex heterogeneous network, which comprises objects of different types, links among these objects, also of different types, and attributes of these links. We follow the well-established exceptional model mining (EMM) framework, which has been previously applied for subgroup discovery in propositional data; our novel contribution is to develop EMM for relational data. EMM leverages machine learning models for exception mining: An object is exceptional to the extent that a model learned for the object data differs from a model learned for the general population. In relational data, EMM can therefore be used for detecting single outlier or exceptional objects. We combine EMM with state-of-the-art statistical-relational model discovery methods for constructing a graphical model (Bayesian network), that compactly represents probabilistic associations in the data. We investigate several outlierness metrics, based on the learned object-relational model, that quantify the extent to which the association pattern of a potential outlier object deviates from that of the whole population. Our method is validated on synthetic data sets and on real-world data sets about soccer and hockey matches, IMDb movies and mutagenic compounds. Compared to baseline methods, the EMM approach achieved the best detection accuracy when combined with a novel outlinerness metric. An empirical evaluation on soccer and movie data shows a strong correlation between our novel outlierness metric and success metrics: Individuals that our metric marks out as unusual tend to have unusual success.
引用
收藏
页码:681 / 722
页数:41
相关论文
共 50 条
  • [31] SHAME - AN OBJECT-RELATIONAL FORMULATION
    SPERO, MH
    PSYCHOANALYTIC STUDY OF THE CHILD, 1984, 39 : 259 - 282
  • [32] Research of the object-relational mapping based on NHibernate framework
    Tan, Ran
    Xiong, Menghua
    DCABES 2007 PROCEEDINGS, VOLS I AND II, 2007, : 951 - 954
  • [33] Converting Relational Databases into Object-relational Databases
    Maatuk, Abdelsalam
    Ali, M. Akhtar
    Rossiter, Nick
    JOURNAL OF OBJECT TECHNOLOGY, 2010, 9 (02): : 145 - 161
  • [34] Object-relational queries into multidimensional databases with the active data repository
    Ferreira, Renato
    Parallel Processing Letters, 1999, 9 (02): : 173 - 195
  • [35] A conceptual model for temporal data warehouses and its transformation to the ER and the object-relational models
    Malinowski, E.
    Zimanyi, E.
    DATA & KNOWLEDGE ENGINEERING, 2008, 64 (01) : 101 - 133
  • [36] Issues of Object-Relational View design in data warehousing environment
    Gopalkrishnan, V
    Li, Q
    Karlapalem, K
    1998 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5, 1998, : 2732 - 2737
  • [37] Implementing spatial data warehouse hierarchies in object-relational DBMSs
    Malinowski, Elzbieta
    Zimanyi, Esteban
    ICEIS 2007: PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2007, : 186 - 191
  • [38] Object-relational databases: the next wave in pharmaceutical data management
    Cargill, JF
    MacCuish, NE
    DRUG DISCOVERY TODAY, 1998, 3 (12) : 547 - 551
  • [39] Concurrent data materialization for object-relational database with semantic metadata
    Fong, J
    Pang, R
    Fong, A
    Pang, F
    Poon, K
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2003, 13 (03) : 257 - 291
  • [40] Storing and maintaining semistructured data efficiently in an object-relational database
    Mo, YY
    Ling, TW
    WISE 2002: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, 2002, : 247 - 256