Combining heterogeneous classifiers for relational databases

Cited by: 10
Authors
Manjunath, Geetha [1]
Murty, M. Narasimha [1]
Sitaram, Dinkar [2]
Affiliations
[1] Indian Inst Sci, Dept CSA, Bangalore 560012, Karnataka, India
[2] Hewlett Packard Corp, STSD, Bangalore, Karnataka, India
Keywords
Heterogeneous classifier; RDF; Relational data; RDBMS; Classification
DOI
10.1016/j.patcog.2012.06.015
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Practical usage of machine learning is gaining strategic importance in enterprises looking for business intelligence. However, most enterprise data is distributed across multiple relational databases with expert-designed schemas. Applying traditional single-table machine learning techniques to such data not only incurs a computational penalty for converting it to a flat form (mega-join), but also loses the human-specified semantic information present in the relations. In this paper, we present a practical, two-phase hierarchical meta-classification algorithm for relational databases with a semantic divide-and-conquer approach. We propose a recursive prediction aggregation technique over heterogeneous classifiers applied to individual database tables. The proposed algorithm was evaluated on three diverse datasets, namely the TPCH, PKDD and UCI benchmarks, and showed a considerable reduction in classification time without any loss of prediction accuracy. (C) 2012 Elsevier Ltd. All rights reserved.
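The general idea of training a separate classifier per table and then aggregating their predictions, instead of learning over one mega-join, can be illustrated with a toy sketch. This is not the paper's algorithm: the two classifiers, the table and column names (`customers`, `orders`, `segment`, `amount`), the sample rows, and the simple averaging rule are all assumptions made for illustration, and only one level of foreign-key aggregation is shown rather than a recursive one.

```python
from collections import Counter, defaultdict


class MajorityByValue:
    """Toy categorical classifier: class frequencies seen for a feature value."""

    def fit(self, xs, ys):
        self.prior = Counter(ys)
        self.counts = defaultdict(Counter)
        for x, y in zip(xs, ys):
            self.counts[x][y] += 1
        return self

    def predict_proba(self, x):
        # Fall back to the class prior for values never seen in training.
        c = self.counts.get(x) or self.prior
        total = sum(c.values())
        return {label: c[label] / total for label in self.prior}


class NearestCentroid:
    """Toy numeric classifier: score classes by inverse distance to class mean."""

    def fit(self, xs, ys):
        sums, ns = Counter(), Counter()
        for x, y in zip(xs, ys):
            sums[y] += x
            ns[y] += 1
        self.centroids = {y: sums[y] / ns[y] for y in ns}
        return self

    def predict_proba(self, x):
        inv = {y: 1.0 / (abs(x - c) + 1e-9) for y, c in self.centroids.items()}
        total = sum(inv.values())
        return {y: v / total for y, v in inv.items()}


def average(dists, labels):
    """Average class distributions; uniform if there is nothing to average."""
    if not dists:
        return {l: 1.0 / len(labels) for l in labels}
    return {l: sum(d.get(l, 0.0) for d in dists) / len(dists) for l in labels}


def fit_per_table(customers, orders):
    """Phase 1 (assumed): one classifier per table, no join materialised."""
    label_of = {c["id"]: c["label"] for c in customers}
    labels = sorted(set(label_of.values()))
    cust_clf = MajorityByValue().fit(
        [c["segment"] for c in customers], [c["label"] for c in customers]
    )
    # Rows of the child table inherit the label of the parent they reference.
    order_clf = NearestCentroid().fit(
        [o["amount"] for o in orders], [label_of[o["customer_id"]] for o in orders]
    )
    return cust_clf, order_clf, labels


def predict(model, customer, customer_orders):
    """Phase 2 (assumed): aggregate child-row predictions, then combine tables."""
    cust_clf, order_clf, labels = model
    own = cust_clf.predict_proba(customer["segment"])
    child = average(
        [order_clf.predict_proba(o["amount"]) for o in customer_orders], labels
    )
    combined = average([own, child], labels)
    return max(labels, key=lambda l: combined[l])


# Hypothetical sample data, invented for this sketch.
customers = [
    {"id": 1, "segment": "retail", "label": "low"},
    {"id": 2, "segment": "retail", "label": "low"},
    {"id": 3, "segment": "corporate", "label": "high"},
    {"id": 4, "segment": "corporate", "label": "high"},
]
orders = [
    {"customer_id": 1, "amount": 10.0},
    {"customer_id": 2, "amount": 20.0},
    {"customer_id": 3, "amount": 200.0},
    {"customer_id": 4, "amount": 220.0},
    {"customer_id": 3, "amount": 180.0},
]
model = fit_per_table(customers, orders)
```

Calling `predict(model, {"segment": "corporate"}, [{"amount": 190.0}])` combines the customer-table prediction with the averaged order-table predictions. A real system would presumably replace the plain average with a learned meta-classifier and follow foreign keys through more than one level.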
Pages: 317-324
Number of pages: 8
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 1727 - 1733