Linear correlation discovery in databases: a data mining approach

Cited by: 11
Authors
Chiang, RHL
Cecil, CEH
Lim, EP
Affiliations
[1] Univ Cincinnati, Coll Business, Dept Informat Syst, Cincinnati, OH 45221 USA
[2] Nanyang Technol Univ, Nanyang Business Sch, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
Keywords
knowledge discovery in database; linear correlation; association measurement; data mining
DOI
10.1016/j.datak.2004.09.002
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Very little research in knowledge discovery has studied how to incorporate statistical methods to automate linear correlation discovery (LCD). We present an automatic LCD methodology that adopts statistical measurement functions to discover correlations from databases' attributes. Our methodology automatically pairs attribute groups having potential linear correlations, measures the linear correlation of each pair of attribute groups, and confirms the discovered correlation. The methodology is evaluated in two sets of experiments. The results demonstrate the methodology's ability to facilitate linear correlation discovery for databases with a large amount of data. (c) 2004 Elsevier B.V. All rights reserved.
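The pair-and-measure idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' methodology: it pairs single numeric attributes rather than attribute groups, uses the Pearson coefficient as the measurement function, and omits the confirmation step. The names `pearson_r` and `discover_linear_pairs`, the sample `table`, and the 0.9 threshold are all hypothetical choices made for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric columns."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("columns must have the same length (at least 2)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        # A constant column has no defined correlation; report 0.0 here
        # (an assumption of this sketch, not a statistical convention).
        return 0.0
    return sxy / sqrt(sxx * syy)


def discover_linear_pairs(table, threshold=0.9):
    """Return (attr_a, attr_b, r) for every attribute pair with |r| >= threshold.

    `table` maps attribute names to columns of numbers. The threshold is an
    illustrative assumption, not a value taken from the paper.
    """
    found = []
    for a, b in combinations(sorted(table), 2):
        r = pearson_r(table[a], table[b])
        if abs(r) >= threshold:
            found.append((a, b, r))
    return found


# Hypothetical data: "tax" is an exact linear function of "price".
table = {
    "price": [10.0, 20.0, 30.0, 40.0],
    "tax": [1.0, 2.0, 3.0, 4.0],
    "stock": [5.0, 3.0, 8.0, 1.0],
}
pairs = discover_linear_pairs(table)
for a, b, r in pairs:
    print(f"{a} ~ {b}: r = {r:.3f}")
```

Checking every pair costs time quadratic in the number of attributes, which is one reason a methodology aimed at large databases would need to narrow the candidate pairs before measuring them.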
Pages: 311-337
Number of pages: 27