Linear correlation discovery in databases: a data mining approach

被引:11
|
作者
Chiang, RHL
Cecil, CEH
Lim, EP
机构
[1] Univ Cincinnati, Coll Business, Dept Informat Syst, Cincinnati, OH 45221 USA
[2] Nanyang Technol Univ, Nanyang Business Sch, Singapore 639798, Singapore
[3] Nanyang Technol Univ, Sch Comp Engn, Ctr Adv Informat Syst, Singapore 639798, Singapore
关键词
knowledge discovery in database; linear correlation; association measurement; data mining;
D O I
10.1016/j.datak.2004.09.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Very little research in knowledge discovery has studied how to incorporate statistical methods to automate linear correlation discovery (LCD). We present an automatic LCD methodology that adopts statistical measurement functions to discover correlations from databases' attributes. Our methodology automatically pairs attribute groups having potential linear correlations, measures the linear correlation of each pair of attribute groups, and confirms the discovered correlation. The methodology is evaluated in two sets of experiments. The results demonstrate the methodology's ability to facilitate linear correlation discovery for databases with a large amount of data. (c) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:311 / 337
页数:27
相关论文
共 50 条
  • [41] An XML approach to knowledge discovery in databases
    Kotásek, P
    Zendulka, J
    KNOWLEDGE-BASED SOFTWARE ENGINEERING, 2000, 62 : 141 - 148
  • [42] Data mining of software development databases
    Khoshgoftaar, TM
    Allen, EB
    Jones, WD
    Hudepohl, JP
    SOFTWARE QUALITY JOURNAL, 2001, 9 (03) : 161 - 176
  • [43] Data Mining of Software Development Databases
    Taghi M. Khoshgoftaar
    Edward B. Allen
    Wendell D. Jones
    John P. Hudepohl
    Software Quality Journal, 2001, 9 : 161 - 176
  • [44] Gene expression databases and data mining
    Anderle, P
    Duval, M
    Draghici, S
    Kuklin, A
    Littlejohn, TG
    Medrano, JE
    Vilanova, D
    Roberts, MA
    BIOTECHNIQUES, 2003, : 36 - 44
  • [45] Data Mining in Multimodal Medical Databases
    Strungaru, Rodica
    Ungureanu, G. Mihaela
    Murri, Roberto
    Pasqualli, Clara
    Seidel, Klaus
    Datcu, Mihai
    Stanciu, Radu
    INTEGRATING BIOMEDICAL INFORMATION: FROM E-CELL TO E-PATIENT, 2006, : 85 - +
  • [46] DATA-MINING CHESS DATABASES
    Bleicher, E.
    Haworth, G. Mc C.
    van der Heijden, H. M. J. F.
    ICGA JOURNAL, 2010, 33 (04) : 212 - 214
  • [47] Data mining and modeling in scientific databases
    Kapetanios, E
    Norrie, MC
    NINTH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 1997, : 24 - 27
  • [48] A hybrid approach to rule discovery in databases
    Zhong, N
    Dong, JZ
    Ohsuga, S
    INFORMATION SCIENCES, 2000, 126 (1-4) : 99 - 127
  • [49] Databases and data mining for computational vaccinology
    Flower, DR
    CURRENT OPINION IN DRUG DISCOVERY & DEVELOPMENT, 2003, 6 (03) : 396 - 400
  • [50] Data mining for distributed Databases with multiagents
    Niimi, A
    Konishi, O
    KNOWLEDGE-BASED INTELLIGNET INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2003, 2774 : 1412 - 1418