Efficient attribute-oriented generalization for knowledge discovery from large databases

Cited by: 38
Authors
Carter, CL [1]
Hamilton, HJ [1]
Affiliations
[1] Univ Regina, Dept Comp Sci, Networks Ctr Excellence Program, Ctr Excellence Lab,IRIS, Regina, SK S4S 0A2, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
knowledge discovery from databases; data mining; attribute-oriented induction
DOI
10.1109/69.683752
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present GDBR (Generalize DataBase Relation) and FIGR (Fast, Incremental Generalization and Regeneralization), two enhancements of Attribute-Oriented Generalization, a well-known technique for knowledge discovery from databases. GDBR and FIGR are both O(n) and, as such, are optimal. GDBR is an on-line algorithm and requires only a small, constant amount of space. FIGR also requires a constant amount of space, which is generally reasonable but may grow large under certain circumstances. FIGR is incremental, allowing changes to the database to be reflected in the generalization results without rereading input data. FIGR also allows fast regeneralization to both higher and lower levels of generality without rereading input. We compare GDBR and FIGR to two previous algorithms, LCHR and AOI, which are O(n log n) and O(np), respectively, where n is the number of input tuples and p is the number of tuples in the generalized relation. Both LCHR and AOI require O(n) space, which causes memory problems for large input. We implemented all four algorithms and ran empirical tests, and we found that GDBR and FIGR are faster. In addition, their runtimes increase only linearly as input size increases, while the runtimes of LCHR and AOI increase greatly when input size exceeds memory limitations.
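As a rough illustration of the general idea behind attribute-oriented generalization, the Python sketch below replaces each attribute value with a higher-level concept and merges tuples that become identical, keeping a count for each merged tuple. It is not the authors' GDBR or FIGR algorithm. The concept hierarchies, the sample rows, and the names `HIERARCHIES`, `climb`, and `generalize` are all hypothetical and chosen only for this example.

```python
from collections import Counter

# Hypothetical concept hierarchies: each maps a value to its parent concept.
HIERARCHIES = {
    "city": {
        "Regina": "Saskatchewan",
        "Saskatoon": "Saskatchewan",
        "Calgary": "Alberta",
        "Saskatchewan": "Canada",
        "Alberta": "Canada",
    },
    "major": {"CS": "Science", "Math": "Science", "History": "Arts"},
}


def climb(value, parents, steps):
    """Return the ancestor of `value` that is `steps` levels up, stopping at the root."""
    for _ in range(steps):
        if value not in parents:
            break
        value = parents[value]
    return value


def generalize(tuples, attributes, steps):
    """Read the input once, merging identical generalized tuples and counting them."""
    counts = Counter()
    for row in tuples:
        key = tuple(
            climb(value, HIERARCHIES[attr], steps[attr])
            for attr, value in zip(attributes, row)
        )
        counts[key] += 1
    return counts


# Illustrative input relation (made-up data).
rows = [
    ("Regina", "CS"),
    ("Saskatoon", "Math"),
    ("Calgary", "History"),
    ("Regina", "Math"),
]
result = generalize(rows, ("city", "major"), {"city": 1, "major": 1})
for generalized_tuple, count in sorted(result.items()):
    print(generalized_tuple, count)
```

In this sketch each input tuple is read once and merged through a hash-based counter, so only the generalized tuples and their counts are held in memory rather than the full input. Whether this matches the time and space behavior claimed for GDBR and FIGR depends on details of the paper that the abstract does not give.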
Pages: 193-208
Page count: 16