Mining optimized support rules for numeric attributes

被引:13
|
作者
Rastogi, R
Shim, K
机构
[1] Bell Labs, Murray Hill, NJ 07974 USA
[2] Korea Adv Inst Sci & Technol, Taejon 305701, South Korea
[3] Adv Informat Technol Res Ctr, Taejon 305701, South Korea
关键词
optimized association rules; data mining; knowledge discovery;
D O I
10.1016/S0306-4379(01)00026-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining association rules on large data sets have received considerable attention in recent years. Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support, confidence or gain of the rule is maximized. In this paper, we generalize the optimized support association rule problem by permitting rules to contain disjunctions over uninstantiated numeric attributes. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving the uninstantiated attribute. For rules containing a single numeric attribute, we present a dynamic programming algorithm for computing optimized association rules. Furthermore, we propose bucketing technique for reducing the input size, and a divide and conquer strategy that improves the performance significantly without sacrificing optimality. We also present approximation algorithms based on dynamic programming for two numeric attributes. Our experimental results for a single numeric attribute indicate that our bucketing and divide and conquer enhancements are very effective in reducing the execution times and memory requirements of our dynamic programming algorithm. Furthermore, they show that our algorithms scale up almost linearly with the attribute's domain size as well as the number of disjunctions. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:425 / 444
页数:20
相关论文
共 50 条
  • [41] SOAP: Efficient feature selection of numeric attributes
    Ruiz, R
    Aguilar-Ruiz, JS
    Riquelme, JC
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2002, PROCEEDINGS, 2002, 2527 : 233 - 242
  • [42] Study of Association Rules Mining AlgorithmS Based on Adaptive Support
    He Yueshun
    Du Ping
    PROGRESS IN MEASUREMENT AND TESTING, PTS 1 AND 2, 2010, 108-111 : 436 - 440
  • [43] DISTRIBUTED MINING OF ASSOCIATION RULES BASED ON REDUCING THE SUPPORT THRESHOLD
    Boutsinas, Basilis
    Siotos, Costas
    Gerolimatos, Antonis
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (06) : 1109 - 1129
  • [44] Mining high coherent association rules with consideration of support measure
    Chen, Chun-Hao
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Lin, Yui-Kai
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (16) : 6531 - 6537
  • [45] Automated support specification for efficient mining of interesting association rules
    Lin, Wen-Yang
    Tseng, Ming-Cheng
    JOURNAL OF INFORMATION SCIENCE, 2006, 32 (03) : 238 - 250
  • [46] Technology of interpolation to determine the threshold of support in association rules mining
    Zhu Xijun
    Dai Yueming
    Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 1302 - 1304
  • [47] Mining Partially-Ordered Episode Rules with the Head Support
    Chen, Yangming
    Fournier-Viger, Philippe
    Nouioua, Farid
    Wu, Youxi
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2021), 2021, 12925 : 266 - 271
  • [48] A clustering algorithm with genetically optimized membership functions for fuzzy association rules mining
    Kaya, M
    Alhajj, R
    PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 881 - 886
  • [49] Mining classification rules from datasets with large number of many-valued attributes
    Giuffrida, G
    Chu, WW
    Hanssens, DM
    ADVANCES IN DATABSE TECHNOLOGY-EDBT 2000, PROCEEDINGS, 2000, 1777 : 335 - 349
  • [50] Mining association rules from databases with continuous attributes using Genetic Network Programming
    Taboada, Karla
    Gonzales, Eloy
    Shimada, Kaoru
    Mabu, Shingo
    Hirasawa, Kotaro
    Hu, Jinglu
    2007 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-10, PROCEEDINGS, 2007, : 1311 - 1317