Mining optimized support rules for numeric attributes

被引:13
|
作者
Rastogi, R
Shim, K
机构
[1] Bell Labs, Murray Hill, NJ 07974 USA
[2] Korea Adv Inst Sci & Technol, Taejon 305701, South Korea
[3] Adv Informat Technol Res Ctr, Taejon 305701, South Korea
关键词
optimized association rules; data mining; knowledge discovery;
D O I
10.1016/S0306-4379(01)00026-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mining association rules on large data sets have received considerable attention in recent years. Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support, confidence or gain of the rule is maximized. In this paper, we generalize the optimized support association rule problem by permitting rules to contain disjunctions over uninstantiated numeric attributes. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving the uninstantiated attribute. For rules containing a single numeric attribute, we present a dynamic programming algorithm for computing optimized association rules. Furthermore, we propose bucketing technique for reducing the input size, and a divide and conquer strategy that improves the performance significantly without sacrificing optimality. We also present approximation algorithms based on dynamic programming for two numeric attributes. Our experimental results for a single numeric attribute indicate that our bucketing and divide and conquer enhancements are very effective in reducing the execution times and memory requirements of our dynamic programming algorithm. Furthermore, they show that our algorithms scale up almost linearly with the attribute's domain size as well as the number of disjunctions. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:425 / 444
页数:20
相关论文
共 50 条
  • [1] Mining optimized support rules for numeric attributes
    Rastogi, R
    Shim, K
    15TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1999, : 206 - 215
  • [2] Mining optimized association rules for numeric attributes
    Fukuda, T
    Morimoto, Y
    Morishita, S
    Tokuyama, T
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1999, 58 (01) : 1 - 12
  • [3] Mining optimized gain rules for numeric attributes
    Brin, S
    Rastogi, R
    Shim, K
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (02) : 324 - 338
  • [4] Mining optimized association rules with categorical and numeric attributes
    Rastogi, R
    Shim, K
    14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 1998, : 503 - 512
  • [5] Mining optimized association rules with categorical and numeric attributes
    Rastogi, R
    Shim, K
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) : 29 - 50
  • [6] Mining two-dimensional optimized association rules and hidden patterns for numeric attributes
    He, Z
    Tian, SF
    Huang, HK
    Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1497 - 1500
  • [7] Handling Numeric Behavioral Attributes in Actionable Behavioral Rules Mining
    Su, Peng
    Yang, Jian
    Li, Zhenpeng
    Liu, Yuan
    PROCEEDINGS 2016 IEEE INTERNATIONAL CONFERENCE ON SERVICE OPERATIONS AND LOGISTICS, AND INFORMATICS (SOLI), 2016, : 178 - 183
  • [8] Distribution rules with numeric attributes of interest
    Jorge, Alipio M.
    Azevedo, Paulo J.
    Pereira, Fernando
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 247 - 258
  • [9] Mining numeric association rules with genetic algorithms
    Mata, J
    Alvarez, JL
    Riquelme, JC
    ARTIFICIAL NEURAL NETS AND GENETIC ALGORITHMS, 2001, : 264 - 267
  • [10] Association rules mining based on numeric constraint
    Yan, H. (Haiyan@ncwu.edu.cn), 1600, Advanced Institute of Convergence Information Technology, Myoungbo Bldg 3F,, Bumin-dong 1-ga, Seo-gu, Busan, 602-816, Korea, Republic of (04):