Efficient Discovery of the Most Interesting Associations

被引:15
|
作者
Webb, Geoffrey I. [1 ]
Vreeken, Jilles [2 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[2] Univ Antwerp, Dept Math & Comp Sci, B-2020 Antwerp, Belgium
基金
澳大利亚研究理事会;
关键词
Association mining; itemset mining; interestingness; statistical association mining; ALGORITHM; PATTERN; RULES;
D O I
10.1145/2601433
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Self-sufficient itemsets have been proposed as an effective approach to summarizing the key associations in data. However, their computation appears highly demanding, as assessing whether an itemset is self-sufficient requires consideration of all pairwise partitions of the itemset into pairs of subsets as well as consideration of all supersets. This article presents the first published algorithm for efficiently discovering self-sufficient itemsets. This branch-and-bound algorithm deploys two powerful pruning mechanisms based on upper bounds on itemset value and statistical significance level. It demonstrates that finding top-k productive and nonredundant itemsets, with postprocessing to identify those that are not independently productive, can efficiently identify small sets of key associations. We present extensive evaluation of the strengths and limitations of the technique, including comparisons with alternative approaches to finding the most interesting associations.
引用
收藏
页数:31
相关论文
共 50 条
  • [41] A most interesting young man (Bob Dylan)
    Doyle, Brian
    AMERICAN SCHOLAR, 2008, 77 (03): : 18 - 19
  • [42] THE MOST INTERESTING FORM OF LIE + NAZI ARCHITECTURE
    OCKMAN, J
    OPPOSITIONS, 1981, (24): : 38 - 47
  • [43] MOST INTERESTING JOURNAL ARTICLE IN 47 YEARS
    HOWZE, HR
    AMERICAN BAR ASSOCIATION JOURNAL, 1962, 48 (04): : 304 - &
  • [44] Finding interesting associations without support pruning
    Cohen, E
    Datar, M
    Fujiwara, S
    Gionis, A
    Indyk, P
    Motwani, R
    Ullman, JD
    Yang, C
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2001, 13 (01) : 64 - 78
  • [45] Discovering Interesting Associations in Gestation Course Data
    Skarga-Bandurova, Inna
    Biloborodova, Tetiana
    Nesterov, Maksym
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017), 2017, 10423 : 204 - 214
  • [46] Bisociative Discovery of Interesting Relations between Domains
    Nagel, Uwe
    Thiel, Kilian
    Koetter, Tobias
    Piatek, Dawid
    Berthold, Michael R.
    ADVANCES IN INTELLIGENT DATA ANALYSIS X: IDA 2011, 2011, 7014 : 306 - 317
  • [47] Fast discovery of interesting collections of web services
    Zhu, Zhou
    Bailey, James
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, : 152 - +
  • [48] METHODS PROPOSED IN 1975 - REPORT ON MOST INTERESTING CASES
    AMENDE, HV
    MARTEN, J
    VANDENWEGHE, H
    LANDTECHNIK, 1976, 31 (06): : 245 - 249
  • [49] N-Most Interesting Closed Itemset Mining
    Songrarn, Panida
    Boonjing, Veera
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 619 - 624
  • [50] CRITERIA FOR JOB SATISFACTION - IS INTERESTING WORK MOST IMPORTANT
    WHITE, BJ
    MONTHLY LABOR REVIEW, 1977, 100 (05) : 30 - 35