Knowledge discovery interestingness measures based on unexpectedness

被引：19

作者：

Kontonasios, Kleanthis-Nikolaos ^{[1
]}

Spyropoulou, Eirini ^{[1
]}

De Bie, Tijl ^{[1
]}

机构：

[1] Univ Bristol, Intelligent Syst Lab, Bristol, Avon, England

来源：

WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY | 2012年 / 2卷 / 05期

基金：

英国工程与自然科学研究理事会;

关键词：

PATTERNS;

D O I：

10.1002/widm.1063

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Knowledge discovery methods often discover a large number of patterns. Although this can be considered of interest, it certainly presents considerable challenges too. Indeed, this set of patterns often contains lots of uninteresting patterns that risk overwhelming the data miner. In addition, a single interesting pattern can be discovered in a multitude of tiny variations that for all practical purposes are redundant. These issues are referred to as the pattern explosion problem. They lie at the basis of much recent research attempting to quantify interestingness and redundancy between patterns, with the purpose of filtering down a large pattern set to an interesting and compact subset. Many diverse approaches to interestingness and corresponding interestingness measures (IMs) have been proposed in the literature. Some of them, named objective IMs, define interestingness only based on objective criteria of the pattern and data at hand. Subjective IMs additionally depend on the user's prior knowledge about the dataset. Formalizing unexpectedness is probably the most common approach for defining subjective IMs, where a pattern is deemed unexpected if it contradicts the user's expectations about the dataset. Such subjective IMs based on unexpectedness form the focus of this paper. We categorize measures based on unexpectedness into two major subgroups, namely, syntactical and probabilistic approaches. Based on this distinction, we survey different methods for assessing the unexpectedness of patterns with a special focus on frequent itemsets, tiles, association rules, and classification rules. (c) 2012 Wiley Periodicals, Inc.

引用

页码：386 / 399

页数：14

共 50 条

[1] Unexpectedness as a measure of interestingness in knowledge discovery
Padmanabhan, B
Tuzhilin, A
DECISION SUPPORT SYSTEMS, 1999, 27 (03) : 303 - 318
[2] A survey of interestingness measures for knowledge discovery
McGarry, K
KNOWLEDGE ENGINEERING REVIEW, 2005, 20 (01): : 39 - 61
[3] Development of Subjective Measures of Interestingness: From Unexpectedness to Shocking
Yafi, Eiad
Alam, M. A.
Biswas, Ranjit
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 368 - +
[4] Evaluation of rule interestingness measures in medical knowledge discovery in databases
Ohsaki, Miho
Hidenao, Abe
Tsumoto, Shusaku
Yokoi, Hideto
Yamaguchi, Takahira
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2007, 41 (03) : 177 - 196
[5] A Study of Interestingness Measures for Knowledge Discovery in Databases-A Genetic Approach
Garima, Goyal
Vashishtha, Jyoti
COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 2, 2015, 32 : 69 - 79
[6] Interestingness of discovered association rules in terms of neighborhood-based unexpectedness
Dong, GZ
Li, JY
RESEARCH AND DEVELOPMENT IN KNOWLEDGE DISCOVERY AND DATA MINING, 1998, 1394 : 72 - 86
[7] Interestingness Measures for Classification Based on Association Rules
Nguyen, Loan T. T.
Bay Vo
Hong, Tzung-Pei
Hoang Chi Thanh
COMPUTATIONAL COLLECTIVE INTELLIGENCE - TECHNOLOGIES AND APPLICATIONS, PT II, 2012, 7654 : 383 - 392
[8] Heuristic measures of interestingness
Hilderman, RJ
Hamilton, HJ
PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 232 - 241
[9] On rule interestingness measures
Freitas, AA
KNOWLEDGE-BASED SYSTEMS, 1999, 12 (5-6) : 309 - 315
[10] A clustering of interestingness measures
Vaillant, B
Lenca, P
Lallich, S
DISCOVERY SCIENCE, PROCEEDINGS, 2004, 3245 : 290 - 297

← 1 2 3 4 5 →