Evaluation of rule interestingness measures in medical knowledge discovery in databases

被引:39
|
作者
Ohsaki, Miho
Hidenao, Abe
Tsumoto, Shusaku
Yokoi, Hideto
Yamaguchi, Takahira
机构
[1] Doshisha Univ, Fac Engn, Kyotanabe Shi, Kyoto 6100321, Japan
[2] Shimane Univ, Dept Med Informat, Izumo, Shimane 6938501, Japan
[3] Kagawa Univ Hosp, Dept Med Informat, Miki, Kagawa 7610793, Japan
[4] Keio Univ, Fac Sci & Technol, Kohoku Ku, Kanagawa 2238522, Japan
关键词
data mining; knowledge discovery in databases; interestingness; postprocessing; clinical data;
D O I
10.1016/j.artmed.2007.07.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: We discuss the usefulness of rule interestingness measures for medical KDD through experiments using clinical datasets, and, based on the outcomes of these experiments, also consider how to utilize these measures in postprocessing. Methods and materials: We first conducted an experiment to compare the evaluation results derived from a total of 40 various interestingness measures with those supplied by a medical expert for rules discovered in a clinical dataset on meningitis. We calculated and compared the performance of each interestingness measure to estimate a medical expert's interest using f-measure and correlation coefficient. We then conducted a similar experiment for hepatitis. Results and conclusion: The comprehensive results of experiments on meningitis and hepatitis indicate that the interestingness measures, accuracy, chi-square measure for one quadrant, relative risk, uncovered negative, and peculiarity, have a stable, reasonable performance in estimating real human interest in the medical domain. The results also indicate that the performance of interestingness measures is influenced by the certainty of a hypothesis made by the medical expert, and that the combinational use of interestingness measures will contribute to support medical experts to generate and confirm their hypotheses through human-system interaction. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:177 / 196
页数:20
相关论文
共 50 条
  • [21] Knowledge discovery in databases
    Düsing, R
    WIRTSCHAFTSINFORMATIK, 2000, 42 (01): : 74 - 75
  • [22] Aggregation of valued relations applied to association rule interestingness measures
    Barthelemy, Jean-Pierre
    Legrain, Angelique
    Lenca, Philippe
    Vaillant, Benoit
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, 2006, 3885 : 203 - 214
  • [23] Knowledge discovery in clinical databases and evaluation of discovered knowledge in outpatient clinic
    Tsumoto, S
    INFORMATION SCIENCES, 2000, 124 (1-4) : 125 - 137
  • [24] A probabilistic framework towards the parameterization of association rule interestingness measures
    Lallich, Stephane
    Vaillant, Benoit
    Lenca, Philippe
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2007, 9 (03) : 447 - 463
  • [25] Two novel interestingness measures for gene association rule mining
    Meihua Wang
    Shumin Wu
    Ruichu Cai
    Neural Computing and Applications, 2013, 23 : 835 - 841
  • [26] Two novel interestingness measures for gene association rule mining
    Wang, Meihua
    Wu, Shumin
    Cai, Ruichu
    NEURAL COMPUTING & APPLICATIONS, 2013, 23 (3-4): : 835 - 841
  • [27] Optimized rule mining through a unified framework for interestingness measures
    Hebert, Celine
    Cremilleux, Bruno
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 238 - 247
  • [28] A Probabilistic Framework Towards the Parameterization of Association Rule Interestingness Measures
    Stéphane Lallich
    Benoît Vaillant
    Philippe Lenca
    Methodology and Computing in Applied Probability, 2007, 9 : 447 - 463
  • [29] A hybrid approach to rule discovery in databases
    Zhong, N
    Dong, JZ
    Ohsuga, S
    INFORMATION SCIENCES, 2000, 126 (1-4) : 99 - 127
  • [30] Knowledge discovery in medical multi-databases: A rough set approach
    Tsumoto, S
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 147 - 155