An adjustable description quality measure for pattern discovery using the AQ methodology

被引:12
|
作者
Kaufman, KA [1 ]
Michalski, RS
机构
[1] George Mason Univ, Machine Learning & Inference Lab, Fairfax, VA 22030 USA
[2] Polish Acad Sci, Inst Comp Sci, PL-00901 Warsaw, Poland
关键词
machine learning; data mining; learning from noisy data; natural induction; AQ learning; decision rules; separate and conquer;
D O I
10.1023/A:1008787919756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In concept learning and data mining tasks, the learner is typically faced with a choice of many possible hypotheses or patterns characterizing the input data. If one can assume that training data contain no noise, then the primary conditions a hypothesis must satisfy are consistency and completeness with regard to the data. In real-world applications, however, data are often noisy, and the insistence on the full completeness and consistency of the hypothesis is no longer valid. In such situations, the problem is to determine a hypothesis that represents the best trade-off between completeness and consistency. This paper presents an approach to this problem in which a learner seeks rules optimizing a rule quality criterion that combines the rule coverage (a measure of completeness) and training accuracy (a measure of inconsistency). These factors are combined into a single rule quality measure through a lexicographical evaluation functional (LEF). The method has been implemented in the AQ18 learning system for natural induction and pattern discovery, and compared with several other methods. Experiments have shown that the proposed method can be easily tailored to different problems and can simulate different rule learners by modifying the parameter of the rule quality criterion.
引用
收藏
页码:199 / 216
页数:18
相关论文
共 50 条
  • [31] Serum antibody epitope discovery using pattern tiling
    Paull, Michael
    Daugherty, Patrick
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 255
  • [32] Pattern Discovery Using K-Means Algorithm
    Ahmed, Almahdi Mohammed
    Norwawi, Norita Md
    Ishak, Wan Hussain Wan
    Alkilany, Ahmed
    2014 WORLD CONGRESS ON COMPUTER APPLICATIONS AND INFORMATION SYSTEMS (WCCAIS), 2014,
  • [33] Pattern Discovery in Melanoma Domain Using Partitional Clustering
    Vernet, David
    Nicolas, Ruben
    Golobardes, Elisabet
    Fornells, Albert
    Garriga, Carles
    Puig, Susana
    Malvehy, Josep
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2008, 184 : 323 - +
  • [34] Online Unsupervised Pattern Discovery in Speech using Parallelization
    Gajjar, Mrugesh R.
    Govindarajan, R.
    Sreenivas, T. V.
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2458 - +
  • [35] Metagenes and molecular pattern discovery using matrix factorization
    Brunet, JP
    Tamayo, P
    Golub, TR
    Mesirov, JP
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (12) : 4164 - 4169
  • [36] Benchmarking Interpretability in Healthcare Using Pattern Discovery and Disentanglement
    Zhou, Pei-Yuan
    Takeuchi, Amane
    Martinez-Lopez, Fernando
    Ehghaghi, Malikeh
    Wong, Andrew K. C.
    Lee, En-Shiun Annie
    BIOENGINEERING-BASEL, 2025, 12 (03):
  • [37] Feature Discovery in Relevance Feedback Using Pattern Mining
    Pipanmaekaporn, Luepol
    2013 IEEE/ACIS 12TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2013, : 301 - 307
  • [38] ENHANCING ANOMALY DETECTION USING TEMPORAL PATTERN DISCOVERY
    Jakkula, Vikramaditya R.
    Crandall, Aaron S.
    Cook, Diane J.
    ADVANCED INTELLIGENT ENVIRONMENTS, 2009, : 175 - 194
  • [39] Pattern density methodology using IBM foundry technologies
    Scagnelli, David
    Grant, Casey
    Carrig, Keith
    Kemerer, Tim
    Landis, Howard
    McDevitt, Tom
    Sucharitaves, Jeanne-Tania
    Tsai, Esther
    Kumar, Mukesh
    Pastel, Paul
    57TH ELECTRONIC COMPONENTS & TECHNOLOGY CONFERENCE, 2007 PROCEEDINGS, 2007, : 1300 - +
  • [40] Motor unit potential characterization using "pattern discovery"
    Pino, L. J.
    Stashuk, D. W.
    Boe, S. G.
    Doherty, T. J.
    MEDICAL ENGINEERING & PHYSICS, 2008, 30 (05) : 563 - 573