An adjustable description quality measure for pattern discovery using the AQ methodology

被引:12
|
作者
Kaufman, KA [1 ]
Michalski, RS
机构
[1] George Mason Univ, Machine Learning & Inference Lab, Fairfax, VA 22030 USA
[2] Polish Acad Sci, Inst Comp Sci, PL-00901 Warsaw, Poland
关键词
machine learning; data mining; learning from noisy data; natural induction; AQ learning; decision rules; separate and conquer;
D O I
10.1023/A:1008787919756
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In concept learning and data mining tasks, the learner is typically faced with a choice of many possible hypotheses or patterns characterizing the input data. If one can assume that training data contain no noise, then the primary conditions a hypothesis must satisfy are consistency and completeness with regard to the data. In real-world applications, however, data are often noisy, and the insistence on the full completeness and consistency of the hypothesis is no longer valid. In such situations, the problem is to determine a hypothesis that represents the best trade-off between completeness and consistency. This paper presents an approach to this problem in which a learner seeks rules optimizing a rule quality criterion that combines the rule coverage (a measure of completeness) and training accuracy (a measure of inconsistency). These factors are combined into a single rule quality measure through a lexicographical evaluation functional (LEF). The method has been implemented in the AQ18 learning system for natural induction and pattern discovery, and compared with several other methods. Experiments have shown that the proposed method can be easily tailored to different problems and can simulate different rule learners by modifying the parameter of the rule quality criterion.
引用
收藏
页码:199 / 216
页数:18
相关论文
共 50 条
  • [21] A knowledge discovery methodology from EEG data for cyclic alternating pattern detection
    Fátima Machado
    Francisco Sales
    Clara Santos
    António Dourado
    C. A. Teixeira
    BioMedical Engineering OnLine, 17
  • [22] Visual Pattern Discovery using Random Projections
    Anand, Anushka
    Wilkinson, Leland
    Tuan Nhon Dang
    2012 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2012, : 43 - 52
  • [23] Detecting Design Pattern Using Subgraph Discovery
    Qiu, Ming
    Jiang, Qingshan
    Gao, An
    Chen, Ergan
    Qiu, Di
    Chai, Shang
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, PT I, PROCEEDINGS, 2010, 5990 : 350 - 359
  • [24] Navigation pattern discovery using grammatical inference
    Karampatziakis, N
    Paliouras, G
    Pierrakos, D
    Stamatopoulos, P
    GRAMMATICAL INFERENCE: ALGORITHMS AND APPLICATIONS, PROCEEDINGS, 2004, 3264 : 187 - 198
  • [25] An ERB Loudness Pattern Based Objective Speech Quality Measure
    Chen, Guo
    Parsa, Vijay
    Scollie, Susan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2174 - +
  • [26] A methodology to enhance equipment performance using the OEE measure
    Badiger, Anil S.
    Gandhinathan, R.
    Gaitonde, V. N.
    EUROPEAN JOURNAL OF INDUSTRIAL ENGINEERING, 2008, 2 (03) : 356 - 376
  • [27] Using prior models as a measure of novelty in knowledge discovery
    Ludwig, J
    Fine, MJ
    Livingston, G
    Vozalis, E
    Buchanan, B
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, : 1071 - 1071
  • [28] A methodology to measure the service quality of online shopping of electronic goods in India
    Satapathy, S.
    Patel, S. K.
    Biswas, A.
    Mishra, P. D.
    INTERNATIONAL JOURNAL OF INDIAN CULTURE AND BUSINESS MANAGEMENT, 2013, 6 (02) : 227 - 247
  • [29] A Composite Methodology for Supporting Collaboration Pattern Discovery via Semantic Enrichment and Multidimensional Analysis
    Cuzzocrea, Alfredo
    Diamantini, Claudia
    Genga, Laura
    Potena, Domenico
    Storti, Emanuele
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 459 - 464
  • [30] Imperfect Pattern Recognition Using the Fuzzy Measure Theory
    Dahabiah, Anas
    Puentes, John
    Solaiman, Basel
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, PROCEEDINGS, 2009, 5788 : 101 - 108