Using rule sets to maximize ROC performance

Authors
Fawcett, T [1]
Affiliation
[1] Hewlett Packard Labs, Palo Alto, CA 94304 USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Rules are commonly used for classification because they are modular, intelligible, and easy to learn. Existing work in classification rule learning assumes the goal is to produce categorical classifications that maximize classification accuracy. Recent work in machine learning has pointed out the limitations of classification accuracy: when class distributions are skewed or error costs are unequal, an accuracy-maximizing rule set can perform poorly. A more flexible use of a rule set is to produce instance scores indicating the likelihood that an instance belongs to a given class. With such an ability, we can apply rule sets effectively when distributions are skewed or error costs are unequal. This paper empirically investigates different strategies for evaluating rule sets when the goal is to maximize the scoring (ROC) performance.
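To make the idea of scoring with a rule set concrete, the following is a minimal Python sketch and not the paper's method. It assumes each rule carries a confidence value and that an instance is scored by the highest confidence among the rules it matches; this is only one possible strategy, chosen here for illustration. The names `score`, `auc`, `example_rules`, and the toy data are all hypothetical. The area under the ROC curve is computed with the standard pairwise (Mann-Whitney) formulation.

```python
from typing import Callable, Dict, List, Tuple

Instance = Dict[str, float]
# A rule is a (condition, confidence) pair; the confidence is an assumed
# estimate of how strongly a match indicates the positive class.
Rule = Tuple[Callable[[Instance], bool], float]


def score(instance: Instance, rules: List[Rule], default: float = 0.0) -> float:
    """Score an instance by the maximum confidence of its matching rules.

    This max-confidence strategy is an illustrative assumption, not
    necessarily one of the strategies studied in the paper.
    """
    matched = [conf for cond, conf in rules if cond(instance)]
    return max(matched) if matched else default


def auc(scores: List[float], labels: List[int]) -> float:
    """Area under the ROC curve via pairwise comparison of scores.

    Counts the fraction of (positive, negative) pairs in which the positive
    instance receives the higher score; ties count as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical toy rule set and data, for illustration only.
example_rules: List[Rule] = [
    (lambda inst: inst["x"] > 5, 0.9),
    (lambda inst: inst["y"] > 2, 0.6),
]
example_instances: List[Instance] = [
    {"x": 6, "y": 0},
    {"x": 1, "y": 3},
    {"x": 1, "y": 3},
    {"x": 0, "y": 0},
]
example_labels = [1, 1, 0, 0]

example_scores = [score(inst, example_rules) for inst in example_instances]
example_auc = auc(example_scores, example_labels)
```

Because the output is a score and not a hard class label, the decision threshold can be chosen afterwards to suit a skewed class distribution or unequal error costs, which is the flexibility the abstract describes.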
Pages: 131-138
Number of pages: 8