A rough set-based case-based reasoner for text categorization

Cited by: 34
Authors
Li, Y
Shiu, SCK [1]
Pal, SK
Liu, JNK
Affiliations
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700035, W Bengal, India
Keywords
text categorization (TC); case-based reasoning (CBR); rough set; case coverage; case reachability
DOI
10.1016/j.ijar.2005.06.019
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a novel rough set-based case-based reasoner for use in text categorization (TC). The reasoner has four main components: feature term extractor, document representor, case selector, and case retriever. It operates by first reducing the number of feature terms in the documents using the rough set technique. Then, the number of documents is reduced using a new document selection approach based on the case-based reasoning (CBR) concepts of coverage and reachability. As a result, the numbers of both feature terms and documents are reduced with only minimal loss of information. Finally, this smaller set of documents with fewer feature terms is used in TC. The proposed rough set-based case-based reasoner was tested on the Reuters-21578 text dataset. The experimental results demonstrate its effectiveness and efficiency: it significantly reduces the number of feature terms and documents, which is important for improving the efficiency of TC, while preserving and even improving classification accuracy. (C) 2005 Elsevier Inc. All rights reserved.
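The abstract does not spell out the document selection procedure, so the following is only a minimal sketch of how coverage and reachability are commonly used to prune a case base, not the authors' algorithm. It assumes each document is a set of feature terms with a category label, that one case "covers" another when both share a label and their Jaccard similarity meets a threshold, and that cases are picked greedily by how many still-uncovered cases they cover. The names `jaccard`, `coverage_sets`, `reachability_sets`, `select_cases`, the threshold value, and the toy documents are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Set, Tuple

# A case is (feature terms, category label). This representation is an assumption.
Case = Tuple[FrozenSet[str], str]


def jaccard(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Jaccard similarity of two term sets (assumed similarity measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def coverage_sets(cases: List[Case], threshold: float) -> Dict[int, Set[int]]:
    """coverage[i] = indices of cases that case i could classify correctly:
    same label and similarity at or above the threshold."""
    return {
        i: {
            j
            for j, (terms_j, label_j) in enumerate(cases)
            if label_i == label_j and jaccard(terms_i, terms_j) >= threshold
        }
        for i, (terms_i, label_i) in enumerate(cases)
    }


def reachability_sets(coverage: Dict[int, Set[int]]) -> Dict[int, Set[int]]:
    """reachability[j] = indices of cases that cover case j (inverse of coverage)."""
    reach: Dict[int, Set[int]] = {i: set() for i in coverage}
    for i, covered in coverage.items():
        for j in covered:
            reach[j].add(i)
    return reach


def select_cases(cases: List[Case], threshold: float = 0.5) -> List[int]:
    """Greedy selection: repeatedly keep the case covering the most
    still-uncovered cases. Every case covers itself, so the loop terminates."""
    coverage = coverage_sets(cases, threshold)
    uncovered = set(range(len(cases)))
    selected: List[int] = []
    while uncovered:
        # Ties go to the lowest index because max() keeps the first maximum.
        best = max(sorted(coverage), key=lambda i: len(coverage[i] & uncovered))
        selected.append(best)
        uncovered -= coverage[best]
    return selected


# Toy documents, invented for illustration only.
cases: List[Case] = [
    (frozenset({"oil", "price", "barrel"}), "crude"),
    (frozenset({"oil", "price", "opec"}), "crude"),
    (frozenset({"oil", "barrel", "opec"}), "crude"),
    (frozenset({"wheat", "harvest", "grain"}), "grain"),
    (frozenset({"wheat", "grain", "export"}), "grain"),
]

selected = select_cases(cases, threshold=0.5)
reach = reachability_sets(coverage_sets(cases, 0.5))
```

On the toy data, one document per category is enough to cover the rest, which illustrates the idea of shrinking the case base while keeping every remaining document reachable from a retained case. A real system would apply this after feature reduction and would likely use a weighted term representation rather than plain sets.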
Pages: 229-255
Page count: 27