A rough set-based case-based reasoner for text categorization

被引:34
|
作者
Li, Y
Shiu, SCK [1 ]
Pal, SK
Liu, JNK
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700035, W Bengal, India
关键词
text categorization (TC); case-based reasoning (CBR); rough set; case coverage; case reachability;
D O I
10.1016/j.ijar.2005.06.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel rough set-based case-based reasoner For Use in text categorization (TC). The reasoner has four main components: feature term extractor, document representor, case selector, and case retriever. It operates by first reducing the number of feature terms in the documents Using the rough set technique. Then, the number of documents is reduced using a new document selection approach based on the case-based reasoning (CBR) concepts of coverage and reachability. As a result, both the number of feature terms and documents are reduced with only minimal loss of information. Finally, this smaller set of documents with fewer feature terms is Used in TC. The proposed rough set-based case-based reasoner wits tested on the Reuters21578 text datasets. The experimental results demonstrate its effectiveness and efficiency as it significantly reduced feature terms and documents, important for improving the efficiency of TC, while preserving and even improving classification accuracy. (C) 2005 Elsevier Inc. All rights reserved.
引用
收藏
页码:229 / 255
页数:27
相关论文
共 50 条
  • [41] Software-effort estimation with a case-based reasoner
    Prietula, M
    Vicinanza, SS
    Mukhopadhyay, T
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 1996, 8 (3-4) : 341 - 363
  • [42] An Improved Set-based Reasoner for the Description Logic DLD4,x
    Cantone, Domenico
    Nicolosi-Asmundo, Marianna
    Santamaria, Daniele Francesco
    FUNDAMENTA INFORMATICAE, 2021, 178 (04) : 315 - 346
  • [43] A rough set-based hybrid feature selection method for topic-specific text filtering
    Li, Q
    Li, JH
    Liu, GS
    Li, SH
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1464 - 1468
  • [44] Apply a rough set-based classifier to dependency parsing
    Ji, Yangsheng
    Shang, Lin
    Dai, Xinyu
    Ma, Ruoce
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY, 2008, 5009 : 97 - 105
  • [45] Structural risk minimization of rough set-based classifier
    Jinfu Liu
    Mingliang Bai
    Na Jiang
    Daren Yu
    Soft Computing, 2020, 24 : 2049 - 2066
  • [46] A Framework on Rough Set-Based Partitioning Attribute Selection
    Herawan, Tutut
    Deris, Mustafa Mat
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 91 - 100
  • [47] On optimisation of the rough set-based decision rule extraction
    Stanczyk, Urszula
    PROCEEDINGS OF THE 8TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE (ACS'08): RECENT ADVANCES ON APPLIED COMPUTER SCIENCE, 2008, : 296 - +
  • [48] Rough set-based approaches for discretization: a compact review
    Rahman Ali
    Muhammad Hameed Siddiqi
    Sungyoung Lee
    Artificial Intelligence Review, 2015, 44 : 235 - 263
  • [49] Study of Rough Set-Based Taste Signals Identification
    Sun, Yingjuan
    Sun, Yinghui
    Pu, Dongbing
    PROGRESS IN POWER AND ELECTRICAL ENGINEERING, PTS 1 AND 2, 2012, 354-355 : 1170 - +
  • [50] Rough set-based regionalisation in air quality monitoring
    Karuppanan, Komathy
    Jayakumaran, Christina
    INTERNATIONAL JOURNAL OF ENVIRONMENT AND POLLUTION, 2013, 53 (1-2) : 131 - 147