ARTC: feature selection using association rules for text classification

被引:0
|
作者
Mozamel M. Saeed
Zaher Al Aghbari
机构
[1] Prince Sattam Bin Abdulaziz University,Department of Computer Science
[2] University of Sharjah,Department of Computer Science
来源
关键词
Feature selection; Association rules; Text classification; Contrasting feature set; Text binary vector;
D O I
暂无
中图分类号
学科分类号
摘要
Feature vectors are extracted to represent objects in many classification tasks, such as text classification. Due to the high dimensionality of these raw feature vectors, the classification efficiency and accuracy are reduced. Therefore, reducing the size of feature vectors by selecting the relevant features that better represent the objects is an important aspect in text classification. Feature selection not only reduces the dimensionality of the feature vectors, but also produces more efficient classification models with higher predictive power. In this paper, we propose ARTC, which is an effective feature selection method that is based on the extraction of association rules to classify text documents. The extracted association rules discover the hidden relationships and correlations between the relevant words within the textual documents of a class and a cross different classes. Consequently, each class of documents is represented by a small set of contrasting features that are more effective in text classification. Our experiments show that ARTC outperforms other relevant techniques in terms of classification performance and efficiency.
引用
收藏
页码:22519 / 22529
页数:10
相关论文
共 50 条
  • [21] Feature subset selection using naive Bayes for text classification
    Feng, Guozhong
    Guo, Jianhua
    Jing, Bing-Yi
    Sun, Tieli
    PATTERN RECOGNITION LETTERS, 2015, 65 : 109 - 115
  • [22] Feature selection using improved mutual information for text classification
    Novovicová, J
    Malík, A
    Pudil, P
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2004, 3138 : 1010 - 1017
  • [23] Feature Selection for Text Classification Using Machine Learning Approaches
    K. Thirumoorthy
    K. Muneeswaran
    National Academy Science Letters, 2022, 45 : 51 - 56
  • [24] A Review on Feature Selection and Feature Extraction for Text Classification
    Shah, Foram P.
    Patel, Vibha
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2264 - 2268
  • [25] Feature selection using association word mining for classification
    Ko, SJ
    Lee, JH
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, 2001, 2113 : 211 - 220
  • [26] Strategy of revising rules for association text classification
    Qiu, Jiangtao
    Tang, Changjie
    Zeng, Tao
    Liu, Yintian
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2009, 46 (04): : 683 - 688
  • [27] Feature Selection for Text Classification using OR plus SVM-RFE
    Luo, Meixiang
    Luo, Linkai
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 1648 - 1652
  • [28] Efficient Text Classification Using Best Feature Selection and Combination of Methods
    Srinivas, M.
    Supreethi, K. P.
    Prasad, E. V.
    Kumari, S. Anitha
    HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: DESIGNING INFORMATION ENVIRONMENTS, PT I, 2009, 5617 : 437 - +
  • [29] Comparison on Feature Selection Methods for Text Classification
    Liu, Wenkai
    Xiao, Jiongen
    Hong, Ming
    2020 THE 4TH INTERNATIONAL CONFERENCE ON MANAGEMENT ENGINEERING, SOFTWARE ENGINEERING AND SERVICE SCIENCES (ICMSS 2020), 2020, : 82 - 86
  • [30] Efficient Method for Feature Selection in Text Classification
    Sun, Jian
    Zhang, Xiang
    Liao, Dan
    Chang, Victor
    2017 INTERNATIONAL CONFERENCE ON ENGINEERING AND TECHNOLOGY (ICET), 2017,