ARTC: feature selection using association rules for text classification

Authors
Mozamel M. Saeed
Zaher Al Aghbari
Affiliations
[1] Prince Sattam Bin Abdulaziz University, Department of Computer Science
[2] University of Sharjah, Department of Computer Science
Keywords
Feature selection; Association rules; Text classification; Contrasting feature set; Text binary vector
Abstract
Many classification tasks, such as text classification, represent objects as extracted feature vectors. The high dimensionality of these raw feature vectors reduces both classification efficiency and accuracy. Reducing the size of the feature vectors by selecting the relevant features that best represent the objects is therefore an important aspect of text classification. Feature selection not only reduces the dimensionality of the feature vectors but also produces more efficient classification models with higher predictive power. In this paper, we propose ARTC, an effective feature selection method based on extracting association rules to classify text documents. The extracted association rules uncover the hidden relationships and correlations between relevant words within the textual documents of a class and across different classes. Consequently, each class of documents is represented by a small set of contrasting features that are more effective for text classification. Our experiments show that ARTC outperforms other relevant techniques in terms of classification performance and efficiency.
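
The abstract does not give ARTC's algorithmic details, so the following is only a minimal sketch of the general idea it describes, not the authors' method. It assumes documents are turned into binary word sets, mines word sets of size one or two that are frequent within a class, and keeps those whose class association rule (word set implies class) has high confidence across all classes. The function names, thresholds (`min_support`, `min_confidence`), and the toy corpus are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def binary_vector(doc):
    """Represent a document as its set of distinct lowercase words."""
    return frozenset(doc.lower().split())


def itemset_counts(docs, max_size=2):
    """Count how many documents contain each word set of size <= max_size."""
    counts = Counter()
    for words in docs:
        for k in range(1, max_size + 1):
            for itemset in combinations(sorted(words), k):
                counts[frozenset(itemset)] += 1
    return counts


def contrasting_features(corpus, min_support=0.6, min_confidence=0.9):
    """Select, per class, word sets that are frequent in that class and
    whose rule (word set -> class) has high confidence over the corpus.

    corpus: dict mapping a class label to a list of document strings.
    Thresholds are illustrative assumptions, not values from the paper.
    """
    vectors = {label: [binary_vector(d) for d in docs]
               for label, docs in corpus.items()}
    per_class = {label: itemset_counts(docs) for label, docs in vectors.items()}

    # Total number of documents, over all classes, containing each word set.
    total = Counter()
    for counts in per_class.values():
        total.update(counts)

    selected = {}
    for label, counts in per_class.items():
        n_docs = len(vectors[label])
        selected[label] = {
            itemset
            for itemset, count in counts.items()
            if count / n_docs >= min_support              # frequent in class
            and count / total[itemset] >= min_confidence  # rare elsewhere
        }
    return selected


# Hypothetical toy corpus, for illustration only.
corpus = {
    "sports": ["the team won the match",
               "the team lost the game",
               "a great match for the team"],
    "finance": ["the bank raised rates",
                "the bank cut rates again",
                "rates fell at the bank"],
}
selected = contrasting_features(corpus)
for label, itemsets in selected.items():
    print(label, sorted(sorted(s) for s in itemsets))
```

In this toy corpus the word "the" is frequent in both classes, so its rule confidence is only 0.5 and it is discarded, while "team" and the pair "bank", "rates" survive as class-specific features. Enumerating every word pair is quadratic in document length; a real implementation would likely use a pruned miner such as Apriori or FP-Growth.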
Pages: 22519-22529
Page count: 10