Integrating Noun-Based Feature Ranking and Selection Methods with Arabic Text Associative Classification Approach

被引:0
|
作者
Abdullah S. Ghareb
Abdul Razak Hamdan
Azuraliza Abu Bakar
机构
[1] Universiti Kebangsaan Malaysia,Center for Artificial Intelligence Technology, Faculty of Information Science and Technology
关键词
Noun extraction; Feature ranking; Feature selection; Associative classification; Arabic text; Category association rule;
D O I
暂无
中图分类号
学科分类号
摘要
Feature ranking and selection (FR&S) is an important preprocessing phase for text classification, and it is in most cases produces small valuable sub-feature space among the whole feature space and reduces the classification errors. As the associative classification (AC) approach is an efficient method and its training and testing depend on the way that features ranked and selected, the examining of feature ranking methods is very significant. This paper presents an integration method of Arabic noun extraction with four FR&S methods: term frequency–inverse document frequency (TF-IDF), document frequency, odd ratio, and class discriminating measure (CDM). Association rule technology uses the result of the integrated feature selection to construct an Arabic text associative classifier. In this study, the majority voting and ordered decision list prediction methods are used by AC to assign test document to its category. A set of experiments are conducted on collection of Arabic text documents, and the experimental results show that our AC method works better with extracted nouns and feature selection method than with feature selection method individually. The AC based on CDM and TF-IDF methods outperforms the other methods in terms of AC accuracy. As the results indicate, the proposed method produces satisfactory classification accuracy and it has good selecting effect on the Arabic text associative classifier.
引用
收藏
页码:7807 / 7822
页数:15
相关论文
共 50 条
  • [41] Text classification based on feature selection and LDA model
    Zheng, C. (csahu@126.com), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [42] Hybrid ACO and TOFA Feature Selection Approach for Text Classification
    Alghamdi, Hanan S.
    Tang, H. Lilian
    Alshomrani, Saleh
    2012 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2012,
  • [43] A New Big Data Feature Selection Approach for Text Classification
    Amazal, Houda
    Kissi, Mohamed
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [44] Utilizing Artificial Bee Colony Algorithm as Feature Selection Method in Arabic Text Classification
    Hijazi, Musab Mustafa
    Zeki, Akram
    Ismail, Amelia
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 536 - 547
  • [45] A lightweight filter based feature selection approach for multi-label text classification
    Dhal P.
    Azad C.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12345 - 12357
  • [46] A COPRAS-based Approach to Multi-Label Feature Selection for Text Classification
    Mohanrasu, S. S.
    Janani, K.
    Rakkiyappan, R.
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2024, 222 : 3 - 23
  • [47] Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification
    Hamouda Chantar
    Majdi Mafarja
    Hamad Alsawalqah
    Ali Asghar Heidari
    Ibrahim Aljarah
    Hossam Faris
    Neural Computing and Applications, 2020, 32 : 12201 - 12220
  • [48] Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification
    Chantar, Hamouda
    Mafarja, Majdi
    Alsawalqah, Hamad
    Heidari, Ali Asghar
    Aljarah, Ibrahim
    Faris, Hossam
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (16): : 12201 - 12220
  • [49] Comparison of Feature Selection Methods in Text Classification on Highly Skewed Datasets
    Asim, Muhammad Nabeel
    Wasim, Muhammad
    Ali, Muhammad Sajid
    Rehman, Abdur
    2017 FIRST INTERNATIONAL CONFERENCE ON LATEST TRENDS IN ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (INTELLECT), 2017,
  • [50] Comparative Study of Feature Selection Methods for Medical Full Text Classification
    Adriano Goncalves, Carlos
    Lorenzo Iglesias, Eva
    Borrajo, Lourdes
    Camacho, Rui
    Seara Vieira, Adrian
    Goncalves, Celia Talma
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2019), PT II, 2019, 11466 : 550 - 560