Integrating Noun-Based Feature Ranking and Selection Methods with Arabic Text Associative Classification Approach

被引:3
|
作者
Ghareb, Abdullah S. [1 ]
Hamdan, Abdul Razak [1 ]
Abu Bakar, Azuraliza [1 ]
机构
[1] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Ctr Artificial Intelligence Technol, Bangi 43600, Selangor, Malaysia
关键词
Noun extraction; Feature ranking; Feature selection; Associative classification; Arabic text; Category association rule;
D O I
10.1007/s13369-014-1304-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Feature ranking and selection (FR&S) is an important preprocessing phase for text classification, and it is in most cases produces small valuable sub-feature space among the whole feature space and reduces the classification errors. As the associative classification (AC) approach is an efficient method and its training and testing depend on the way that features ranked and selected, the examining of feature ranking methods is very significant. This paper presents an integration method of Arabic noun extraction with four FR&S methods: term frequency-inverse document frequency (TF-IDF), document frequency, odd ratio, and class discriminating measure (CDM). Association rule technology uses the result of the integrated feature selection to construct an Arabic text associative classifier. In this study, the majority voting and ordered decision list prediction methods are used by AC to assign test document to its category. A set of experiments are conducted on collection of Arabic text documents, and the experimental results show that our AC method works better with extracted nouns and feature selection method than with feature selection method individually. The AC based on CDM and TF-IDF methods outperforms the other methods in terms of AC accuracy. As the results indicate, the proposed method produces satisfactory classification accuracy and it has good selecting effect on the Arabic text associative classifier.
引用
收藏
页码:7807 / 7822
页数:16
相关论文
共 50 条
  • [1] Integrating Noun-Based Feature Ranking and Selection Methods with Arabic Text Associative Classification Approach
    Abdullah S. Ghareb
    Abdul Razak Hamdan
    Azuraliza Abu Bakar
    Arabian Journal for Science and Engineering, 2014, 39 : 7807 - 7822
  • [2] Arabic Text Classification: A Review Study on Feature Selection Methods
    Hijazi, Musab Mustafa
    Zeki, Akram
    Ismail, Amelia
    2021 22ND INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2021, : 554 - 559
  • [3] The Effect of Combining Different Feature Selection Methods on Arabic Text Classification
    Al-Thubaity, Abdulmohsen
    Abanumay, Norah
    AL-Jerayyed, Sara
    Alrukban, Aljoharah
    Mannaa, Zarah
    2013 14TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD 2013), 2013, : 211 - 216
  • [4] Utilizing arabic wordnet relations in arabic text classification: New feature selection methods
    Yousif, Suhad A.
    Sultani, Zainab N.
    Samawi, Venus W.
    IAENG International Journal of Computer Science, 2019, 46 (04) : 1 - 12
  • [5] Firefly Algorithm based Feature Selection for Arabic Text Classification
    Marie-Sainte, Souad Larabi
    Alalyani, Nada
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (03) : 320 - 328
  • [6] Feature selection based on ACO and knowledge graph for Arabic text classification
    Mosa, Mohamed Atef
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024, 36 (07) : 1155 - 1172
  • [7] Feature Selection Methods for Text Classification
    Dasgupta, Anirban
    Drineas, Petros
    Harb, Boulos
    Josifovski, Vanja
    Mahoney, Michael W.
    KDD-2007 PROCEEDINGS OF THE THIRTEENTH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2007, : 230 - +
  • [8] Different Classification Algorithms Based on Arabic Text Classification: Feature Selection Comparative Study
    Raho, Ghazi
    Al-Shalabi, Riyad
    Kanaan, Ghassan
    Asma'aNassar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2015, 6 (02) : 192 - 195
  • [9] Text Associative Classification Approach for Mining Arabic Data Set
    Ghareb, Abdullah S.
    Hamdan, Abdul Razak
    Abu Bakar, Azuraliza
    2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 114 - 120
  • [10] Feature Selection Method Based On Statistics of Compound Words for Arabic Text Classification
    Adel, Aisha
    Omar, Nazlia
    Albared, Mohammed
    Al-Shabi, Adel
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2019, 16 (02) : 178 - 185