Weirdness Coefficient as a Feature Selection Method for Arabic Special Domain Text Classification

Authors
Al-Thubaity, AbdulMohsen [1]
Alanazi, Albandari
Hazzaa, Itisam [2]
Al-Tuwaijri, Haya [1, 3]
Affiliations
[1] King Abdulaziz City Sci & Technol, Comp Res Inst, Riyadh, Saudi Arabia
[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[3] King Abdulaziz City Sci & Technol, Comp Res Inst, Riyadh, Saudi Arabia
Keywords
Weirdness Coefficient; NB; K-NN; Arabic text classification; feature selection
DOI
10.1109/IALP.2012.64
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Given the importance of organizing and managing the rapidly growing body of Arabic electronic content, this study introduces the Weirdness Coefficient (W) as a new feature selection method for Arabic special domain text classification. The proposed method was used to classify a dataset comprising five Islamic topics using Naive Bayes (NB) and K-nearest neighbor (K-NN) classifiers and three representation schemas. The results were compared with those of a well-known feature selection method, Chi-squared. In addition to being simple to compute, the Weirdness Coefficient showed promising classification accuracy.
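The abstract does not spell out how W is computed. As a rough illustration only, the sketch below uses the commonly cited definition of weirdness: a term's relative frequency in a special-domain corpus divided by its relative frequency in a general corpus. The function names, the add-one smoothing, the top-k cutoff, and the toy English tokens are all assumptions made for this example and are not taken from the paper.

```python
from collections import Counter


def weirdness_scores(special_tokens, general_tokens, smoothing=1.0):
    """Score each term of the special corpus by its weirdness.

    weirdness(t) = (f_special(t) / N_special) / (f_general(t) / N_general)

    `smoothing` is added to the general-corpus count and size so that terms
    absent from the general corpus do not cause a division by zero. This
    smoothing is an assumption of the sketch, not part of the abstract.
    """
    special_counts = Counter(special_tokens)
    general_counts = Counter(general_tokens)
    n_special = len(special_tokens)
    n_general = len(general_tokens)

    scores = {}
    for term, f_special in special_counts.items():
        rel_special = f_special / n_special
        rel_general = (general_counts[term] + smoothing) / (n_general + smoothing)
        scores[term] = rel_special / rel_general
    return scores


def select_top_terms(scores, k):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:k]]


# Invented toy data: English tokens stand in for Arabic text.
special = ["prayer", "fasting", "prayer", "the", "of", "zakat", "prayer", "zakat"]
general = ["the", "of", "the", "market", "of", "the", "news", "fasting"]

scores = weirdness_scores(special, general)
top_terms = select_top_terms(scores, k=2)
print(top_terms)
```

Terms that are frequent in the special corpus but rare in the general one score highest, while function words such as "the" score low. In a classification setting, the selected terms would then serve as the feature set passed to a classifier such as NB or K-NN.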
Pages: 69-72
Page count: 4