Weirdness Coefficient as a Feature Selection Method for Arabic Special Domain Text Classification

被引:0
|
作者
Al-Thubaity, AbdulMohsen [1 ]
Alanazi, Albandari
Hazzaa, Itisam [2 ]
Al-Tuwaijri, Haya [1 ,3 ]
机构
[1] King Abdulaziz City Sci & Technol, Comp Res Inst, Riyadh, Saudi Arabia
[2] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[3] King Abdulaziz City Sci & Technol, Comp Res Inst, Riyadh, Saudi Arabia
关键词
Weirdness Coefficient; NB; K-NN; Arabic text classification; feature selection;
D O I
10.1109/IALP.2012.64
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the importance of organizing and managing the rapid growth in knowledge of Arabic electronic content, this study introduces the Weirdness Coefficient (W) as a new feature selection method for Arabic special domain text classification. The proposed method was used to classify a dataset comprising five Islamic topics using Naive base (NB) and K-nearest neighbor (K-NN) classifiers, and three representation schemas. The results were also compared with a well-known feature selection method, Chi-squared. In addition to its simplicity in computation, the Weirdness Coefficient showed promising classification accuracy.
引用
收藏
页码:69 / 72
页数:4
相关论文
共 50 条
  • [41] The Hybrid Feature Selection k-means Method for Arabic Webpage Classification
    Alghamdi, Hanan
    Selamat, Ali
    JURNAL TEKNOLOGI, 2014, 70 (05):
  • [42] A New Feature Selection Method for Text Classification Based on Independent Feature Space Search
    Liu, Yong
    Ju, Shenggen
    Wang, Junfeng
    Su, Chong
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [43] A Review on Feature Selection and Feature Extraction for Text Classification
    Shah, Foram P.
    Patel, Vibha
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2264 - 2268
  • [44] A CLASS SPECIFIC FEATURE SELECTION METHOD FOR IMPROVING THE PERFORMANCE OF TEXT CLASSIFICATION
    Venkatesh, V.
    Sharan, S. B.
    Mahalaxmy, S.
    Monisha, S.
    Sanjey, Ashick D. S.
    Ashokkumar, P.
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (02): : 1018 - 1028
  • [45] Distance Variance Score: An Efficient Feature Selection Method in Text Classification
    Wang, Heyong
    Hong, Ming
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [46] A hybrid feature selection method for text classification using a feature-correlation-based genetic algorithmA hybrid feature selection method for text classification...L. Farek, A. Benaidja
    Lazhar Farek
    Amira Benaidja
    Soft Computing, 2024, 28 (23) : 13567 - 13593
  • [47] A feature selection method based on synonym merging in text classification system
    Haipeng Yao
    Chong Liu
    Peiying Zhang
    Luyao Wang
    EURASIP Journal on Wireless Communications and Networking, 2017
  • [48] A feature selection method based on synonym merging in text classification system
    Yao, Haipeng
    Liu, Chong
    Zhang, Peiying
    Wang, Luyao
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2017,
  • [49] A new feature selection method for handling redundant information in text classification
    You-wei Wang
    Li-zhou Feng
    Frontiers of Information Technology & Electronic Engineering, 2018, 19 : 221 - 234
  • [50] A novel multivariate filter method for feature selection in text classification problems
    Labani, Mahdieh
    Moradi, Parham
    Ahmadizar, Fardin
    Jalili, Mahdi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 70 : 25 - 37