NAIVE BAYESIAN AND K-NEAREST NEIGHBOUR TO CATEGORIZE ARABIC TEXT DATA

被引:0
|
作者
Hadi, Wa'el Musa
Thabtah, Fadi
Hawari, Samer A. L.
Ababneh, Jafar
机构
关键词
Text Categorization; Naive Bayesian; Arabic Text Data; K-Nearest Neighbour;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Text classification is a supervised learning technique that uses labelled training data to derive a classification system (classifier) and then automatically classifies unlabelled text data using the derived classifier. This paper investigates Naive Bayesian method (NB) and K-Nearest Neighbour algorithm (KNN) on different Arabic data sets. The bases of our comparison are the most popular text evaluation measures. The Experimental results against different Arabic text categorisation data sets reveal that NB algorithm outperforms the KNN based on Cosine Coefficient approach with regards to all measures.
引用
收藏
页码:196 / 200
页数:5
相关论文
共 50 条
  • [1] VSMs with K-Nearest Neighbour to Categorise Arabic Text Data
    Thabtah, Fadl
    Hadi, Wa'el Musa
    Al-shammare, Gaith
    WCECS 2008: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, 2008, : 778 - 781
  • [2] Arabic Text Classification Using K-Nearest Neighbour Algorithm
    Alhutaish, Roiss
    Omar, Nazlia
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (02) : 190 - 195
  • [3] Naive Bayesian Based on Chi Square to Categorize Arabic Data
    Thabtah, Fadi
    Eljinini, Mohammad Ali H.
    Zamzeer, Mannam
    Hadi, Wa'el Musa
    INNOVATION AND KNOWLEDGE MANAGEMENT IN TWIN TRACK ECONOMIES: CHALLENGES & SOLUTIONS, VOLS 1-3, 2009, : 930 - +
  • [4] Balanced k-nearest neighbour imputation
    Hasler, Caren
    Tille, Yves
    STATISTICS, 2016, 50 (06) : 1310 - 1331
  • [5] k-Nearest Neighbour Classifiers - A Tutorial
    Cunningham, Padraig
    Delany, Sarah Jane
    ACM COMPUTING SURVEYS, 2021, 54 (06)
  • [6] K-Nearest Neighbour Classification for Interval-Valued Data
    Vu-Linh Nguyen
    Destercke, Sebastien
    Masson, Marie-Helene
    SCALABLE UNCERTAINTY MANAGEMENT (SUM 2017), 2017, 10564 : 93 - 106
  • [7] An evaluation of k-nearest neighbour imputation using Likert data
    Jönsson, P
    Wohlin, C
    10TH INTERNATIONAL SYMPOSIUM ON SOFTWARE METRICS, PROCEEDINGS, 2004, : 108 - 118
  • [8] Benchmarking k-nearest neighbour imputation with homogeneous Likert data
    Jonsson, Per
    Wohlin, Claes
    EMPIRICAL SOFTWARE ENGINEERING, 2006, 11 (03) : 463 - 489
  • [9] Benchmarking k-nearest neighbour imputation with homogeneous Likert data
    Per Jönsson
    Claes Wohlin
    Empirical Software Engineering, 2006, 11
  • [10] Semi-supervised Naive Hubness Bayesian k-Nearest Neighbor for Gene Expression Data
    Buza, Krisztian
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 101 - 110