NAIVE BAYESIAN AND K-NEAREST NEIGHBOUR TO CATEGORIZE ARABIC TEXT DATA

被引：0

作者：

Hadi, Wa'el Musa

Thabtah, Fadi

Hawari, Samer A. L.

Ababneh, Jafar

机构：

来源：

EUROPEAN SIMULATION AND MODELLING CONFERENCE 2008 | 2008年

关键词：

Text Categorization; Naive Bayesian; Arabic Text Data; K-Nearest Neighbour;

D O I：

暂无

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

Text classification is a supervised learning technique that uses labelled training data to derive a classification system (classifier) and then automatically classifies unlabelled text data using the derived classifier. This paper investigates Naive Bayesian method (NB) and K-Nearest Neighbour algorithm (KNN) on different Arabic data sets. The bases of our comparison are the most popular text evaluation measures. The Experimental results against different Arabic text categorisation data sets reveal that NB algorithm outperforms the KNN based on Cosine Coefficient approach with regards to all measures.

引用

页码：196 / 200

页数：5

共 50 条

[1] VSMs with K-Nearest Neighbour to Categorise Arabic Text Data
Thabtah, Fadl
Hadi, Wa'el Musa
Al-shammare, Gaith
WCECS 2008: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, 2008, : 778 - 781
[2] Arabic Text Classification Using K-Nearest Neighbour Algorithm
Alhutaish, Roiss
Omar, Nazlia
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (02) : 190 - 195
[3] Naive Bayesian Based on Chi Square to Categorize Arabic Data
Thabtah, Fadi
Eljinini, Mohammad Ali H.
Zamzeer, Mannam
Hadi, Wa'el Musa
INNOVATION AND KNOWLEDGE MANAGEMENT IN TWIN TRACK ECONOMIES: CHALLENGES & SOLUTIONS, VOLS 1-3, 2009, : 930 - +
[4] Balanced k-nearest neighbour imputation
Hasler, Caren
Tille, Yves
STATISTICS, 2016, 50 (06) : 1310 - 1331
[5] k-Nearest Neighbour Classifiers - A Tutorial
Cunningham, Padraig
Delany, Sarah Jane
ACM COMPUTING SURVEYS, 2021, 54 (06)
[6] K-Nearest Neighbour Classification for Interval-Valued Data
Vu-Linh Nguyen
Destercke, Sebastien
Masson, Marie-Helene
SCALABLE UNCERTAINTY MANAGEMENT (SUM 2017), 2017, 10564 : 93 - 106
[7] An evaluation of k-nearest neighbour imputation using Likert data
Jönsson, P
Wohlin, C
10TH INTERNATIONAL SYMPOSIUM ON SOFTWARE METRICS, PROCEEDINGS, 2004, : 108 - 118
[8] Benchmarking k-nearest neighbour imputation with homogeneous Likert data
Jonsson, Per
Wohlin, Claes
EMPIRICAL SOFTWARE ENGINEERING, 2006, 11 (03) : 463 - 489
[9] Benchmarking k-nearest neighbour imputation with homogeneous Likert data
Per Jönsson
Claes Wohlin
Empirical Software Engineering, 2006, 11
[10] Semi-supervised Naive Hubness Bayesian k-Nearest Neighbor for Gene Expression Data
Buza, Krisztian
PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 101 - 110

← 1 2 3 4 5 →