Detection of semantic errors in Arabic texts

Cited by: 4

Authors:

Zribi, Chiraz Ben Othmane [1]

Ben Ahmed, Mohamed [1]

Affiliations:

[1] Manouba Univ, RIADI Lab, Manouba, Tunisia

Source:

ARTIFICIAL INTELLIGENCE | 2013 / Vol. 195

Keywords:

Semantic error; Detection; Statistical method; Linguistic method; Combining methods; Co-occurrence; Collocation; Latent Semantic Analysis (LSA); Multi-Agent System (MAS); Arabic;

DOI:

10.1016/j.artint.2012.07.002

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Detecting semantic errors in a text is still a challenging area of investigation. A lot of research has been done on lexical and syntactic errors while fewer studies have tackled semantic errors, as they are more difficult to treat. Compared to other languages, Arabic appears to be a special challenge for this problem. Because words are graphically very similar to each other, the risk of getting semantic errors in Arabic texts is bigger. Moreover, there are special cases and unique complexities for this language. This paper deals with the detection of semantic errors in Arabic texts but the approach we have adopted can also be applied for texts in other languages. It combines four contextual methods (using statistics and linguistic information) in order to decide about the semantic validity of a word in a sentence. We chose to implement our approach on a distributed architecture, namely, a Multi Agent System (MAS). The implemented system achieved a precision rate of about 90% and a recall rate of about 83%. (C) 2012 Elsevier B.V. All rights reserved.

Pages: 249 - 264

Number of pages: 16

50 related records in total

[21] Deep Learning Based Technique for Plagiarism Detection in Arabic Texts
Suleiman, Dima
Awajan, Arafat
Al-Madi, Nailah
2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 216 - 222
[22] "Easy" meta-embedding for detecting and correcting semantic errors in Arabic documents
Zribi, Chiraz Ben Othmane
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (14) : 21161 - 21175
[23] Texts Semantic Similarity Detection Based Graph Approach
Mohebbi, Majid
Talebpour, Alireza
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2016, 13 (02) : 246 - 251
[24] Errors and non-errors in English-Arabic machine translation of gender-bound constructs in technical texts
Abu-Ayyash, Emad A. S.
ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2017), 2017, 117 : 73 - 80
[25] Diaparonyms in ESP Texts as Potential Causes of Lexical and Semantic Errors in FLSP Teaching
Chiknaverova, K. G.
Voevoda, E. V.
TOMSK STATE UNIVERSITY JOURNAL, 2021, (471) : 205 - 214
[26] SUDAN ARABIC TEXTS
Author unknown
SUDAN NOTES AND RECORDS, 1936, 19 (01) : 196 - 197
[27] OCR OF ARABIC TEXTS
AMIN, A
LECTURE NOTES IN COMPUTER SCIENCE, 1988, 301 : 616 - 625
[28] Semantic Similarity Analysis for Corpus Development and Paraphrase Detection in Arabic
Mahmoud, Adnen
Zrigui, Mounir
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2021, 18 (01) : 1 - 7
[29] Language Errors in Machine Translation of Encyclopedic Texts from English into Arabic: the case of Google Translate
Al-Samawi, Ahmad Muhammed
ARAB WORLD ENGLISH JOURNAL, 2014, : 182 - 211
[30] Detection of Semantic Errors from Simple Bangla Sentences
Hasan, K. M. Azharul
Hozaifa, Muhammad
Dutta, Sanjoy
2014 17TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), 2014, : 296 - 299
