Arabic named entity recognition in crime documents

被引:0
|
作者
Asharef, M. [1 ]
Omar, N. [1 ]
Albared, M. [1 ]
机构
[1] Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia
关键词
Natural language processing systems - Search engines - Character recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Named entity recognition (NER) systems aim to automatically identify and classify the proper nouns in text. NER systems play a significant role in many areas of Natural Language Processing (NLP) such as question answering systems, text summarization and information retrieval. Unlike previous Arabic NER systems which have been built to extract named entities from general Arabic text, our task involves extracting named entities from crime documents. Extracting named entities from crime text provides basic information for crime analysis. This paper presents a rule-based approach to Arabic NER system relevant to the crime domain. Based on morphological information, predefined crime and general indicator lists and an Arabic named entity annotation corpus from crime domain, several syntactical rules and patterns of Arabic NER are induced and then formalized. Then, these rules and patterns are applied to identify and classify named entities in Arabic crime text. The result shows that the accuracy of our system is 90%, and this result indicates that the method is effective and the performance of the achieved system is satisfactory. © 2005 - 2012 JATIT & LLS. All rights reserved.
引用
收藏
页码:1 / 6
相关论文
共 50 条
  • [21] Arabic Named Entity Recognition Using Boosting Method
    Sajadi, Mohamad Bagher
    Minaei, Behrooz
    2017 19TH CSI INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2017, : 281 - 288
  • [22] Named Entity Recognition and Classification in Historical Documents: A Survey
    Ehrmann, Maud
    Hamdi, Ahmed
    Pontes, Elvys Linhares
    Romanello, Matteo
    Doucet, Antoine
    ACM COMPUTING SURVEYS, 2024, 56 (02)
  • [23] Comparison of named entity recognition methodologies in biomedical documents
    Song, Hye-Jeong
    Jo, Byeong-Cheol
    Park, Chan-Young
    Kim, Jong-Dae
    Kim, Yu-Seop
    BIOMEDICAL ENGINEERING ONLINE, 2018, 17
  • [24] Named Entity Recognition in Unstructured Medical Text Documents
    Pearson, Cole
    Seliya, Naeem
    Dave, Rushit
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 412 - 417
  • [25] A Dataset of German Legal Documents for Named Entity Recognition
    Leitner, Elena
    Rehm, Georg
    Moreno-Schneider, Julian
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4478 - 4485
  • [26] Comparison of named entity recognition methodologies in biomedical documents
    Hye-Jeong Song
    Byeong-Cheol Jo
    Chan-Young Park
    Jong-Dae Kim
    Yu-Seop Kim
    BioMedical Engineering OnLine, 17
  • [27] Arabic Named Entity Recognition: A Feature-Driven Study
    Benajiba, Yassine
    Diab, Mona
    Rosso, Paolo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (05): : 926 - 934
  • [28] Named Entity Recognition in Arabic: A Review of Some Current Systems
    Elsebai, Ali
    Meziane, Farid
    CREATING GLOBAL ECONOMIES THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: THEORY & PRACTICE, VOLS 1-3, 2009, : 1245 - 1251
  • [29] Arabic Named Entity Recognition from diverse text types
    Shaalan, Khaled
    Raza, Hafsa
    ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2008, 5221 : 440 - 451
  • [30] Data Augmentation Techniques on Arabic Data for Named Entity Recognition
    Sabty, Caroline
    Omar, Islam
    Wasfalla, Fady
    Islam, Mohamed
    Abdennadher, Slim
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 292 - 299