Named Entity Recognition of Tunisian Arabic Using the Bi-LSTM-CRF Model

被引:2
|
作者
Mekki, Asma [1 ]
Zribi, Ines [2 ]
Ellouze, Mariem [1 ]
Belguith, Lamia Hadrich [1 ]
机构
[1] Univ Sfax, ANLP Res Grp, MIRACL, Sfax, Tunisia
[2] Univ Monastir, ANLP Res Grp, MIRACL, Monastir, Tunisia
关键词
Named entity recognition; Arabic dialect; Tunisian Arabic; Bi-LSTM-CRF;
D O I
10.1142/S0218213023500628
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named Entity Recognition (NER) is an NLP field that deals with recognizing and classifying entities in written text. Most Arabic NER research studies discuss the Arabic NER challenge for the Modern Standard Arabic (MSA) language. However, the presence of dialectal Arabic textual resources in social media, blogs, TV shows, etc. is increasingly progressive. Therefore, the treatment of named entities is rapidly becoming a necessity, particularly for dialectal Arabic. In this paper, we are interested in the collection and annotation of a corpus as well as the realization of a NER system for Tunisian Arabic (TA), named TUNER. To the best of the researchers' knowledge, this is the first study that uses the suggested method for this purpose. In the present study, we adopt a hybrid method based on a Bi-LSTM-CRF model and a rule-based method. The proposed TUNER system yields an F-measure of 91.43%. This is an interesting improvement over comparable related work dialectal Arabic NER systems.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] News text named entity Recognition based on BI-LSTM-CRF model
    Meng, LingMing
    Qi, WeiMin
    Zhou, YongKang
    Chen, Ying
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7217 - 7222
  • [2] Research of Clinical Named Entity Recognition Based on Bi-LSTM-CRF
    Qin Y.
    Zeng Y.
    Journal of Shanghai Jiaotong University (Science), 2018, 23 (3) : 392 - 397
  • [3] Application of a Hybrid Bi-LSTM-CRF Model to the Task of Russian Named Entity Recognition
    The Anh Le
    Arkhipov, Mikhail Y.
    Burtsev, Mikhail S.
    ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE, 2018, 789 : 91 - 103
  • [4] Research of Clinical Named Entity Recognition Based on Bi-LSTM-CRF
    秦颖
    曾颖菲
    Journal of Shanghai Jiaotong University(Science), 2018, 23 (03) : 392 - 397
  • [5] Thai Named Entity Recognition Using Bi-LSTM-CRF with Word and Character Representation
    Thattinaphanich, Suphanut
    Prom-on, Santitham
    PROCEEDINGS OF THE 2019 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY (INCIT): ENCOMPASSING INTELLIGENT TECHNOLOGY AND INNOVATION TOWARDS THE NEW ERA OF HUMAN LIFE, 2019, : 149 - 154
  • [6] Chinese Named Entity Recognition in Power Domain Based on Bi-LSTM-CRF
    Zhao, Zhenqiang
    Chen, Zhenyu
    Liu, Jinbo
    Huang, Yunhao
    Gao, Xingyu
    Di, Fangchun
    Li, Lixin
    Ji, Xiaohui
    2019 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND PATTERN RECOGNITION (AIPR 2019), 2019, : 176 - 180
  • [7] Ship Fault Named Entity Recognition Based on Bilayer Bi-LSTM-CRF
    Hou, TongJia
    Zhou, Liang
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 1032 - 1036
  • [8] A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention
    Jin, Guozhe
    Yu, Zhezhou
    COMPUTER SPEECH AND LANGUAGE, 2021, 65
  • [9] A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention
    Jin, Guozhe
    Yu, Zhezhou
    Yu, Zhezhou (yuzz@jlu.edu.cn), 1600, Academic Press (65):
  • [10] Effects of Hyper-parameters Setting in Bi-LSTM-CRF on Chinese Named Entity Recognition
    Zhang, Taozheng
    Ma, Pingping
    FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022, 2022, 12705