Comparative study of text representation and learning for Persian named entity recognition

被引:2
|
作者
Pour, Mohammad Mahdi Abdollah [1 ]
Momtazi, Saeedeh [1 ]
机构
[1] Amirkabir Univ Technol, Comp Engn Dept, Tehran, Iran
关键词
contextualized representation; NER; Persian language processing;
D O I
10.4218/etrij.2021-0269
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Transformer models have had a great impact on natural language processing (NLP) in recent years by realizing outstanding and efficient contextualized language models. Recent studies have used transformer-based language models for various NLP tasks, including Persian named entity recognition (NER). However, in complex tasks, for example, NER, it is difficult to determine which contextualized embedding will produce the best representation for the tasks. Considering the lack of comparative studies to investigate the use of different contextualized pretrained models with sequence modeling classifiers, we conducted a comparative study about using different classifiers and embedding models. In this paper, we use different transformer-based language models tuned with different classifiers, and we evaluate these models on the Persian NER task. We perform a comparative analysis to assess the impact of text representation and text classification methods on Persian NER performance. We train and evaluate the models on three different Persian NER datasets, that is, MoNa, Peyma, and Arman. Experimental results demonstrate that XLM-R with a linear layer and conditional random field (CRF) layer exhibited the best performance. This model achieved phrase-based F-measures of 70.04, 86.37, and 79.25 and word-based F scores of 78, 84.02, and 89.73 on the MoNa, Peyma, and Arman datasets, respectively. These results represent state-of-the-art performance on the Persian NER task.
引用
收藏
页码:794 / 804
页数:11
相关论文
共 50 条
  • [21] Learning Entity Representation for Named Entity Disambiguation
    Cai, Rui
    Wang, Houfeng
    Zhang, Junhao
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 267 - 278
  • [22] A comprehensive study of named entity recognition in Chinese clinical text
    Lei, Jianbo
    Tang, Buzhou
    Lu, Xueqin
    Gao, Kaihua
    Jiang, Min
    Xu, Hua
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (05) : 808 - 814
  • [23] A Comparative Study of Named Entity Recognition on Myanmar Language
    Nandar, Tin Latt
    Soe, Thinn Lai
    Soe, Khin Mar
    PROCEEDINGS OF 2020 23RD CONFERENCE OF THE ORIENTAL COCOSDA INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDISATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (ORIENTAL-COCOSDA 2020), 2020, : 60 - 64
  • [24] Named Entity Recognition through Deep Representation Learning and Weak Supervision
    Parker, Jerrod
    Yu, Shi
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 3828 - 3839
  • [25] Product named entity recognition in Chinese text
    Jun Zhao
    Feifan Liu
    Language Resources and Evaluation, 2008, 42 : 197 - 217
  • [26] Named entity recognition and classification for text in arabic
    Abuleil, S
    Evens, M
    INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 89 - 94
  • [27] Named Entity Recognition for Short Text Messages
    Ek, Tobias
    Kirkegaard, Camilla
    Jonsson, Hakan
    Nugues, Pierre
    COMPUTATIONAL LINGUISTICS AND RELATED FIELDS, 2011, 27 : 178 - 187
  • [28] Product named entity recognition in Chinese text
    Zhao, Jun
    Liu, Feifan
    LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (02) : 197 - 217
  • [29] A Comparative Study of Biomedical Named Entity Recognition Methods Based Machine Learning Approach
    Rais, Mohammed
    Lachkar, Abdelmonaime
    Lachkar, Abdelhamid
    El Alaoui Ouatik, Said
    2014 THIRD IEEE INTERNATIONAL COLLOQUIUM IN INFORMATION SCIENCE AND TECHNOLOGY (CIST'14), 2014, : 329 - 334
  • [30] Hierarchical Contextualized Representation for Named Entity Recognition
    Luo, Ying
    Xiao, Fengshun
    Zhao, Hai
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8441 - 8448