Recurrent Neural Network-Based Model for Named Entity Recognition with Improved Word Embeddings

Cited by: 2
Authors
Goyal, Archana [1]
Gupta, Vishal [2]
Kumar, Manish [3]
Affiliations
[1] Goswami Ganesh Dutta Sanatan Dharma Coll, PG Dept Informat Technol, Chandigarh 160030, India
[2] Panjab Univ, Univ Inst Engn & Technol, Chandigarh 160014, India
[3] Panjab Univ Reg Ctr, Comp Sci & Applicat, Muktsar, Punjab, India
Keywords
Bidirectional long short-term memory (Bi-LSTM); convolutional neural network (CNN); named entity recognition (NER); recurrent neural network (RNN); word embeddings
DOI
10.1080/03772063.2021.2006805
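Chinese Library Classification (CLC) code
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809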
Abstract
Extracting meaningful information from the huge amount of data available on the web is a challenging task. An efficient named entity recognition (NER) system can help overcome the challenges faced in information extraction. Named entities are proper names that play an important role in locating information of interest. This study proposes an efficient deep learning-based NER technique that recognizes general-domain named entities in Hindi, Punjabi, and bilingual Hindi and Punjabi text. The model is built on bidirectional long short-term memory (Bi-LSTM), an important variant of the recurrent neural network, and uses improved word embeddings. The improved word embeddings combine character-level convolutional neural network embeddings with part-of-speech embeddings. The main findings include the development of an NER system that can extract named entities not only from Hindi and Punjabi datasets individually but also from mixed Hindi and Punjabi text. In addition, the improved word embeddings combine character-level and word-level features, which, to the best of our knowledge, is novel. The improved word embeddings are found to give better results than earlier NER models with deep feature extraction.
Pages: 6970-6976
Number of pages: 7
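
The abstract describes the improved word embeddings as a combination of character-level CNN embeddings and part-of-speech embeddings. The following is a minimal sketch of that combination, not the authors' implementation: the dimensions, filter width, random untrained weights, example tokens, and all function names (`char_cnn_features`, `improved_embedding`, `rand_vec`) are assumptions made for illustration, and the Bi-LSTM tagger that would consume these vectors is left out.

```python
import random

# Toy sizes; the abstract does not state any of these hyperparameters.
CHAR_DIM, N_FILTERS, WIDTH, POS_DIM = 4, 5, 3, 3
rng = random.Random(0)  # fixed seed so runs are repeatable


def rand_vec(n):
    """Return n untrained weights drawn uniformly from [-1, 1]."""
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]


def char_cnn_features(word, char_vecs, filters, width=WIDTH):
    """1D convolution over character vectors, then max-over-time pooling."""
    pad = [0.0] * CHAR_DIM
    chars = [char_vecs.get(c, pad) for c in word]  # unknown chars map to zeros
    chars += [pad] * max(0, width - len(chars))    # pad very short words
    pooled = []
    for f in filters:  # each filter holds width * CHAR_DIM weights
        scores = []
        for i in range(len(chars) - width + 1):
            window = [x for vec in chars[i:i + width] for x in vec]
            scores.append(sum(w * x for w, x in zip(f, window)))
        pooled.append(max(scores))
    return pooled


def improved_embedding(word, pos_tag, char_vecs, filters, pos_vecs):
    """Concatenate character-CNN features with the POS-tag embedding."""
    return char_cnn_features(word, char_vecs, filters) + pos_vecs[pos_tag]


# Hypothetical two-token input: one Hindi and one Punjabi proper noun.
tokens = [("दिल्ली", "NNP"), ("ਪੰਜਾਬ", "NNP")]
char_vecs = {c: rand_vec(CHAR_DIM) for w, _ in tokens for c in w}
filters = [rand_vec(WIDTH * CHAR_DIM) for _ in range(N_FILTERS)]
pos_vecs = {tag: rand_vec(POS_DIM) for tag in {t for _, t in tokens}}

for word, tag in tokens:
    vec = improved_embedding(word, tag, char_vecs, filters, pos_vecs)
    print(word, len(vec))  # each vector has N_FILTERS + POS_DIM entries
```

In a full system along the lines the abstract outlines, one such vector per token would be passed through a trained Bi-LSTM layer that predicts an entity label for each token; here the weights are random, so the numbers themselves carry no meaning.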