Recurrent Neural Network-Based Model for Named Entity Recognition with Improved Word Embeddings

被引:2
|
作者
Goyal, Archana [1 ]
Gupta, Vishal [2 ]
Kumar, Manish [3 ]
机构
[1] Goswami Ganesh Dutta Sanatan Dharma Coll, PG Dept Informat Technol, Chandigarh 160030, India
[2] Panjab Univ, Univ Inst Engn & Technol, Chandigarh 160014, India
[3] Panjab Univ Reg Ctr, Comp Sci & Applicat, Muktsar, Punjab, India
关键词
Bidirectional long short-term memory (Bi-LSTM); convolutional neural network (CNN); named entity recognition (NER); recurrent neural network (RNN); word embeddings;
D O I
10.1080/03772063.2021.2006805
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Extraction of meaningful information from a huge amount of data available on the web is a quite challenging task. The challenges faced in information extraction can be overcome with the help of an efficient named entity recognition (NER) system. Named entities are the proper names that play an important role in searching important information of interest. In this study, an efficient deep learning-based NER technique has been proposed which recognizes the named entities belonging to the general domain from Hindi, Punjabi, and bilingual Hindi and Punjabi text. An important variant of recurrent neural network, namely bidirectional long short-term memory-based model using improved word embeddings has been developed. Improved word embeddings are the combination of character convolutional neural network embeddings and part of speech embeddings. The main findings of the study include the development of a NER system that can extract named entities not only from Hindi and Punjabi datasets individually but also from mixed Hindi and Punjabi text. Besides, improved word embeddings are the combination of character-level features and word-level features which we find as the novel work as per our knowledge. Improved word embeddings are found to be effective in achieving better results than the results obtained by earlier NER models with deep feature extraction tasks.
引用
收藏
页码:6970 / 6976
页数:7
相关论文
共 50 条
  • [21] Improving Chemical Named Entity Recognition in Patents with Contextualized Word Embeddings
    Zhai, Zenan
    Dat Quoc Nguyen
    Akhondi, Saber A.
    Thorne, Camilo
    Druckenbrodt, Christian
    Cohn, Trevor
    Gregory, Michelle
    Verspoor, Karin
    SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2019), 2019, : 328 - 338
  • [22] Comparing general and specialized word embeddings for biomedical named entity recognition
    Ramos-Vargas, Rigo E.
    Roman-Godinez, Israel
    Torres-Ramos, Sulema
    PEERJ COMPUTER SCIENCE, 2021, 7 : 1 - 22
  • [23] Study of Named Entity Recognition Based on Graph Neural Network
    Shu, Wenhao
    Xi, Xuefeng
    Cui, Zhiming
    Gu, Chenkai
    Computer Engineering and Applications, 2023, 59 (19) : 52 - 65
  • [24] Biomedical Named Entity Recognition Based on Extended Recurrent Neural Networks
    Li, Lishuang
    Jin, Liuke
    Jiang, Zhenchao
    Song, Dingxin
    Huang, Degen
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 649 - 652
  • [25] A U-Shaped Network-Based Grid Tagging Model for Chinese Named Entity Recognition
    Xiang, Yan
    Zhao, Xuedong
    Guo, Junjun
    Shi, Zhiliang
    Chen, Enbang
    Zhang, Xiaobo
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (03): : 4149 - 4167
  • [26] A Feature Based Simple Machine Learning Approach with Word Embeddings to Named Entity Recognition on Tweets
    Taspinar, Mete
    Ganiz, Murat Can
    Acarman, Tankut
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 254 - 259
  • [27] LM-Based Word Embeddings Improve Biomedical Named Entity Recognition: A Detailed Analysis
    Akhtyamova, Liliya
    Cardiff, John
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2020), 2020, 12108 : 624 - 635
  • [28] Recurrent neural networks for Turkish named entity recognition
    Gungor, Onur
    Uskudarli, Suzan
    Gungor, Tunga
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [29] A Joint Model of Named Entity Recognition and Coreference Resolution Based on Hybrid Neural Network
    Gao C.-S.
    Zhang J.-F.
    Li W.-P.
    Zhao W.
    Zhang S.-K.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (03): : 442 - 448
  • [30] Word Embeddings for Unsupervised Named Entity Linking
    Nozza, Debora
    Sas, Cezar
    Fersini, Elisabetta
    Messina, Enza
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 115 - 132