Semi-supervised deep learning based named entity recognition model to parse education section of resumes

被引:16
|
作者
Gaur, Bodhvi [1 ,2 ]
Saluja, Gurpreet Singh [1 ]
Sivakumar, Hamsa Bharathi [1 ]
Singh, Sanjay [1 ]
机构
[1] Manipal Inst Technol, Dept Informat & Commun Technol, MAHE, Manipal 576104, India
[2] Johns Hopkins Univ, Dept Comp Sci, 3400 North Charles St, Baltimore, MD 21218 USA
来源
NEURAL COMPUTING & APPLICATIONS | 2021年 / 33卷 / 11期
关键词
Named entity recognition (NER); Semi-supervised learning; Deep learning models; Natural language processing; Resume information extraction;
D O I
10.1007/s00521-020-05351-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A job seeker's resume contains several sections, including educational qualifications. Educational qualifications capture the knowledge and skills relevant to the job. Machine processing of the education sections of resumes has been a difficult task. In this paper, we attempt to identify educational institutions' names and degrees from a resume's education section. Usually, a significant amount of annotated data is required for neural network-based named entity recognition techniques. A semi-supervised approach is used to overcome the lack of large annotated data. We trained a deep neural network model on an initial (seed) set of resume education sections. This model is used to predict entities of unlabeled education sections and is rectified using a correction module. The education sections containing the rectified entities are augmented to the seed set. The updated seed set is used for retraining, leading to better accuracy than the previously trained model. This way, it can provide a high overall accuracy without the need of large annotated data. Our model has achieved an accuracy of 92.06% on the named entity recognition task.
引用
收藏
页码:5705 / 5718
页数:14
相关论文
共 50 条
  • [31] A Hybrid Approach to Semi-Supervised Named Entity Recognition in Health, Safety and Environment Reports
    Sari, Yunita
    Hassan, M. Fadzil
    Zamin, Norshuhani
    INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATIONS, PROCEEDINGS, 2009, : 599 - 602
  • [32] Named Entity Recognition for Vietnamese documents using semi-supervised learning method of CRFs with Generalized Expectation Criteria
    Thi-Ngan Pham
    Le Minh Nguyen
    Quang-Thuy Ha
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 85 - 88
  • [33] Semi-Supervised Learning Approach for Indonesian Named Entity Recognition (NER) Using Co-Training Algorithm
    Aryoyudanta, Bayu
    Adji, Teguh Bharata
    Llidayah, Lndriana
    2016 INTERNATIONAL SEMINAR ON INTELLIGENT TECHNOLOGY AND ITS APPLICATIONS (ISITIA): RECENT TRENDS IN INTELLIGENT COMPUTATIONAL TECHNOLOGIES FOR SUSTAINABLE ENERGY, 2016, : 7 - 11
  • [34] Activity recognition based on semi-supervised learning
    Guan, Donghai
    Yuan, Weiwei
    Lee, Young-Koo
    Gavrilov, Andrey
    Lee, Sungyoung
    13TH IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2007, : 469 - +
  • [35] Enlarging Drug Dictionary with Semi-Supervised Learning for Drug Entity Recognition
    Zeng, Donghuo
    Sun, Chengjie
    Lin, Lei
    Liu, Bingquan
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1929 - 1931
  • [36] Lao Named Entity Recognition based on semi-supervised cascaded Conditional Random Fields with generalized expectation criteria
    Yang, Mengjie
    Zhou, Lanjiang
    Yu, Zhengtao
    Wang, Hongbin
    Journal of Computational Information Systems, 2015, 11 (20): : 7595 - 7606
  • [37] ROSE-NER: Robust Semi-supervised Named Entity Recognition on Insufficient Labeled Data
    Chen, Haiyan
    Yuan, Shuwei
    Zhang, Xiang
    PROCEEDINGS OF THE 10TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE GRAPHS (IJCKG 2021), 2021, : 38 - 44
  • [38] Deep Semi-Supervised Learning
    Hailat, Zeyad
    Komarichev, Artem
    Chen, Xue-Wen
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2154 - 2159
  • [39] Image Recognition and Analysis of Intrauterine Residues Based on Deep Learning and Semi-Supervised Learning
    Tao, Tao
    Liu, Kan
    Wang, Li
    Wu, Haiying
    IEEE ACCESS, 2020, 8 : 162785 - 162799
  • [40] Semi-supervised Named Entity Recognition for Low-Resource Languages Using Dual PLMs
    Yohannes, Hailemariam Mehari
    Lynden, Steven
    Amagasa, Toshiyuki
    Matono, Akiyoshi
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PT I, NLDB 2024, 2024, 14762 : 166 - 180