A Semi-Markov Structured Support Vector Machine Model for High-Precision Named Entity Recognition

被引:0
|
作者
Arora, Ravneet [1 ]
Tsai, Chen-Tse [1 ]
Tsereteli, Ketevan [1 ]
Kambadur, Prabhanjan [1 ]
Yang, Yi [2 ]
机构
[1] Bloomberg LP, New York, NY 10022 USA
[2] ASAPP Inc, New York, NY USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition (NER) is the backbone of many NLP solutions. F-1 score, the harmonic mean of precision and recall, is often used to select/evaluate the best models. However, when precision needs to be prioritized over recall, a state-of-the-art model might not be the best choice. There is little in the literature that directly addresses training-time modifications to achieve higher precision information extraction. In this paper, we propose a neural semi-Markov structured support vector machine model that controls the precision-recall trade-off by assigning weights to different types of errors in the loss-augmented inference during training. The semi-Markov property provides more accurate phrase-level predictions, thereby improving performance. We empirically demonstrate the advantage of our model when high precision is required by comparing against strong baselines based on CRF. In our experiments with the CoNLL 2003 dataset, our model achieves a better precisionrecall trade-off at various precision levels.
引用
收藏
页码:5862 / 5866
页数:5
相关论文
共 50 条
  • [1] TaggerOne: joint named entity recognition and normalization with semi-Markov Models
    Leaman, Robert
    Lu, Zhiyong
    BIOINFORMATICS, 2016, 32 (18) : 2839 - 2846
  • [2] Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition
    Okanohara, Daisuke
    Miyao, Yusuke
    Tsuruoka, Yoshimasa
    Tsujii, Jun'ichi
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 465 - 472
  • [3] Named entity recognition for Manipuri using support vector machine
    Center for Development of Advanced Computing, Gulmohar Cross Road No 9, Juhu, Mumbai-400049, India
    不详
    不详
    PACLIC 23 - Proc. 23rd Pacific Asia Conf. Lang. Inf. Comput., 2009, (811-818):
  • [4] A New Fuzzy Support Vector Machine Method for Named Entity Recognition
    Mansouri, Alireza
    Affendey, Lilly Suriani
    Mamat, Ali
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 24 - 28
  • [5] Named Entity Recognition in Malayalam using Fuzzy Support Vector Machine
    Lakshmi, G.
    Panicker, Janu R.
    Meera, M.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), 2016, : 201 - 206
  • [6] Named entity recognition in Bengali and Hindi using support vector machine
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    LINGUISTICAE INVESTIGATIONES, 2011, 34 (01): : 35 - 67
  • [7] Named Entity Recognition Using a New Fuzzy Support Vector Machine
    Mansouri, Alireza
    Affendey, Lilly Suriani
    Mamat, Ali
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (02): : 320 - 325
  • [8] Machine condition recognition via hidden semi-Markov model
    Yang, Wenhui
    Chen, Lu
    COMPUTERS & INDUSTRIAL ENGINEERING, 2021, 158 (158)
  • [9] Support Vector Machine Hidden Semi-Markov Model-based Heart Sound Segmentation
    Springer, David B.
    Tarassenko, Lionel
    Clifford, Gari D.
    2014 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), VOL 41, 2014, 41 : 625 - 628
  • [10] Named entity recognition using support vector machine: A Language independent approach
    Ekbal, Asif
    Bandyopadhyay, Sivaji
    World Academy of Science, Engineering and Technology, 2009, 39 : 548 - 563