An Attention-Based Approach for Chemical Compound and Drug Named Entity Recognition

被引:0
|
作者
Yang P. [1 ]
Yang Z. [1 ]
Luo L. [1 ]
Lin H. [1 ]
Wang J. [1 ]
机构
[1] School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, Liaoning
来源
Jisuanji Yanjiu yu Fazhan/Computer Research and Development | 2018年 / 55卷 / 07期
基金
中国国家自然科学基金;
关键词
Attention; Chemical compound and drug name recognition; Conditional random fields (CRF); Deep learning; Long short-term memory (LSTM);
D O I
10.7544/issn1000-1239.2018.20170506
中图分类号
学科分类号
摘要
Recognizing chemical compound and drug name from unstructured data in the field of biomedical text mining is of great significance. The current popular approaches are based on CRF model which needs large amounts of hand-crafted features, and these approaches inevitably have the tagging non-consistency problem (the same mentions in a document are tagged different labels). In this paper, we propose an attention-based BiLSTM-CRF architecture to mitigate these aforementioned drawbacks. First, word embedding is obtained from vast amounts of unlabeled biomedical text. Then the characters of current word are fed to a BiLSTM layer to learn the character representation of this word. After this, word and character representations are transformed to another BiLSTM layer and the current adjacency context representation of this word is generated. Then we use attention mechanism to obtain the current word's context at document level on the basis of the adjacency context of all words in this document and the current word. At last, a CRF layer is used to predict the label sequence of this document according to the integration of the current adjacency context and the document-level context. Experimental results show that our method improves the consistency of mention's label in the same document, and it can also achieve better performance (an F-score of 90.77%) than the state-of-the-art methods on the BioCreative IV CHEMDNER corpus. © 2018, Science Press. All right reserved.
引用
收藏
页码:1548 / 1556
页数:8
相关论文
共 14 条
  • [1] Krallinger M., Leitner F., Rabal O., Et al., CHEMDNER: The drugs and chemical names extraction challenge, Journal of Cheminformatics, 7, pp. 1-11, (2015)
  • [2] Krallinger M., Rabal O., Lourenco A., Et al., Overview of the CHEMDNER patents task, Proc of the 15th BioCreative Challenge Evaluation Workshop, (2015)
  • [3] Leaman R., Wei C.-H., Lu Z., NCBI at the BioCreative IV CHEMDNER Task: Recognizing chemical names in PubMed articles with tmChem, Proc of the 15th BioCreative Challenge Evaluation Workshop, (2013)
  • [4] Lu Y., Ji D., Yao X., Et al., CHEMDNER system with mixed conditional random fields and multi-scale word clustering, Journal of Cheminformatics, 7, pp. 4-9, (2015)
  • [5] Lecun Y., Bengio Y., Hinton G., Deep learning, Nature, 521, 7553, pp. 436-444, (2015)
  • [6] Collobert R., Weston J., Bottou L., Et al., Natural language processing (almost) from scratch, Journal of Machine Learning Research, 12, 8, pp. 2493-2537, (2011)
  • [7] Huang Z., Xu W., Yu K., Bidirectional LSTM-CRF models for sequence tagging
  • [8] Ma X., Hovy E., End-to-end sequence labeling via bi-directional LSTM-CNNS-CRF, Proc of the 54th Annual Meeting of the Association for Computational Linguistics, (2016)
  • [9] Mikolov T., Sutskever I., Chen K., Et al., Distributed representations of words and phrases and their compositionality, Proc the 26th Advances in Neural Information Processing Systems, (2013)
  • [10] Pennington J., Socher R., Manning C., Glove: Global vectors for word representation, Proc of the 2014 Conf on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532-1543, (2014)