Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks

被引:13
|
作者
Zhang, Canlin [1 ]
Bis, Daniel [2 ]
Liu, Xiuwen [2 ]
He, Zhe [3 ]
机构
[1] Florida State Univ, Dept Math, Tallahassee, FL 32306 USA
[2] Florida State Univ, Dept Comp Sci, Tallahassee, FL 32306 USA
[3] Florida State Univ, Sch Informat, Tallahassee, FL 32306 USA
关键词
Word sense disambiguation; LSTM; Self-attention; Biomedical;
D O I
10.1186/s12859-019-3079-8
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background In recent years, deep learning methods have been applied to many natural language processing tasks to achieve state-of-the-art performance. However, in the biomedical domain, they have not out-performed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly due to inherent similarities of medical word senses. Results In this paper, we propose two deep-learning-based models for supervised WSD: a model based on bi-directional long short-term memory (BiLSTM) network, and an attention model based on self-attention architecture. Our result shows that the BiLSTM neural network model with a suitable upper layer structure performs even better than the existing state-of-the-art models on the MSH WSD dataset, while our attention model was 3 or 4 times faster than our BiLSTM model with good accuracy. In addition, we trained "universal" models in order to disambiguate all ambiguous words together. That is, we concatenate the embedding of the target ambiguous word to the max-pooled vector in the universal models, acting as a "hint". The result shows that our universal BiLSTM neural network model yielded about 90 percent accuracy. Conclusion Deep contextual models based on sequential information processing methods are able to capture the relative contextual information from pre-trained input word embeddings, in order to provide state-of-the-art results for supervised biomedical WSD tasks.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Hybrid attention-based Long Short-Term Memory network for sarcasm identification
    Pandey, Rajnish
    Kumar, Abhinav
    Singh, Jyoti Prakash
    Tripathi, Sudhakar
    APPLIED SOFT COMPUTING, 2021, 106
  • [32] Attention-based long short-term memory network temperature prediction model
    Kun, Xiao
    Shan, Tian
    Yi, Tan
    Chao, Chen
    PROCEEDINGS OF 2021 7TH INTERNATIONAL CONFERENCE ON CONDITION MONITORING OF MACHINERY IN NON-STATIONARY OPERATIONS (CMMNO), 2021, : 278 - 281
  • [33] Real Time Attention based Bidirectional Long Short-Term Memory Networks for Air Pollution Forecasting
    Madam, Divyam
    Dua, Radhika
    Mukherjee, Prerana
    Lall, Brejesh
    2019 IEEE FIFTH INTERNATIONAL CONFERENCE ON BIG DATA COMPUTING SERVICE AND APPLICATIONS (IEEE BIGDATASERVICE 2019), 2019, : 151 - 158
  • [34] Intrusion Detection Based on Bidirectional Long Short-Term Memory with Attention Mechanism
    Yang, Yongjie
    Tu, Shanshan
    Ali, Raja Hashim
    Alasmary, Hisham
    Waqas, Muhammad
    Amjad, Muhammad Nouman
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (01): : 801 - 815
  • [35] Speech emotion recognition based on convolutional neural network with attention-based bidirectional long short-term memory network and multi-task learning
    Liu, Zhen-Tao
    Han, Meng-Ting
    Wu, Bao-Han
    Rehman, Abdul
    APPLIED ACOUSTICS, 2023, 202
  • [36] TA-BLSTM: Tag Attention-based Bidirectional Long Short-Term Memory for Service Recommendation in Mashup Creation
    Shi, Min
    Tang, Yufei
    Liu, Jianxun
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [37] Classification of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum
    Santoso, Jennifer
    Yamada, Takeshi
    Makino, Shoji
    2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2019, 2019, : 302 - 306
  • [38] Classification of causes of speech recognition errors using attention-based bidirectional long short-term memory and modulation spectrum
    Santoso, Jennifer
    Yamada, Takeshi
    Makino, Shoji
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 302 - 306
  • [39] An attention-based bidirectional long short-term memory based optimal deep learning technique for bone cancer detection and classifications
    Vaiyapuri, Thavavel
    Balaji, Prasanalakshmi
    Shridevi, S.
    Dharmarajlu, Santhi Muttipoll
    Alaseem, Nourah Ali
    AIMS MATHEMATICS, 2024, 9 (06): : 16704 - 16720
  • [40] Driver Identification Based on Gear Shift Events and Attention-Based Bidirectional Long Short-Term Memory for Manual Transmission System
    Deng, Tiancheng
    He, Xin
    Xu, Li
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 4361 - 4366