Deep neural networks ensemble for detecting medication mentions in tweets

被引:20
|
作者
Weissenbacher, Davy [1 ]
Sarker, Abeed [1 ]
Klein, Ari [1 ]
O'Connor, Karen [1 ]
Magge, Arjun [2 ]
Gonzalez-Hernandez, Graciela [1 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, Perelman Sch Med, 480-492-0477,404 Blockley Hall,423 Guardian Dr, Philadelphia, PA 19104 USA
[2] Arizona State Univ, Biodesign Ctr Environm Hlth Engn, Tempe, AZ USA
关键词
social media; pharmacovigilance; drug name detection; ensemble learning; text classification; TWITTER; MODELS;
D O I
10.1093/jamia/ocz156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Twitter posts are now recognized as an important source of patient-generated data, providing unique insights into population health. A fundamental step toward incorporating Twitter data in pharmacoepidemiologic research is to automatically recognize medication mentions in tweets. Given that lexical searches for medication names suffer from low recall due to misspellings or ambiguity with common words, we propose a more advanced method to recognize them. Materials and Methods: We present Kusuri, an Ensemble Learning classifier able to identify tweets mentioning drug products and dietary supplements. Kusuri ("medication" in Japanese) is composed of 2 modules: first, 4 different classifiers (lexicon based, spelling variant based, pattern based, and a weakly trained neural network) are applied in parallel to discover tweets potentially containing medication names; second, an ensemble of deep neural networks encoding morphological, semantic, and long-range dependencies of important words in the tweets makes the final decision. Results: On a class-balanced (50-50) corpus of 15 005 tweets, Kusuri demonstrated performances close to human annotators with an F-1 score of 93.7%, the best score achieved thus far on this corpus. On a corpus made of all tweets posted by 112 Twitter users (98 959 tweets, with only 0.26% mentioning medications), Kusuri obtained an F-1 score of 78.8%. To the best of our knowledge, Kusuri is the first system to achieve this score on such an extremely imbalanced dataset. Conclusions: The system identifies tweets mentioning drug names with performance high enough to ensure its usefulness, and is ready to be integrated in pharmacovigilance, toxicovigilance, or more generally, public health pipelines that depend on medication name mentions.
引用
收藏
页码:1618 / 1626
页数:9
相关论文
共 50 条
  • [21] Detecting atrial fibrillation by deep convolutional neural networks
    Xia, Yong
    Wulan, Naren
    Wang, Kuanquan
    Zhang, Henggui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 93 : 84 - 92
  • [22] Sentiment Polarity Detection in Bengali Tweets Using Deep Convolutional Neural Networks
    Sarkar, Kamal
    JOURNAL OF INTELLIGENT SYSTEMS, 2019, 28 (03) : 377 - 386
  • [23] COVID-19 outbreak: An ensemble pre-trained deep learning model for detecting informative tweets
    Malla, SreeJagadeesh
    Alphonse, P. J. A.
    APPLIED SOFT COMPUTING, 2021, 107
  • [24] COVID-19 outbreak: An ensemble pre-trained deep learning model for detecting informative tweets
    Malla, SreeJagadeesh
    P.J.A., Alphonse
    Malla, SreeJagadeesh (malla.sree@gmail.com), 1600, Elsevier Ltd (107):
  • [25] ABOUT AN ALGORITHM FOR CONSISTENT WEIGHTS INITIALIZATION OF DEEP NEURAL NETWORKS AND NEURAL NETWORKS ENSEMBLE LEARNING
    Drokin, I. S.
    VESTNIK SANKT-PETERBURGSKOGO UNIVERSITETA SERIYA 10 PRIKLADNAYA MATEMATIKA INFORMATIKA PROTSESSY UPRAVLENIYA, 2016, 12 (04): : 66 - 74
  • [26] NLP@UNED at SMM4H 2019: Neural Networks Applied to Automatic Classifications of Adverse Effects Mentions in Tweets
    Cortes-Tejada, Javier
    Martinez-Romo, Juan
    Araujo, Lourdes
    SOCIAL MEDIA MINING FOR HEALTH APPLICATIONS (#SMM4H) WORKSHOP & SHARED TASK, 2019, : 93 - 95
  • [27] An ensemble framework of deep neural networks for colorectal polyp classification
    Younas, Farah
    Usman, Muhammad
    Yan, Wei Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 18925 - 18946
  • [28] Classification of skin lesions using an ensemble of deep neural networks
    Harangi, Balazs
    Baran, Agnes
    Hajdu, Andras
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 2575 - 2578
  • [29] An ensemble of deep neural networks for kidney ultrasound image classification
    Sudharson, S.
    Kokil, Priyanka
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2020, 197
  • [30] Deep Neural Networks Guided Ensemble Learning for Point Estimation
    Zhan, Tianyu
    Fu, Haoda
    Kang, Jian
    STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2024, 16 (02): : 270 - 278