Deep neural networks ensemble for detecting medication mentions in tweets

Cited by: 20
Authors
Weissenbacher, Davy [1]
Sarker, Abeed [1]
Klein, Ari [1]
O'Connor, Karen [1]
Magge, Arjun [2]
Gonzalez-Hernandez, Graciela [1]
Affiliations
[1] Univ Penn, Dept Biostat Epidemiol & Informat, Perelman Sch Med, 480-492-0477,404 Blockley Hall,423 Guardian Dr, Philadelphia, PA 19104 USA
[2] Arizona State Univ, Biodesign Ctr Environm Hlth Engn, Tempe, AZ USA
Keywords
social media; pharmacovigilance; drug name detection; ensemble learning; text classification; TWITTER; MODELS;
DOI
10.1093/jamia/ocz156
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Objective: Twitter posts are now recognized as an important source of patient-generated data, providing unique insights into population health. A fundamental step toward incorporating Twitter data into pharmacoepidemiologic research is to automatically recognize medication mentions in tweets. Because lexical searches for medication names suffer from low recall due to misspellings or ambiguity with common words, we propose a more advanced method to recognize them.
Materials and Methods: We present Kusuri, an ensemble learning classifier able to identify tweets mentioning drug products and dietary supplements. Kusuri ("medication" in Japanese) is composed of 2 modules: first, 4 different classifiers (lexicon based, spelling variant based, pattern based, and a weakly trained neural network) are applied in parallel to discover tweets potentially containing medication names; second, an ensemble of deep neural networks encoding morphological, semantic, and long-range dependencies of important words in the tweets makes the final decision.
Results: On a class-balanced (50-50) corpus of 15 005 tweets, Kusuri demonstrated performance close to that of human annotators, with an F1 score of 93.7%, the best score achieved thus far on this corpus. On a corpus made of all tweets posted by 112 Twitter users (98 959 tweets, with only 0.26% mentioning medications), Kusuri obtained an F1 score of 78.8%. To the best of our knowledge, Kusuri is the first system to achieve this score on such an extremely imbalanced dataset.
Conclusions: The system identifies tweets mentioning drug names with performance high enough to ensure its usefulness, and is ready to be integrated into pharmacovigilance, toxicovigilance, or more generally, public health pipelines that depend on medication name mentions.
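The two-module design described in the abstract can be illustrated with a minimal Python sketch. It is not Kusuri's implementation: the toy lexicon, the spelling variants, the regular expression, the suffix heuristic standing in for the weakly trained neural network, the three stand-in scorers, and the majority vote are all assumptions made for illustration. The abstract does not state how the ensemble combines its members.

```python
import re
from typing import Callable, List

# Hypothetical toy resources; the real lexicon and patterns are not given in the abstract.
LEXICON = {"ibuprofen", "prozac", "melatonin"}
VARIANTS = {"ibuprofin": "ibuprofen", "prozak": "prozac"}
PATTERN = re.compile(r"\b(?:took|prescribed|mg of)\s+\w+", re.IGNORECASE)
DRUG_SUFFIXES = ("zole", "pril", "statin")


def tokens(tweet: str) -> List[str]:
    """Lowercase the tweet and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", tweet.lower())


# Module 1: four high-recall filters applied independently to each tweet.
def lexicon_filter(tweet: str) -> bool:
    return any(t in LEXICON for t in tokens(tweet))


def variant_filter(tweet: str) -> bool:
    return any(t in VARIANTS for t in tokens(tweet))


def pattern_filter(tweet: str) -> bool:
    return PATTERN.search(tweet) is not None


def weak_model_filter(tweet: str) -> bool:
    # Stand-in for the weakly trained neural network (assumption).
    return any(t.endswith(DRUG_SUFFIXES) for t in tokens(tweet))


FILTERS: List[Callable[[str], bool]] = [
    lexicon_filter, variant_filter, pattern_filter, weak_model_filter,
]


def is_candidate(tweet: str) -> bool:
    """A tweet is a candidate if any one of the four filters fires."""
    return any(f(tweet) for f in FILTERS)


# Module 2: stand-in scorers playing the role of the deep neural networks.
def score_known_name(tweet: str) -> float:
    return 0.9 if lexicon_filter(tweet) or variant_filter(tweet) else 0.1


def score_context(tweet: str) -> float:
    return 0.8 if pattern_filter(tweet) else 0.3


def score_suffix(tweet: str) -> float:
    return 0.7 if weak_model_filter(tweet) else 0.4


MODELS: List[Callable[[str], float]] = [score_known_name, score_context, score_suffix]


def ensemble_predict(tweet: str, threshold: float = 0.5) -> bool:
    """Majority vote over the thresholded scores (an assumed combination rule)."""
    votes = [m(tweet) >= threshold for m in MODELS]
    return sum(votes) > len(MODELS) / 2


def detect(tweet: str) -> bool:
    """Run module 1 as a filter, then let module 2 make the final decision."""
    return is_candidate(tweet) and ensemble_predict(tweet)


if __name__ == "__main__":
    for text in ["I took ibuprofin for my headache",
                 "I took the bus to work today",
                 "Lovely weather in Philadelphia"]:
        print(text, "->", detect(text))
```

In this sketch the first stage favors recall (the bus tweet is kept as a candidate because of the word "took"), and the second stage restores precision by rejecting it.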
Pages: 1618-1626
Page count: 9