Deep neural networks ensemble for detecting medication mentions in tweets

被引:20
|
作者
Weissenbacher, Davy [1 ]
Sarker, Abeed [1 ]
Klein, Ari [1 ]
O'Connor, Karen [1 ]
Magge, Arjun [2 ]
Gonzalez-Hernandez, Graciela [1 ]
机构
[1] Univ Penn, Dept Biostat Epidemiol & Informat, Perelman Sch Med, 480-492-0477,404 Blockley Hall,423 Guardian Dr, Philadelphia, PA 19104 USA
[2] Arizona State Univ, Biodesign Ctr Environm Hlth Engn, Tempe, AZ USA
关键词
social media; pharmacovigilance; drug name detection; ensemble learning; text classification; TWITTER; MODELS;
D O I
10.1093/jamia/ocz156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Twitter posts are now recognized as an important source of patient-generated data, providing unique insights into population health. A fundamental step toward incorporating Twitter data in pharmacoepidemiologic research is to automatically recognize medication mentions in tweets. Given that lexical searches for medication names suffer from low recall due to misspellings or ambiguity with common words, we propose a more advanced method to recognize them. Materials and Methods: We present Kusuri, an Ensemble Learning classifier able to identify tweets mentioning drug products and dietary supplements. Kusuri ("medication" in Japanese) is composed of 2 modules: first, 4 different classifiers (lexicon based, spelling variant based, pattern based, and a weakly trained neural network) are applied in parallel to discover tweets potentially containing medication names; second, an ensemble of deep neural networks encoding morphological, semantic, and long-range dependencies of important words in the tweets makes the final decision. Results: On a class-balanced (50-50) corpus of 15 005 tweets, Kusuri demonstrated performances close to human annotators with an F-1 score of 93.7%, the best score achieved thus far on this corpus. On a corpus made of all tweets posted by 112 Twitter users (98 959 tweets, with only 0.26% mentioning medications), Kusuri obtained an F-1 score of 78.8%. To the best of our knowledge, Kusuri is the first system to achieve this score on such an extremely imbalanced dataset. Conclusions: The system identifies tweets mentioning drug names with performance high enough to ensure its usefulness, and is ready to be integrated in pharmacovigilance, toxicovigilance, or more generally, public health pipelines that depend on medication name mentions.
引用
收藏
页码:1618 / 1626
页数:9
相关论文
共 50 条
  • [41] Ensemble Malware Classification System Using Deep Neural Networks
    Narayanan, Barath Narayanan
    Davuluru, Venkata Salini Priyamvada
    ELECTRONICS, 2020, 9 (05)
  • [42] Snapshot boosting: a fast ensemble framework for deep neural networks
    Zhang, Wentao
    Jiang, Jiawei
    Shao, Yingxia
    Cui, Bin
    SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (01)
  • [43] PlexNet: An Ensemble of Deep Neural Networks for Biometric Template Protection
    Singh, Ashutosh
    Srivastva, Ranjeet
    Singh, Yogendra Narain
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (04) : 269 - 280
  • [44] An Ensemble Learning for Detecting Situational Awareness Tweets during Environmental Hazards
    Alshehri, Adel
    Alahamri, Saeed
    2019 13TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON), 2019,
  • [45] Detecting Adversarial Examples on Deep Neural Networks With Mutual Information Neural Estimation
    Gao, Song
    Wang, Ruxin
    Wang, Xiaoxuan
    Yu, Shui
    Dong, Yunyun
    Yao, Shaowen
    Zhou, Wei
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (06) : 5168 - 5181
  • [46] Detecting Customer Induced Damages in Motherboards with Deep Neural Networks
    Alves, Danilo
    Farias, Victor
    Chaves, Iago
    Chao, Richard
    Madeiro, Joao Paulo
    Gomes, Joao Paulo
    Machado, Javam
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [47] Detecting Malicious PowerShell Commands using Deep Neural Networks
    Hendler, Danny
    Kels, Shay
    Rubin, Amir
    PROCEEDINGS OF THE 2018 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (ASIACCS'18), 2018, : 187 - 197
  • [48] Deep Neural Networks for Detecting Asteroids in the ATLAS Data Pipeline
    Kaplan, Noah
    Loveland, Rohan
    Denneau, Larry
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1387 - 1392
  • [49] Feature Squeezing: Detecting Adversarial Examples in Deep Neural Networks
    Xu, Weilin
    Evans, David
    Qi, Yanjun
    25TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2018), 2018,
  • [50] Detecting Composite Image Manipulation based on Deep Neural Networks
    Choi, Hak-Yeol
    Jang, Han-Ul
    Kim, Dongkyu
    Son, Jeongho
    Mun, Seung-Min
    Choi, Sunghee
    Lee, Heung-Kyu
    2017 INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2017,