An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing

被引:0
|
作者
Steiert, Daniel [1 ,2 ,3 ]
Wittig, Corey [1 ,2 ,3 ]
Banerjee, Priyanka [2 ,3 ,4 ]
Preissner, Robert [2 ,3 ,4 ]
Szulcek, Robert [1 ,2 ,3 ,5 ]
机构
[1] Charite Univ Med Berlin, Inst Physiol, Lab Vitro Modeling Syst Pulm & Thrombot Dis, Berlin, Germany
[2] Free Univ Berlin, Berlin, Germany
[3] Humboldt Univ, Berlin, Germany
[4] Charite Univ Med Berlin, Inst Physiol, Struct Bioinformat Grp, Berlin, Germany
[5] Deutsch Herzzentrum Charite, Dept Cardiac Anesthesiol & Intens Care Med, Berlin, Germany
关键词
THROMBOEMBOLIC PULMONARY-HYPERTENSION; AMIODARONE; RIOCIGUAT; PREDICTION; KNOWLEDGE;
D O I
10.1371/journal.pcbi.1012417
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background In the modern era, the growth of scientific literature presents a daunting challenge for researchers to keep informed of advancements across multiple disciplines. Objective We apply natural language processing (NLP) and embedding learning concepts to design PubDigest, a tool that combs PubMed literature, aiming to pinpoint potential drugs that could be repurposed. Methods Using NLP, especially term associations through word embeddings, we explored unrecognized relationships between drugs and diseases. To illustrate the utility of PubDigest, we focused on chronic thromboembolic pulmonary hypertension (CTEPH), a rare disease with an overall limited number of scientific publications. Results Our literature analysis identified key clinical features linked to CTEPH by applying term frequency-inverse document frequency (TF-IDF) scoring, a technique measuring a term's significance in a text corpus. This allowed us to map related diseases. One standout was venous thrombosis (VT), which showed strong semantic links with CTEPH. Looking deeper, we discovered potential repurposing candidates for CTEPH through large-scale neural network-based contextualization of literature and predictive modeling on both the CTEPH and the VT literature corpora to find novel, yet unrecognized associations between the two diseases. Alongside the anti-thrombotic agent caplacizumab, benzofuran derivatives were an intriguing find. In particular, the benzofuran derivative amiodarone displayed potential anti-thrombotic properties in the literature. Our in vitro tests confirmed amiodarone's ability to reduce platelet aggregation significantly by 68% (p = 0.02). However, real-world clinical data indicated that CTEPH patients receiving amiodarone treatment faced a significant 15.9% higher mortality risk (p<0.001). Conclusions While NLP offers an innovative approach to interpreting scientific literature, especially for drug repurposing, it is crucial to combine it with complementary methods like in vitro testing and real-world evidence. Our exploration with benzofuran derivatives and CTEPH underscores this point. Thus, blending NLP with hands-on experiments and real-world clinical data can pave the way for faster and safer drug repurposing approaches, especially for rare diseases like CTEPH.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Natural language processing - relevance to patient outcomes and real-world evidence
    Stewart, Robert
    Chaturvedi, Jaya
    Roberts, Angus
    EXPERT REVIEW OF PHARMACOECONOMICS & OUTCOMES RESEARCH, 2024, 24 (01) : 5 - 9
  • [2] NoisywikiHow: A Benchmark for Learning with Real-world Noisy Labels in Natural Language Processing
    Wu, Tingting
    Ding, Xiao
    Tang, Minji
    Zhang, Hao
    Qin, Bing
    Liu, Ting
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 4856 - 4873
  • [3] Natural language processing-optimized case selection for real-world evidence studies.
    Koskimaki, Jacob
    Hu, Jenny
    Zhang, Yiduo
    Mena, Jose
    Jones, Nehanda
    Lipschultz, Elizabeth
    Vaidya, Vivek Prabhakar
    Altay, Gabriel
    Erese, Vance Andrei
    Swaminathan, Krishna Kumar
    Mendonca, Emma
    Dutt, Tarun
    Singh, Kuldeep
    King, Tian
    Lakkimsetty, Vinay Phani Santosh
    Al-Olimat, Hussein
    Manning, Brittany
    Komatsoulis, George Anthony
    Chu, Simon
    Ottens, Jeff
    JOURNAL OF CLINICAL ONCOLOGY, 2022, 40 (16)
  • [4] A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications
    Subramanian, Shivashankar
    Baldini, Ioana
    Ravichandran, Sushma
    Katz-Rogozhnikov, Dmitriy A.
    Ramamurthy, Karthikeyan Natesan
    Sattigeri, Prasanna
    Varshney, Kush R.
    Wang, Annmarie
    Mangalath, Pradeep
    Kleiman, Laura B.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13376 - 13381
  • [5] Real-world evidence analysis of early-stage prostate cancer using natural language processing
    Juarez, Alvaro
    Carles, Joan
    Conde-Moreno, Antonio Jose
    Maroto-Rey, Pablo
    Puente, Javier
    Del Toro, Jacobo Munoz
    Calderon, Jose M.
    Lopez, Maria
    Valdivieso, Juan
    Casadevall, David
    Taberna, Miren
    Alcaraz, Antonio
    JOURNAL OF CLINICAL ONCOLOGY, 2023, 41 (16)
  • [6] Empowering Large Language Models: Tool Learning for Real-World Interaction
    Wang, Hongru
    Qin, Yujia
    Lin, Yankai
    Pan, Jeff Z.
    Wong, Kam-Fai
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2983 - 2986
  • [7] Extracting medications and associated adverse drug events using a natural language processing system combining knowledge base and deep learning
    Chen, Long
    Gu, Yu
    Ji, Xin
    Sun, Zhiyong
    Li, Haodan
    Gao, Yuan
    Huang, Yang
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) : 56 - 64
  • [8] A real-world case study for automated ticket team assignment using natural language processing and explainable models
    Pavelski, Lucas Marcondes
    Braga, Rodrigo de Souza
    PROCEEDINGS OF THE 37TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE 2022, 2022,
  • [9] Machine learning-based approach for glioblastoma drug repurposing on real-world patient data
    Lin, Ko-Hong
    Kim, Yejin
    Lee, Dung-Fang
    Jiang, Xiaoqian
    CANCER RESEARCH, 2023, 83 (08)
  • [10] Caffeine Enhances Real-World Language Processing: Evidence From a Proofreading Task
    Brunye, Tad T.
    Mahoney, Caroline R.
    Rapp, David N.
    Ditman, Tali
    Taylor, Holly A.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-APPLIED, 2012, 18 (01) : 95 - 108