An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing

Authors
Steiert, Daniel [1, 2, 3]
Wittig, Corey [1, 2, 3]
Banerjee, Priyanka [2, 3, 4]
Preissner, Robert [2, 3, 4]
Szulcek, Robert [1, 2, 3, 5]
Affiliations
[1] Charite Univ Med Berlin, Inst Physiol, Lab Vitro Modeling Syst Pulm & Thrombot Dis, Berlin, Germany
[2] Free Univ Berlin, Berlin, Germany
[3] Humboldt Univ, Berlin, Germany
[4] Charite Univ Med Berlin, Inst Physiol, Struct Bioinformat Grp, Berlin, Germany
[5] Deutsch Herzzentrum Charite, Dept Cardiac Anesthesiol & Intens Care Med, Berlin, Germany
Keywords
THROMBOEMBOLIC PULMONARY-HYPERTENSION; AMIODARONE; RIOCIGUAT; PREDICTION; KNOWLEDGE
DOI
10.1371/journal.pcbi.1012417
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
Background: In the modern era, the growth of scientific literature presents a daunting challenge for researchers to keep informed of advancements across multiple disciplines.
Objective: We apply natural language processing (NLP) and embedding learning concepts to design PubDigest, a tool that combs PubMed literature, aiming to pinpoint potential drugs that could be repurposed.
Methods: Using NLP, especially term associations through word embeddings, we explored unrecognized relationships between drugs and diseases. To illustrate the utility of PubDigest, we focused on chronic thromboembolic pulmonary hypertension (CTEPH), a rare disease with an overall limited number of scientific publications.
Results: Our literature analysis identified key clinical features linked to CTEPH by applying term frequency-inverse document frequency (TF-IDF) scoring, a technique measuring a term's significance in a text corpus. This allowed us to map related diseases. One standout was venous thrombosis (VT), which showed strong semantic links with CTEPH. Looking deeper, we discovered potential repurposing candidates for CTEPH through large-scale neural network-based contextualization of literature and predictive modeling on both the CTEPH and the VT literature corpora to find novel, yet unrecognized associations between the two diseases. Alongside the anti-thrombotic agent caplacizumab, benzofuran derivatives were an intriguing find. In particular, the benzofuran derivative amiodarone displayed potential anti-thrombotic properties in the literature. Our in vitro tests confirmed amiodarone's ability to reduce platelet aggregation significantly by 68% (p = 0.02). However, real-world clinical data indicated that CTEPH patients receiving amiodarone treatment faced a significant 15.9% higher mortality risk (p < 0.001).
Conclusions: While NLP offers an innovative approach to interpreting scientific literature, especially for drug repurposing, it is crucial to combine it with complementary methods like in vitro testing and real-world evidence. Our exploration with benzofuran derivatives and CTEPH underscores this point. Thus, blending NLP with hands-on experiments and real-world clinical data can pave the way for faster and safer drug repurposing approaches, especially for rare diseases like CTEPH.
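The abstract describes TF-IDF only as a technique that measures a term's significance in a text corpus. As a rough illustration of that idea, the following minimal Python sketch scores terms in a toy corpus. It is not the PubDigest implementation: the weighting variant (relative term frequency times log(N / df)), the whitespace tokenizer, the function name tf_idf, and the three example sentences are all assumptions made for this example.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: score} dict per document.

    Assumed weighting (not taken from the paper):
    tf = count / document length, idf = log(N / df).
    """
    # Naive lowercase whitespace tokenization, for illustration only.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


# Toy corpus, invented purely for illustration (not study data).
corpus = [
    "cteph follows unresolved pulmonary embolism",
    "venous thrombosis can cause pulmonary embolism",
    "amiodarone is an antiarrhythmic drug",
]
scores = tf_idf(corpus)
# Terms unique to one document outrank terms shared across documents.
ranked_first_doc = sorted(scores[0], key=scores[0].get, reverse=True)
```

Under this weighting, a term that occurs in every document receives a score of zero, while a term concentrated in a few documents scores higher, which is the property that makes TF-IDF useful for picking out features characteristic of a disease-specific corpus.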
Pages: 21