An exploration into CTEPH medications: Combining natural language processing, embedding learning, in vitro models, and real-world evidence for drug repurposing

Authors
Steiert, Daniel [1, 2, 3]
Wittig, Corey [1, 2, 3]
Banerjee, Priyanka [2, 3, 4]
Preissner, Robert [2, 3, 4]
Szulcek, Robert [1, 2, 3, 5]
Affiliations
[1] Charite Univ Med Berlin, Inst Physiol, Lab Vitro Modeling Syst Pulm & Thrombot Dis, Berlin, Germany
[2] Free Univ Berlin, Berlin, Germany
[3] Humboldt Univ, Berlin, Germany
[4] Charite Univ Med Berlin, Inst Physiol, Struct Bioinformat Grp, Berlin, Germany
[5] Deutsch Herzzentrum Charite, Dept Cardiac Anesthesiol & Intens Care Med, Berlin, Germany
Keywords
THROMBOEMBOLIC PULMONARY-HYPERTENSION; AMIODARONE; RIOCIGUAT; PREDICTION; KNOWLEDGE;
DOI
10.1371/journal.pcbi.1012417
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Background In the modern era, the growth of scientific literature presents a daunting challenge for researchers to keep informed of advancements across multiple disciplines.
Objective We apply natural language processing (NLP) and embedding learning concepts to design PubDigest, a tool that combs PubMed literature, aiming to pinpoint potential drugs that could be repurposed.
Methods Using NLP, especially term associations through word embeddings, we explored unrecognized relationships between drugs and diseases. To illustrate the utility of PubDigest, we focused on chronic thromboembolic pulmonary hypertension (CTEPH), a rare disease with an overall limited number of scientific publications.
Results Our literature analysis identified key clinical features linked to CTEPH by applying term frequency-inverse document frequency (TF-IDF) scoring, a technique measuring a term's significance in a text corpus. This allowed us to map related diseases. One standout was venous thrombosis (VT), which showed strong semantic links with CTEPH. Looking deeper, we discovered potential repurposing candidates for CTEPH through large-scale neural network-based contextualization of literature and predictive modeling on both the CTEPH and the VT literature corpora to find novel, yet unrecognized associations between the two diseases. Alongside the anti-thrombotic agent caplacizumab, benzofuran derivatives were an intriguing find. In particular, the benzofuran derivative amiodarone displayed potential anti-thrombotic properties in the literature. Our in vitro tests confirmed amiodarone's ability to reduce platelet aggregation significantly by 68% (p = 0.02). However, real-world clinical data indicated that CTEPH patients receiving amiodarone treatment faced a significant 15.9% higher mortality risk (p<0.001).
Conclusions While NLP offers an innovative approach to interpreting scientific literature, especially for drug repurposing, it is crucial to combine it with complementary methods like in vitro testing and real-world evidence. Our exploration with benzofuran derivatives and CTEPH underscores this point. Thus, blending NLP with hands-on experiments and real-world clinical data can pave the way for faster and safer drug repurposing approaches, especially for rare diseases like CTEPH.
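The TF-IDF scoring mentioned in the Results can be illustrated with a minimal Python sketch. It assumes the common textbook definition (term frequency normalized by document length, inverse document frequency as the natural log of N over the document count); the abstract does not state which TF-IDF variant PubDigest uses, and the function name `tf_idf` and the toy corpus are hypothetical, invented for illustration only.

```python
import math
from collections import Counter

def tf_idf(docs):
    """Return one {term: score} dict per tokenized document.

    Assumed textbook variant (not necessarily the one used in the paper):
      tf  = term count / document length
      idf = ln(N / number of documents containing the term)
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores

# Toy corpus, invented for illustration only (not data from the study).
corpus = [
    "cteph pulmonary hypertension thrombosis".split(),
    "venous thrombosis anticoagulation".split(),
    "pulmonary hypertension riociguat".split(),
]
scores = tf_idf(corpus)

# The highest-scoring term of the first document is the one that is
# frequent there but rare across the rest of the corpus.
top_term = max(scores[0], key=scores[0].get)
print(top_term)
```

In this toy corpus, a term that occurs in only one document outscores terms shared across documents, which is the property that makes TF-IDF useful for surfacing disease-specific vocabulary.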
Pages: 21