A semantic similarity-based approach to extract respiratory disease-symptom relations from biomedical literature

Authors
Celikten, Azer [1, 3]
Bulut, Hasan [1]
Onan, Aytug [2]
Affiliations
[1] Ege Univ, Fac Engn, Dept Comp Engn, TR-35100 Izmir, Turkiye
[2] Izmir Katip Celebi Univ, Fac Engn & Architecture, Dept Comp Engn, TR-35620 Izmir, Turkiye
[3] Akgun Technol, TR-06930 Ankara, Turkiye
Source
JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY | 2024, Vol. 40, No. 1
Keywords
Information extraction; biomedical named entity recognition; biomedical relation extraction; disease-symptom relations; text mining; PERFORMANCE;
DOI
10.17341/gazimmfd.1354324
Chinese Library Classification number
T [Industrial Technology];
Subject classification code
08;
Abstract
In the biomedical domain, the surge in article volume means valuable insights on diseases and symptoms are often hidden in academic literature. Leveraging natural language processing and text mining to sift through biomedical texts is vital for advancing early diagnosis, enhancing clinical decision support systems, and refining ontologies. Particularly for respiratory diseases, which share symptoms like fever, cough, and breathlessness, differentiating between diseases based on symptoms is crucial for early and accurate diagnosis. This study introduces a method for extracting disease-symptom relationships, aiming to identify rare symptoms not mentioned in health resources but potentially related to diseases, and to ascertain the association strength between diseases and symptoms. Initially, a hybrid entity recognition approach was proposed for identifying diseases and symptoms in medical texts. Then, the diseases and symptoms were normalized, and their associations ranked by semantic similarity scores. Evaluated on a dataset of respiratory diseases, including academic article abstracts on asthma, bronchitis, pulmonary embolism, and COVID-19, the study uncovered rare symptoms in addition to characteristic ones. The dot product similarity method proved more effective, achieving an average similarity score of 0.66, in establishing the associations between diseases and symptoms, revealing the significance of literature validation in identifying rare symptom-disease relations.
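The ranking step described in the abstract can be illustrated with a minimal sketch. It assumes each normalized disease and symptom is already represented as an embedding vector; the abstract does not say which embedding model was used, so the three-dimensional vectors and the names `rank_symptoms`, `asthma`, and `symptoms` below are hypothetical and chosen only for illustration. The sketch scores each symptom against a disease with the dot product and, for comparison, with cosine similarity.

```python
from math import sqrt


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    """Cosine similarity: the dot product of the length-normalized vectors."""
    return dot(u, v) / (sqrt(dot(u, u)) * sqrt(dot(v, v)))


def rank_symptoms(disease_vec, symptom_vecs, score=dot):
    """Return (symptom, score) pairs sorted from strongest to weakest association."""
    scored = [(name, score(disease_vec, vec)) for name, vec in symptom_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy embeddings (assumed values, not taken from the paper).
asthma = [0.9, 0.1, 0.3]
symptoms = {
    "wheezing": [0.8, 0.2, 0.1],
    "cough": [0.5, 0.5, 0.2],
    "leg swelling": [0.1, 0.9, 0.0],
}

dot_ranking = rank_symptoms(asthma, symptoms, score=dot)
cos_ranking = rank_symptoms(asthma, symptoms, score=cosine)

for name, value in dot_ranking:
    print(f"{name}: {value:.2f}")
```

Unlike cosine similarity, the dot product is sensitive to vector magnitude, so the two measures can produce different rankings on real embeddings even though they agree on this toy data.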
Pages: 121-134
Page count: 14