A semantic Similarity-Based approach to extract respiratory disease-symptom relations from biomedical literature

被引:0
|
作者
Celikten, Azer [1 ,3 ]
Bulut, Hasan [1 ]
Onan, Aytug [2 ]
机构
[1] Ege Univ, Fac Engn, Dept Comp Engn, TR-35100 Izmir, Turkiye
[2] Izmir Katip Celebi Univ, Fac Engn & Architecture, Dept Comp Engn, TR-35620 Izmir, Turkiye
[3] Akgun Technol, TR-06930 Ankara, Turkiye
来源
JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY | 2024年 / 40卷 / 01期
关键词
Information extraction; biomedical named entity recognition; biomedical relation extraction; disease-symptom relations; text mining; PERFORMANCE;
D O I
10.17341/gazimmfd.1354324
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the biomedical domain, the surge in article volume means valuable insights on diseases and symptoms are oftenhidden in academic literature. Leveraging natural language processing and text mining to sift through biomedicaltexts is vital for advancing early diagnosis, enhancing clinical decision support systems, and refining ontologies.Particularly for respiratory diseases, which share symptoms like fever, cough, and breathlessness, differentiatingbetween diseases based on symptoms is crucial for early and accurate diagnosis. This study introduces a methodfor extracting disease-symptom relationships, aiming to identify rare symptoms not mentioned in health resources but potentially related to diseases, and to ascertain the association strength between diseases and symptoms.Initially, a hybrid entity recognition approach was proposed for identifying diseases and symptoms in medicaltexts. Then, the diseases and symptoms were normalized, and their associations ranked by semantic similarityscores. Evaluated on a dataset of respiratory diseases, including academic article abstracts on asthma, bronchitis,pulmonary embolism, and COVID-19, the study uncovered rare symptoms in addition to characteristic ones. Thedot product similarity method proved more effective, achieving an average similarity score of 0.66, in establishingthe associations between diseases and symptoms, revealing the significance of literature validation in identifying rare symptom-disease relations
引用
收藏
页码:121 / 134
页数:14
相关论文
共 50 条
  • [21] Leveraging syntactic and semantic graph kernels to extract pharmacokinetic drug drug interactions from biomedical literature
    Zhang, Yaoyun
    Wu, Heng-Yi
    Xu, Jun
    Wang, Jingqi
    Soysal, Ergin
    Li, Lang
    Xu, Hua
    BMC SYSTEMS BIOLOGY, 2016, 10
  • [22] Optimizing graph-based patterns to extract biomedical events from the literature
    Haibin Liu
    Karin Verspoor
    Donald C Comeau
    Andrew D MacKinlay
    W John Wilbur
    BMC Bioinformatics, 16
  • [23] Optimizing graph-based patterns to extract biomedical events from the literature
    Liu, Haibin
    Verspoor, Karin
    Comeau, Donald C.
    MacKinlay, Andrew D.
    Wilbur, W. John
    BMC BIOINFORMATICS, 2015, 16
  • [24] Discovering relations between named entities from a large raw corpus using tree similarity-based clustering
    Zhang, M
    Su, J
    Wang, DM
    Zhou, GD
    Tan, CL
    NATURAL LANGUAGE PROCESSING - IJCNLP 2005, PROCEEDINGS, 2005, 3651 : 378 - 389
  • [25] A Semantic Approach for Mining Hidden Links from Complementary and Non-interactive Biomedical Literature
    Hu, Xiaohua
    Zhang, Xiaodan
    Yoo, Illhoi
    Zhang, Yanqing
    PROCEEDINGS OF THE SIXTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2006, : 200 - +
  • [26] Biomedical Relationship Extraction from Literature Based on Bio-Semantic Token Subsequences
    Katukuri, Jayasimha R.
    Xie, Ying
    Raghavan, Vijay V.
    2009 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2009, : 366 - +
  • [27] A similarity-based approach to leverage multi-cohort medical data on the diagnosis and prognosis of Alzheimer's disease
    Zhang, Hongjiu
    Zhu, Fan
    Dodge, Hiroko H.
    Higgins, Gerald A.
    Omenn, Gilbert S.
    Guan, Yuanfang
    GIGASCIENCE, 2018, 7 (07):
  • [28] Role of dermatomes in the determination of therapeutic characteristics of channel acupoints: A similarity-based analysis of data compiled from literature
    Ferreira A.S.
    Luiz A.B.
    Chinese Medicine, 8 (1)
  • [29] An approach for word categorization based on semantic similarity measure obtained from search engines
    Amasyah, M. Fatih
    2006 IEEE 14th Signal Processing and Communications Applications, Vols 1 and 2, 2006, : 53 - 56
  • [30] A semantic sequence similarity based approach for extracting medical entities from clinical conversations
    Satti, Fahad Ahmed
    Hussain, Musarrat
    Ali, Syed Imran
    Saleem, Misha
    Ali, Husnain
    Chung, Tae Choong
    Lee, Sungyoung
    INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)