Identification of Proteins and Genes Associated with Hedgehog Signaling Pathway Involved in Neoplasm Formation Using Text-Mining Approach

被引:0
|
作者
Biziukova, Nadezhda Yu. [1 ]
Ivanov, Sergey M. [1 ,2 ]
Tarasova, Olga A. [1 ]
机构
[1] Inst Biomed Chem, Dept Bioinformat, Moscow 119121, Russia
[2] Pirogov Russian Natl Res Med Univ, Dept Bioinformat, Moscow 117997, Russia
来源
BIG DATA MINING AND ANALYTICS | 2024年 / 7卷 / 01期
关键词
text-mining; data mining; Hedgehog pathway; neoplastic processes; enrichment analysis; pathology molecular mechanisms; CANCER PROGRESSION; WNT/BETA-CATENIN; ACTIVATION; EXPRESSION; GLI1; INHIBITION; WNT; PROLIFERATION; CELLS; SHH;
D O I
10.26599/BDMA.2023.9020007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analysis of molecular mechanisms that lead to the development of various types of tumors is essential for biology and medicine, because it may help to find new therapeutic opportunities for cancer treatment and cure including personalized treatment approaches. One of the pathways known to be important for the development of neoplastic diseases and pathological processes is the Hedgehog signaling pathway that normally controls human embryonic development. Systematic accumulation of various types of biological data, including interactions between proteins, regulation of genes transcription, proteomics, and metabolomics experiments results, allows the application of computational analysis of these big data for identification of key molecular mechanisms of certain diseases and pathologies and promising therapeutic targets. The aim of this study is to develop a computational approach for revealing associations between human proteins and genes interacting with the Hedgehog pathway components, as well as for identifying their roles in the development of various types of tumors. We automatically collect sets of abstract texts from the NCBI PubMed bibliographic database. For recognition of the Hedgehog pathway proteins and genes and neoplastic diseases we use a dictionary-based named entity recognition approach, while for all other proteins and genes machine learning method is used. For association extraction, we develop a set of semantic rules. We complete the results of the text analysis with the gene set enrichment analysis. The identified key pathways that may influence the Hedgehog pathway and their roles in tumor development are then verified using the information in the literature.
引用
收藏
页码:107 / 130
页数:24
相关论文
共 2 条
  • [1] A Gene Ranking Method Using Text-Mining for the Identification of Disease Related Genes
    Lee, Hyungmin
    Shin, Miyoung
    Hong, Munpyo
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 493 - 498
  • [2] Identification of candidate genes in Arabidopsis and Populus cell wall biosynthesis using text-mining, co-expression network analysis and comparative genomics
    Yang, Xiaohan
    Ye, Chu-Yu
    Bisaria, Anjali
    Tuskan, Gerald A.
    Kalluri, Udaya C.
    PLANT SCIENCE, 2011, 181 (06) : 675 - 687