Multi-target QPDR classification model for human breast and colon cancer-related proteins using star graph topological indices

被引:80
|
作者
Munteanu, Cristian Robert [2 ]
Magalhaes, Alexandre L. [2 ]
Uriarte, Eugenio [1 ,3 ]
Gonzalez-Diaz, Humberto [1 ,3 ]
机构
[1] Univ Santiago Compostela, Fac Pharm, Inst Ind Pharm, UBICA, Santiago De Compostela 15782, Spain
[2] Univ Porto, Dept Chem, Fac Sci, REQUIMTE, P-4169007 Oporto, Portugal
[3] Univ Santiago Compostela, Fac Pharm, Dept Organ Chem, Santiago De Compostela 15782, Spain
关键词
Input-coded multi-target QPDR; Star graph; Cancer theoretical model; Clinical proteomics; GDA method; AMINO-ACID-COMPOSITION; COAGULATION-FACTOR-XIII; SUPPORT VECTOR MACHINE; DNA PRIMARY SEQUENCES; SUBCELLULAR LOCATION; CELLULAR-AUTOMATA; CODON USAGE; QSAR MODEL; PREDICTION; REPRESENTATION;
D O I
10.1016/j.jtbi.2008.11.017
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The cancer diagnostic is a complex process and, sometimes, the specific markers can interfere or produce negative results. Thus, new simple and fast theoretical models are required. One option is the complex network graphs theory that permits us to describe any real system, from the small molecules to the complex genetic, neural or social networks by transforming real properties in topological indices. This work converts the protein primary structure data in specific Randic's star networks topological indices using the new sequence to star networks (S2SNet) application. A set of 1054 proteins were selected from previous works and contains proteins related or not with two types of cancer, human breast cancer (HBC) and human colon cancer (HCC). The general discriminant analysis method generates an input-coded multi-target classification model with the training/predicting set accuracies of 90.0% for the forward stepwise model type. In addition, a protein subset was modified by single amino acid mutations with higher log-odds PAM250 values and tested with the new classification if can be related with HBC or HCC. In conclusion, we shown that, using simple input data such is the primary protein sequence and the simples linear analysis, it is possible to obtain accurate classification models that can predict if a new protein related with two types of cancer. These results promote the use of the S2SNet in clinical proteomics. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:303 / 311
页数:9
相关论文
共 5 条
  • [1] Fragment-based in silico modeling of multi-target inhibitors against breast cancer-related proteins
    Speck-Planche, Alejandro
    Cordeiro, M. Natalia D. S.
    MOLECULAR DIVERSITY, 2017, 21 (03) : 511 - 523
  • [2] Fragment-based in silico modeling of multi-target inhibitors against breast cancer-related proteins
    Alejandro Speck-Planche
    M. Natália D. S. Cordeiro
    Molecular Diversity, 2017, 21 : 511 - 523
  • [3] Erratum to: Fragment-based in silico modeling of multi-target inhibitors against breast cancer-related proteins
    Alejandro Speck-Planche
    M. Natália D. S. Cordeiro
    Molecular Diversity, 2017, 21 : 525 - 525
  • [4] Multi-target meridians classification based on the topological structure of anti-cancer phytochemicals using deep learning
    Zhang, Sheng
    Zhang, Xianwei
    Du, Jiayin
    Wang, Wei
    Pi, Xitian
    JOURNAL OF ETHNOPHARMACOLOGY, 2024, 319
  • [5] Multi-target meridians classification based on the topological structure of anti-cancer phytochemicals using deep learning (vol 319, 117244, 2024)
    Zhang, Sheng
    Zhang, Xianwei
    Du, Jiayin
    Wang, Wei
    Pi, Xitian
    JOURNAL OF ETHNOPHARMACOLOGY, 2024, 326