Graphical Representation and Similarity Analysis of Protein Sequences Based on Fractal Interpolation

被引:24
|
作者
Hu, Hailong [1 ,2 ]
Li, Zhong [3 ]
Dong, Hongwei [4 ]
Zhou, Tianhe [3 ]
机构
[1] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 311300, Zhejiang, Peoples R China
[2] Zhejiang A&F Univ, Hangzhou 311300, Zhejiang, Peoples R China
[3] Zhejiang Sci Tech Univ, Coll Sci, Hangzhou 310018, Zhejiang, Peoples R China
[4] Jiangnan Univ, Dept Comp Sci, Wuxi 214122, Peoples R China
关键词
Protein sequence; graphic representation; fractal interpolation; principal component analysis; PHYSICOCHEMICAL PROPERTIES; DISTANCE; DIMENSION; ALIGNMENT; CURVE;
D O I
10.1109/TCBB.2015.2511731
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A new graphical representation of protein sequences is introduced in this paper. Nine main physicochemical properties of amino acids were used to obtain a 2D discrete point set for protein sequences by applying principal component analysis. The fractal method was then employed to interpolate discrete points in constructing a graphical representation of protein sequences. Fractal dimension of the protein curve was used to analyze the similarity of protein sequences by comparing the distance of vectors representing segments of protein sequences. The Jeffrey's and Matusita distance was modified in the similarity comparison of protein sequences with different lengths. Nine different species from Nicotinamide adenine dinucleotide (NADH) dehydrogenase 5 (ND5) protein sequences were tested as an example to demonstrate our method. Finally, a linear correlation and significance analysis was used to compare our results with other graphical representations referring to the ClustalW result. To confirm the validity of our method, eight species in NADH dehydrogenase 6 (ND6) protein families and twenty-seven species in beta-globin protein families were also analyzed. Experimental results show that the proposed method is effective for the similarity analysis of proteins.
引用
收藏
页码:182 / 192
页数:11
相关论文
共 50 条
  • [21] Analysis of similarity/dissimilarity of DNA sequences based on a class of 2D graphical representation
    Yao, Yu-Hua
    Dai, Qi
    Nan, Xu-Ying
    He, Ping-An
    Nie, Zuo-Ming
    Zhou, Song-Ping
    Zhang, Yao-Zhou
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (10) : 1632 - 1639
  • [22] Analysis of similarity/dissimilarity of DNA sequences based on novel 2-D graphical representation
    Randic, M
    Vracko, M
    Lers, N
    Plavsic, D
    CHEMICAL PHYSICS LETTERS, 2003, 371 (1-2) : 202 - 207
  • [23] Analysis of DNA sequences similarity based on a new 3-D graphical representation method
    Singh, Kshatrapal
    Kumar, Ashish
    Gupta, Manoj Kumar
    ROMANIAN JOURNAL OF INFORMATION TECHNOLOGY AND AUTOMATIC CONTROL-REVISTA ROMANA DE INFORMATICA SI AUTOMATICA, 2021, 31 (03): : 7 - 14
  • [24] On graphical and numerical representation of protein sequences
    Bai, FL
    Wang, TM
    JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2006, 23 (05): : 537 - 545
  • [25] A new graphical representation and its application in similarity/dissimilarity analysis of DNA sequences
    Luo, Jiawei
    Guo, Jiachen
    Li, Yang
    2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [26] On the similarity/dissimilarity of DNA sequences based on 4D graphical representation
    TANG XiaoChan
    Science Bulletin, 2010, (08) : 701 - 704
  • [27] On the similarity/dissimilarity of DNA sequences based on 4D graphical representation
    Tang XiaoChan
    Zhou PanPan
    Qiu WenYuan
    CHINESE SCIENCE BULLETIN, 2010, 55 (08): : 701 - 704
  • [28] A new 3D graphical representation for similarity/dissimilarity studies of protein sequences
    Chen, Yan
    Li, Kang-Shun
    Chang, Shan
    Yang, Lei
    Computer Modelling and New Technologies, 2014, 18 (12): : 296 - 303
  • [29] Analysis of similarity/dissimilarity of DNA sequences by a new 3D graphical representation
    Song, Jie
    JOURNAL OF BIOLOGICAL SYSTEMS, 2007, 15 (03) : 287 - 297
  • [30] Similarity studies of DNA sequences based on a new 2D graphical representation
    Huang, Guohua
    Liao, Bo
    Li, Yongfan
    Yu, Yougui
    BIOPHYSICAL CHEMISTRY, 2009, 143 (1-2) : 55 - 59