Analysis and prediction of single-stranded and double-stranded DNA binding proteins based on protein sequences

Cited by: 10
Authors
Wang, Wei [1, 2]
Sun, Lin [1]
Zhang, Shiguang [1]
Zhang, Hongjun [3]
Shi, Jinling [4]
Xu, Tianhe [1]
Li, Keliang [1]
Affiliations
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Henan Province, Peoples R China
[2] Engn Technol Res Ctr Comp Intelligence & Data Min, Lab Computat Intelligence & Informat Proc, Xinxiang 453007, Henan Province, Peoples R China
[3] Anyang Univ, Sch Aviat Engn, Anyang 455000, Henan Province, Peoples R China
[4] Xuchang Univ, Sch Int Educ, Xuchang 461000, Henan Province, Peoples R China
Source
BMC Bioinformatics | 2017 / Volume 18
Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China
Keywords
SSBs (Single-stranded DNA-binding proteins); DSBs (Double-stranded DNA-binding proteins); Binding specificity; Protein sequence; SUBCELLULAR-LOCALIZATION; OB-FOLD; EVOLUTIONARY; RECOGNITION; SPECIFICITY; FEATURES; SITES; IDENTIFICATION; INTERFACE; DOMAINS;
DOI
10.1186/s12859-017-1715-8
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: DNA-binding proteins perform important functions in many biological activities. They can interact with single-stranded DNA (ssDNA) or double-stranded DNA (dsDNA) and are accordingly categorized as single-stranded DNA-binding proteins (SSBs) or double-stranded DNA-binding proteins (DSBs). Identifying DNA-binding proteins from amino acid sequences can help annotate protein functions and clarify binding specificity. In this study, we systematically consider several schemes for representing protein sequences: overall amino acid composition (OAAC), dipeptide composition, position-specific scoring matrix (PSSM) profiles, and split amino acid composition (SAA). We then adopt support vector machine (SVM) and random forest (RF) classification models to distinguish SSBs from DSBs.
Results: Our results suggest that some sequence features can significantly differentiate DSBs from SSBs. Evaluated by 10-fold cross-validation on the benchmark datasets, our prediction method achieves an accuracy of 88.7% and an area under the curve (AUC) of 0.919. Moreover, our method performs well in independent testing.
Conclusions: Using various sequence-derived features, we propose a novel method that accurately distinguishes DSBs from SSBs. The method also explores novel features, which could help reveal the binding specificity of DNA-binding proteins.
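Two of the sequence representations named in the abstract, overall amino acid composition and dipeptide composition, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names are hypothetical, the handling of non-standard residues (they are simply skipped) is an assumption, and the PSSM and split amino acid composition features as well as the SVM and RF classifiers are omitted.

```python
from itertools import product

# The 20 standard amino acids, in a fixed order that defines the feature layout.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(seq):
    """Return 20 values: the fraction of each standard residue in seq.

    Assumption: residues outside the 20 standard letters are ignored.
    """
    residues = [r for r in seq.upper() if r in AMINO_ACIDS]
    n = len(residues)
    return [residues.count(a) / n if n else 0.0 for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Return 400 values: the fraction of each ordered adjacent residue pair.

    Assumption: a pair is counted only if both residues are standard.
    """
    s = seq.upper()
    pairs = [s[i:i + 2] for i in range(len(s) - 1)]
    valid = [p for p in pairs if p[0] in AMINO_ACIDS and p[1] in AMINO_ACIDS]
    n = len(valid)
    counts = {}
    for p in valid:
        counts[p] = counts.get(p, 0) + 1
    return [
        counts.get(a + b, 0) / n if n else 0.0
        for a, b in product(AMINO_ACIDS, repeat=2)
    ]


def encode(seq):
    """Concatenate both compositions into one 420-dimensional feature vector."""
    return amino_acid_composition(seq) + dipeptide_composition(seq)
```

A vector produced by the hypothetical `encode` could then be passed, one row per protein, to any standard SVM or random forest implementation for the SSB versus DSB classification described above.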
Pages: 10
Related papers
50 in total
  • [31] The process of displacing the single-stranded DNA-binding protein from single-stranded DNA by RecO and RecR proteins
    Inoue, Jin
    Honda, Masayoshi
    Ikawa, Shukuko
    Shibata, Takehiko
    Mikawa, Tsutomu
    NUCLEIC ACIDS RESEARCH, 2008, 36 (01) : 94 - 109
  • [32] Hybridization of DNA and PNA molecular beacons to single-stranded and double-stranded DNA targets
    Kuhn, H
    Demidov, VV
    Coull, JM
    Fiandaca, MJ
    Gildea, BD
    Frank-Kamenetskii, MD
    JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2002, 124 (06) : 1097 - 1103
  • [33] NEW CHROMATOGRAPHIC METHOD FOR SEPARATION OF SINGLE-STRANDED DNA, DOUBLE-STRANDED DNA AND RNA
    UDVARDY, A
    VENETIANER, P
    ACTA BIOCHIMICA AND BIOPHYSICA ACADEMIAE SCIENTARIUM HUNGARICA, 1971, 6 (01): : 27 - +
  • [34] Model for helicase translocating along single-stranded DNA and unwinding double-stranded DNA
    Xie, Ping
    BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS, 2006, 1764 (11): : 1719 - 1729
  • [35] On-line melting of double-stranded DNA for analysis of single-stranded DNA using capillary electrophoresis
    Kuypers, AWHM
    Linssen, PCM
    Willems, PMW
    Mensink, EJBM
    JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES, 1996, 675 (02): : 205 - 211
  • [36] PARTITION OF DOUBLE-STRANDED AND SINGLE-STRANDED DEOXYRIBONUCLEIC ACID
    ALBERTSSON, PA
    ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 1962, : 264 - &
  • [37] RADIOSENSITIVITY OF SINGLE-STRANDED AND DOUBLE-STRANDED DEOXYRIBONUCLEIC ACID
    WEISSBERGER, E
    OKADA, S
    INTERNATIONAL JOURNAL OF RADIATION BIOLOGY AND RELATED STUDIES IN PHYSICS CHEMISTRY AND MEDICINE, 1961, 3 (03): : 331 - 333
  • [38] Single Molecular Investigation of Influence of Silver Ions on Double-Stranded and Single-Stranded DNA
    Wang, Wenting
    Yang, Guangcan
    Wang, Yanwei
    JOURNAL OF PHYSICAL CHEMISTRY B, 2025, 129 (09): : 2426 - 2432
  • [39] A SILENCER ELEMENT FOR THE LIPOPROTEIN-LIPASE GENE PROMOTER AND COGNATE DOUBLE-STRANDED AND SINGLE-STRANDED DNA-BINDING PROTEINS
    TANUMA, Y
    NAKABAYASHI, H
    ESUMI, M
    ENDO, H
    MOLECULAR AND CELLULAR BIOLOGY, 1995, 15 (01) : 517 - 523
  • [40] A gold nanoparticle based fluorescent probe for simultaneous recognition of single-stranded DNA and double-stranded DNA
    Haiyan Ma
    Zongbing Li
    Ning Xue
    Zhiyuan Cheng
    Xiangmin Miao
    Microchimica Acta, 2018, 185