Using hidden Markov models to predict DNA-binding proteins with sequence and structure information

被引:0
|
作者
Yi-Yu Hsu
Wei-Jhih Chen
Shu-Hui Chen
Hung-Yu Kao
机构
[1] National Cheng Kung University,Department of Computer Science and Information Engineering
[2] National Cheng Kung University,Institute of Medical Informatics
[3] National Cheng Kung University,Department of Chemistry
来源
Soft Computing | 2014年 / 18卷
关键词
Machine learning; Hidden Markov Model; DNA-binding proteins ; Support vector machines;
D O I
暂无
中图分类号
学科分类号
摘要
In the post-genome period, the protein domain structures are published rapidly, but they have not been studied comprehensively. To figure out the cell function, the protein–DNA interactions decrypt the protein domain structures in recent research. Several machine-learning based methods are applied to the issue; however, they are not efficient to translate the tertiary structure characteristics of proteins into appropriate features for predicting the DNA-binding proteins. In this work, a novel machine-learning approach based on hidden Markov models identifies the characteristics of DNA-binding proteins with their amino acid sequences and tertiary structures. After we distill the features from DNA-binding proteins, a support vector machine based classifier predicts general DNA-binding proteins with the accuracy of 88.45 % through fivefolds cross-validation. Furthermore, we construct a response element specific classifier for predicting response element specific DNA-binding proteins, and the performance achieves the precision of 96.57 % with recall rate as 88.83 % in average. To verify the prediction of DNA-binding proteins, we used the DNA-binding proteins from MCF-7 that are likely to bind with estrogen response elements (ERE), and the results show that our methods can apply to practice.
引用
收藏
页码:2365 / 2376
页数:11
相关论文
共 50 条
  • [1] Using hidden Markov models to predict DNA-binding proteins with sequence and structure information
    Hsu, Yi-Yu
    Chen, Wei-Jhih
    Chen, Shu-Hui
    Kao, Hung-Yu
    SOFT COMPUTING, 2014, 18 (12) : 2365 - 2376
  • [2] Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins
    Kuznetsov, Igor B.
    Gou, Zhenkun
    Li, Run
    Hwang, Seungwoo
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2006, 64 (01) : 19 - 27
  • [3] Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins
    Jones, S
    Shanahan, HP
    Berman, HM
    Thornton, JM
    NUCLEIC ACIDS RESEARCH, 2003, 31 (24) : 7189 - 7198
  • [4] Identification of DNA-Binding Proteins Using Support Vector Machine with Sequence Information
    Ma, Xin
    Wu, Jiansheng
    Xue, Xiaoyun
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2013, 2013
  • [5] Recognition models to predict DNA-binding specificities of homeodomain proteins
    Christensen, Ryan G.
    Enuameh, Metewo Selase
    Noyes, Marcus B.
    Brodsky, Michael H.
    Wolfe, Scot A.
    Stormo, Gary D.
    BIOINFORMATICS, 2012, 28 (12) : I84 - I89
  • [6] PreDNA: accurate prediction of DNA-binding sites in proteins by integrating sequence and geometric structure information
    Li, Tao
    Li, Qian-Zhong
    Liu, Shuai
    Fan, Guo-Liang
    Zuo, Yong-Chun
    Peng, Yong
    BIOINFORMATICS, 2013, 29 (06) : 678 - 685
  • [7] Sequence-based prediction of DNA-binding sites on DNA-binding proteins
    Gou, Z.
    Hwang, S.
    Kuznetsov, B., I
    PROCEEDINGS OF THE FIFTH INTERNATIONAL CONFERENCE ON BIOINFORMATICS OF GENOME REGULATION AND STRUCTURE, VOL 1, 2006, : 268 - +
  • [8] Bayesian basecalling for DNA sequence analysis using hidden Markov models
    Liang, Kuo-ching
    Wang, Xiaodong
    Anastassiou, Dimitris
    2006 40TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, VOLS 1-4, 2006, : 1599 - 1604
  • [9] Bayesian basecalling for DNA sequence analysis using hidden Markov models
    Liang, Kuo-ching
    Wang, Xiaodong
    Anastassiou, Dimitris
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (03) : 430 - 440
  • [10] STRUCTURE AND FUNCTION OF DNA-BINDING PROTEINS
    NELSON, HCM
    CURRENT OPINION IN GENETICS & DEVELOPMENT, 1995, 5 (02) : 180 - 189