Protein Secondary Structure Prediction Using Machine Learning

被引:0
|
作者
Saha, Sriparna [1 ]
Ekbal, Asif [1 ]
Sharma, Sidharth [1 ]
Bandyopadhyay, Sanghamitra [2 ]
Maulik, Ujjwal [3 ]
机构
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata, India
[3] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
来源
INTELLIGENT INFORMATICS | 2013年 / 182卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein structure prediction is an important component in understanding protein structures and functions. Accurate prediction of protein secondary structure helps in understanding protein folding. In many applications such as drug discovery it is required to predict the secondary structure of unknown proteins. In this paper we report our first attempt to secondary structure predication, and approach it as a sequence classification problem, where the task is equivalent to assigning a sequence of labels (i.e. helix, sheet, and coil) to the given protein sequence. We propose an ensemble technique that is based on two stochastic supervised machine learning algorithms, namely Maximum Entropy Markov Model (MEMM) and Conditional Random Field (CRF). We identify and implement a set of features that mostly deal with the contextual information. The proposed approach is evaluated with a benchmark dataset, and it yields encouraging performance to explore it further. We obtain the highest predictive accuracy of 61.26% and segment overlap score (SOY) of 52.30%.
引用
收藏
页码:57 / +
页数:2
相关论文
共 50 条
  • [1] Protein secondary structure prediction using machine learning
    Zhang, BF
    Chen, ZH
    Murphey, YL
    Proceedings of the International Joint Conference on Neural Networks (IJCNN), Vols 1-5, 2005, : 532 - 537
  • [2] MACHINE LEARNING APPROACH FOR THE PREDICTION OF PROTEIN SECONDARY STRUCTURE
    KING, RD
    STERNBERG, MJE
    JOURNAL OF MOLECULAR BIOLOGY, 1990, 216 (02) : 441 - 457
  • [3] PROTEIN SECONDARY STRUCTURE PREDICTION USING LOGIC-BASED MACHINE LEARNING
    MUGGLETON, S
    KING, RD
    STERNBERG, MJE
    PROTEIN ENGINEERING, 1992, 5 (07): : 647 - 657
  • [4] PROTEIN SECONDARY STRUCTURE PREDICTION USING LOGIC-BASED MACHINE LEARNING
    MUGGLETON, S
    KING, RD
    STERNBERG, MJE
    PROTEIN ENGINEERING, 1993, 6 (05): : 549 - 549
  • [5] Comparison of Machine Learning Classifiers for Protein Secondary Structure Prediction
    Aydin, Zafer
    Kaynar, Oguz
    Gormez, Yasin
    Isik, Yunus Emre
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [6] Protein Secondary Structure Prediction Based on Fusion of Machine Learning Classifiers
    de Oliveira, Gabriel Bianchin
    Pedrini, Helio
    Dias, Zanoni
    36TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2021, 2021, : 26 - 29
  • [7] Protein Secondary Structure Prediction based on CNN and Machine Learning Algorithms
    Ema, Romana Rahman
    Adnan, Md Nasim
    Khatun, Mt Akhi
    Galib, Syed Md.
    Kabir, Sk Shalauddin
    Hossain, Md Alam
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 74 - 81
  • [8] Review of Advances in Machine Learning Based Protein Secondary Structure Prediction
    Muhammad, Muhammad Yusuf
    Prasad, Rajesh
    Fonkam, Mathias
    Umar, Hadiza Ali
    2019 15TH INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2019,
  • [9] Machine learning techniques for protein secondary structure prediction: An overview and evaluation
    Yoo, Paul D.
    Zhou, Bing Bing
    Zomaya, Albert Y.
    CURRENT BIOINFORMATICS, 2008, 3 (02) : 74 - 86
  • [10] A comparison of two machine learning methods for protein secondary structure prediction
    Wang, LH
    Liu, J
    Zhou, HB
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 2730 - 2735