Protein Secondary Structure Prediction Using Machine Learning

Cited by: 0
Authors
Saha, Sriparna [1]
Ekbal, Asif [1]
Sharma, Sidharth [1]
Bandyopadhyay, Sanghamitra [2]
Maulik, Ujjwal [3]
Affiliations
[1] Indian Inst Technol Patna, Dept Comp Sci & Engn, Patna, Bihar, India
[2] Indian Stat Inst, Machine Intelligence Unit, Kolkata, India
[3] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata, India
Source
INTELLIGENT INFORMATICS | 2013 / Vol. 182
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Protein structure prediction is an important component in understanding protein structures and functions. Accurate prediction of protein secondary structure helps in understanding protein folding. In many applications, such as drug discovery, the secondary structure of unknown proteins must be predicted. In this paper we report our first attempt at secondary structure prediction and approach it as a sequence classification problem, where the task is equivalent to assigning a sequence of labels (i.e., helix, sheet, and coil) to the given protein sequence. We propose an ensemble technique based on two stochastic supervised machine learning algorithms, namely the Maximum Entropy Markov Model (MEMM) and the Conditional Random Field (CRF). We identify and implement a set of features that mostly deal with contextual information. The proposed approach is evaluated on a benchmark dataset, and it yields performance encouraging enough to warrant further exploration. We obtain a highest predictive accuracy of 61.26% and a segment overlap score (SOV) of 52.30%.
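The abstract frames the task as assigning one label per residue from contextual features, but it does not list the features or the scoring code. The following is only a minimal sketch of that framing, not the authors' implementation. The window half-width of 2, the feature names, the `<pad>` symbol, the one-letter labels H/E/C (helix, sheet, coil), and the function names are all assumptions made for illustration. The accuracy function computes plain per-residue agreement, which is assumed here to correspond to the reported predictive accuracy; the segment overlap score is not implemented.

```python
from typing import Dict, List


def window_features(sequence: str, i: int, half_width: int = 2) -> Dict[str, str]:
    """Contextual features for residue i: the residues in a +/- half_width window.

    The window size and feature names are illustrative assumptions.
    """
    feats: Dict[str, str] = {}
    for offset in range(-half_width, half_width + 1):
        j = i + offset
        # Positions beyond either end of the chain get a padding symbol.
        feats[f"res[{offset:+d}]"] = sequence[j] if 0 <= j < len(sequence) else "<pad>"
    return feats


def sequence_features(sequence: str, half_width: int = 2) -> List[Dict[str, str]]:
    """One feature dictionary per residue, in sequence order."""
    return [window_features(sequence, i, half_width) for i in range(len(sequence))]


def per_residue_accuracy(predicted: str, observed: str) -> float:
    """Percentage of residues whose predicted label matches the observed label."""
    if len(predicted) != len(observed):
        raise ValueError("label strings must have the same length")
    if not observed:
        raise ValueError("label strings must not be empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return 100.0 * matches / len(observed)
```

Feature dictionaries of this shape are a common input format for sequence labelers such as CRFs, where each residue's dictionary is paired with its label. How the MEMM and CRF outputs are combined in the ensemble is not described in the abstract, so it is left out of the sketch.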
Pages: 57 / +
Page count: 2
Related papers
50 in total
  • [21] Prediction of Protein Secondary Structure using Support Vector Machine with PSSM Profiles
    Wang, Yanchun
    Cheng, Jinyong
    Liu, Yihui
    Chen, Yehong
    2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 502 - 505
  • [22] Protein secondary structure prediction with high accuracy using Support Vector Machine
    Shoyaib, Mohammad
    Baker, Syed Murtuza
    Jabid, Taskeed
    Anwar, Firoz
    Khan, Haseena
    PROCEEDINGS OF 10TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2007), 2007, : 99 - +
  • [23] Protein structure prediction (RMSD ≤ 5 Å) using machine learning models
    Pathak, Yadunath
    Rana, Prashant Singh
    Singh, P. K.
    Saraswat, Mukesh
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2016, 14 (01) : 71 - 85
  • [24] New machine learning methods for prediction of protein secondary structures
    Blazewicz, Jacek
    Lukasiak, Piotr
    Wilk, Szymon
    CONTROL AND CYBERNETICS, 2007, 36 (01): : 183 - 201
  • [25] Secondary Structure Prediction of Protein using Resilient Back Propagation Learning Algorithm
    Dongardive, Jyotshna
    Abraham, Siby
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2015, 6 (1-2): : 22 - 29
  • [26] Protein secondary structure prediction using neural networks and deep learning: A review
    Wardah, Wafaa
    Khan, M. G. M.
    Sharma, Alok
    Rashid, Mahmood A.
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 81 : 1 - 8
  • [27] Protein secondary structure prediction using support vector machine with advanced encoding schemes
    Hu, HJ
    Pan, Y
    Harrison, R
    Tai, PC
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY VI, 2004, 5433 : 80 - 87
  • [28] Protein Secondary Structure Prediction Based on Deep Learning
    Zheng, Lin
    Li, Hong-ling
    Wu, Nan
    Ao, Li
    3RD INTERNATIONAL SYMPOSIUM ON MECHATRONICS AND INDUSTRIAL INFORMATICS, (ISMII 2017), 2017, : 171 - 177
  • [29] A Deep Learning Approach for Prediction of Protein Secondary Structure
    Zubair, Muhammad
    Hanif, Muhammad Kashif
    Alabdulkreem, Eatedal
    Ghadi, Yazeed
    Khan, Muhammad Irfan
    Sarwar, Muhammad Umer
    Hanif, Ayesha
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3705 - 3718
  • [30] Protein secondary structure prediction with Bayesian learning method
    Wang, PL
    Zhang, D
    14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 252 - 257