Identify origin of replication in Saccharomyces cerevisiae using two-step feature selection technique

Cited by: 171
Authors
Dao, Fu-Ying [1, 2]
Lv, Hao [1, 2]
Wang, Fang [1, 2]
Feng, Chao-Qin [1, 2]
Ding, Hui [1, 2]
Chen, Wei [1, 2, 3]
Lin, Hao [1, 2]
Affiliations
[1] Univ Elect Sci & Technol China, Sch Life Sci & Technol, Minist Educ, Key Lab NeuroInformat, Chengdu 610054, Sichuan, Peoples R China
[2] Univ Elect Sci & Technol China, Ctr Informat Biol, Chengdu 610054, Sichuan, Peoples R China
[3] Chengdu Univ Tradit Chinese Med, Innovat Inst Chinese Med & Pharm, Chengdu 611730, Sichuan, Peoples R China
Keywords
SEQUENCE-BASED PREDICTOR; UPDATED RESOURCE; WEB SERVER; DNA; YEAST; SITES; RNA; RECOGNITION; INITIATION; PROTEINS;
DOI
10.1093/bioinformatics/bty943
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification code
071010; 081704;
Abstract
Motivation: DNA replication is a key step in maintaining the continuity of genetic information between parents and offspring. The initiation site of DNA replication, also called the origin of replication (ORI), plays an extremely important role in this basic biochemical process. Rapidly and effectively locating ORIs in a genome therefore provides key clues for genome analysis. Although biochemical experiments can provide detailed information about ORIs, they are costly and time-consuming. Computational methods complement experimental techniques and can overcome these disadvantages.
Results: In this study, we developed a predictor called iORI-PseKNC2.0 to identify ORIs in the Saccharomyces cerevisiae genome based on sequence information. PseKNC incorporating 90 physicochemical properties was used to formulate ORI and non-ORI samples. To improve accuracy, a two-step feature selection procedure was applied to exclude redundant and noisy information. As a result, an overall success rate of 88.53% was achieved in the 5-fold cross-validation test using a support vector machine.
Availability and implementation: Based on the proposed model, a user-friendly webserver was established and can be freely accessed at http://lin-group.cn/server/iORI-PseKNC2.0. The webserver is intended to make the predictor convenient for wet-lab researchers.
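The abstract describes encoding sequences with PseKNC (pseudo k-tuple nucleotide composition) and then filtering features before classification. The sketch below is only a simplified illustration of that general idea, not the authors' implementation: it computes the plain k-tuple composition alone (the physicochemical correlation terms of PseKNC are omitted) and ranks features by F-score, which is assumed here as a generic filter criterion because the abstract does not name the two selection steps. The function names `kmer_composition`, `f_score`, and `rank_features` are hypothetical.

```python
from itertools import product


def kmer_composition(seq, k=2):
    """Normalized k-mer frequencies of a DNA sequence, in lexicographic k-mer order."""
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in counts:  # windows containing N or other symbols are skipped
            counts[word] += 1
            total += 1
    return [counts[w] / total if total else 0.0 for w in kmers]


def f_score(pos, neg):
    """F-score of one feature from its values in positive and negative samples.

    Assumed filter criterion; each class needs at least two samples.
    """
    n_pos, n_neg = len(pos), len(neg)
    mean_pos = sum(pos) / n_pos
    mean_neg = sum(neg) / n_neg
    mean_all = (sum(pos) + sum(neg)) / (n_pos + n_neg)
    between = (mean_pos - mean_all) ** 2 + (mean_neg - mean_all) ** 2
    var_pos = sum((x - mean_pos) ** 2 for x in pos) / (n_pos - 1)
    var_neg = sum((x - mean_neg) ** 2 for x in neg) / (n_neg - 1)
    within = var_pos + var_neg
    if within == 0:
        # Zero within-class variance: perfectly separating or constant feature.
        return float("inf") if between > 0 else 0.0
    return between / within


def rank_features(pos_vectors, neg_vectors):
    """Return feature indices sorted from most to least discriminative."""
    n_features = len(pos_vectors[0])
    scores = [
        f_score([v[j] for v in pos_vectors], [v[j] for v in neg_vectors])
        for j in range(n_features)
    ]
    return sorted(range(n_features), key=lambda j: scores[j], reverse=True)
```

A ranking like this would typically feed a second step, such as adding features one at a time and keeping the subset with the best cross-validated accuracy; that step and the support vector machine itself are not shown here.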
Pages: 2075-2083
Number of pages: 9