Physicochemical property based computational scheme for classifying DNA sequence elements of Saccharomyces cerevisiae

被引:5
|
作者
Jaiswal, Atul Kumar [1 ]
Krishnamachari, Annangarachari [1 ]
机构
[1] JNU, Sch Computat & Integrat Sci, New Delhi 110067, India
关键词
DNA sequence elements; Physicochemical properties; DTPM; Computational scheme; Machine learning; MOLECULAR-DYNAMICS SIMULATIONS; UNIQUE TETRANUCLEOTIDE SEQUENCES; OLIGONUCLEOTIDES; DUPLEXES; GENES; PAIR;
D O I
10.1016/j.compbiolchem.2018.12.014
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
GenerationE of huge "omics" data necessitates the development and application of computational methods to annotate the data in terms of biological features. In the context of DNA sequence, it is important to unravel the hidden physicochemical signatures. For this purpose, we have considered various sequence elements such as promoter, ACS, LTRs, telomere, and retrotransposon of the model organism Saccharomyces cerevisiae. Contributions due to di-nucleotides play a major role in studying the DNA conformation profile. The physicochemical parameters used are hydrogen bonding energy, stacking energy and solvation energy per base pair. Our computational study shows that all sequence elements in this study have distinctive physicochemical signatures and the same can be exploited for prediction experiments. The order that we see in a DNA sequence is dictated by biological regions and hence, there exists role of dependency in the sequence makeup, keeping this in mind we are proposing two computational schemes (a) using a windowing block size procedure and (b) using di-nucleotide transitions. We obtained better discriminating profile when we analyzed the sequence data in windowing manner. In the second novel approach, we introduced the di-nucleotide transition probability matrix (DTPM) to study the hidden layer of information embedded in the sequences. DTPM has been used as weights for scanning and predictions. This proposed computational scheme incorporates the memory property which is more realistic to study the physicochemical properties embedded in DNA sequences. Our analysis shows that the DTPM scheme performs better than the existing method in this applied region. Characterization of these elements will be a key to genome editing applications and advanced machine learning approaches may also require such distinctive profiles as useful input features.
引用
收藏
页码:193 / 201
页数:9
相关论文
共 50 条
  • [1] DNA sequence and functional analysis of homologous ARS elements of Saccharomyces cerevisiae and S-carlsbergensis
    Theis, JF
    Yang, C
    Schaefer, CB
    Newlon, CS
    GENETICS, 1999, 152 (03) : 943 - 952
  • [2] RNA-DNA sequence differences in Saccharomyces cerevisiae
    Wang, Isabel X.
    Grunseich, Christopher
    Chung, Youree G.
    Kwak, Hojoong
    Ramrattan, Girish
    Zhu, Zhengwei
    Cheung, Vivian G.
    GENOME RESEARCH, 2016, 26 (11) : 1544 - 1554
  • [3] DNA sequence analysis of spontaneous mutagenesis in Saccharomyces cerevisiae
    Kunz, BA
    Ramachandran, K
    Vonarx, EJ
    GENETICS, 1998, 148 (04) : 1491 - 1505
  • [4] INSTABILITY OF SIMPLE SEQUENCE DNA IN SACCHAROMYCES-CEREVISIAE
    HENDERSON, ST
    PETES, TD
    MOLECULAR AND CELLULAR BIOLOGY, 1992, 12 (06) : 2749 - 2757
  • [5] DNA sequence elements required for partitioning competence of the Saccharomyces cerevisiae 2-micron plasmid STB locus
    McQuaid, Mary E.
    Polvi, Elizabeth J.
    Dobson, Melanie J.
    NUCLEIC ACIDS RESEARCH, 2019, 47 (02) : 716 - 728
  • [6] DNA-SEQUENCE ANALYSIS OF ARS ELEMENTS FROM CHROMOSOME-III OF SACCHAROMYCES-CEREVISIAE - IDENTIFICATION OF A NEW CONSERVED SEQUENCE
    PALZKILL, TG
    OLIVER, SG
    NEWLON, CS
    NUCLEIC ACIDS RESEARCH, 1986, 14 (15) : 6247 - 6264
  • [7] DNA elements important for CAG•CTG repeat thresholds in Saccharomyces cerevisiae
    Dixon, MJ
    Lahue, RS
    NUCLEIC ACIDS RESEARCH, 2004, 32 (04) : 1289 - 1297
  • [8] GROWTH AND THE DNA-DIVISION SEQUENCE IN THE YEAST SACCHAROMYCES-CEREVISIAE
    SINGER, RA
    JOHNSTON, GC
    EXPERIMENTAL CELL RESEARCH, 1985, 157 (02) : 387 - 396
  • [9] Sequence analysis of DNA randomly amplified from the Saccharomyces cerevisiae genome
    McGrath, A
    Higgins, DG
    McCarthy, TV
    MOLECULAR AND CELLULAR PROBES, 1998, 12 (06) : 397 - 405
  • [10] Identification of DNA cis elements essential for expansion of ribosomal DNA repeats in Saccharomyces cerevisiae
    Kobayashi, T
    Nomura, M
    Horiuchi, T
    MOLECULAR AND CELLULAR BIOLOGY, 2001, 21 (01) : 136 - 147