A uniform projection method for motif discovery in DNA sequences

被引:19
|
作者
Raphael, B [1 ]
Liu, LT [1 ]
Varghese, G [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
关键词
motif discovery; transcription factor binding sites; random projection; combinatorial designs; low-discrepancy sequences;
D O I
10.1109/TCBB.2004.14
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Buhler and Tompa [5] introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases.
引用
收藏
页码:91 / 94
页数:4
相关论文
共 50 条
  • [41] An approximation algorithm for alignment of multiple sequences using motif discovery
    Parida, L
    Floratos, A
    Rigoutsos, I
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 1999, 3 (2-3) : 247 - 275
  • [42] A Graph-Theoretical Approach for Motif Discovery in Protein Sequences
    Czeizler, Elena
    Hirvola, Tommi
    Karhu, Kalle
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (01) : 121 - 130
  • [43] Efficient automatic exact motif discovery algorithms for biological sequences
    Karci, Ali
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (04) : 7952 - 7963
  • [44] A Suite of Techniques to Improve Random Projection in Time Series Motif Discovery
    Dang Xuan Binh
    Duong Tuan Anh
    2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2016, : 13 - 18
  • [45] HeliCis: a DNA motif discovery tool for colocalized motif pairs with periodic spacing
    Larsson, Erik
    Lindahl, Per
    Mostad, Petter
    BMC BIOINFORMATICS, 2007, 8 (1) : 418
  • [46] HeliCis: a DNA motif discovery tool for colocalized motif pairs with periodic spacing
    Erik Larsson
    Per Lindahl
    Petter Mostad
    BMC Bioinformatics, 8
  • [47] RMotifGen: random motif generator for DNA and protein sequences
    Rouchka, Eric C.
    Hardin, C. Timothy
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [48] Accelerating Motif Finding in DNA Sequences with Multicore CPUs
    Perera, Pramitha
    Ragel, Roshan
    2013 8TH IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2013, : 242 - +
  • [49] Direct Imaging of DNA Motif Sequences With Encoded Nanoparticles
    Fernandez, Renny E.
    Mastrangelo, Carlos H.
    2011 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2011, : 4770 - 4773
  • [50] DNA Motif Recognition Modeling from Protein Sequences
    Wong, Ka-Chun
    ISCIENCE, 2018, 7 : 198 - +