A uniform projection method for motif discovery in DNA sequences

被引:19
|
作者
Raphael, B [1 ]
Liu, LT [1 ]
Varghese, G [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
关键词
motif discovery; transcription factor binding sites; random projection; combinatorial designs; low-discrepancy sequences;
D O I
10.1109/TCBB.2004.14
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Buhler and Tompa [5] introduced the random projection algorithm for the motif discovery problem and demonstrated that this algorithm performs well on both simulated and biological samples. We describe a modification of the random projection algorithm, called the uniform projection algorithm, which utilizes a different choice of projections. We replace the random selection of projections by a greedy heuristic that approximately equalizes the coverage of the projections. We show that this change in selection of projections leads to improved performance on motif discovery problems. Furthermore, the uniform projection algorithm is directly applicable to other problems where the random projection algorithm has been used, including comparison of protein sequence databases.
引用
收藏
页码:91 / 94
页数:4
相关论文
共 50 条
  • [31] Graphical approach for motif recognition in DNA sequences
    Yang, X
    Rajapakse, JC
    PROCEEDINGS OF THE 2004 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2004, : 147 - 152
  • [32] Fast structured motif search in DNA sequences
    Halachev, Mihail
    Shiri, Nematollaah
    Communications in Computer and Information Science, 2008, 13 : 58 - 73
  • [33] An efficient method for significant motifs discovery from multiple DNA sequences
    Al-Ssulami, Abdulrakeeb M.
    Azmi, Aqil M.
    Mathkour, Hassan
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2017, 15 (04)
  • [34] Performance evaluation of DNA motif discovery programs
    Singh, Chandra Prakash
    Khan, Feroz
    Mishra, Bhartendu Nath
    Chauhan, Durg Singh
    BIOINFORMATION, 2008, 3 (05) : 205 - 212
  • [35] Seeder: discriminative seeding DNA motif discovery
    Fauteux, Francois
    Blanchette, Mathieu
    Stromvik, Martina V.
    BIOINFORMATICS, 2008, 24 (20) : 2303 - 2307
  • [36] The Limits of De Novo DNA Motif Discovery
    Simcha, David
    Price, Nathan D.
    Geman, Donald
    PLOS ONE, 2012, 7 (11):
  • [37] Enhancing Gibbs Sampling Method for Motif Finding in DNA with Initial Graph Representation of Sequences
    Stepancic, Ziva
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2014, 21 (10) : 741 - 752
  • [38] Motif Discovery and Phylogenetic Analysis of Hepatitis B Virus Sequences
    ZHANG Qi
    ZHANG Jun-peng
    GAO Jian-mei
    HE Jian-feng
    YAN Xin-min
    MA Lei
    LI Jiu-yong
    Chinese Journal of Biomedical Engineering, 2016, 25 (03) : 121 - 126
  • [39] Greedy mixture learning for multiple motif discovery in biological sequences
    Blekas, K
    Fotiadis, DI
    Likas, A
    BIOINFORMATICS, 2003, 19 (05) : 607 - 617
  • [40] An Approximation Algorithm for Alignment of Multiple Sequences using Motif Discovery
    Laxmi Parida
    Aris Floratos
    Isidore Rigoutsos
    Journal of Combinatorial Optimization, 1999, 3 : 247 - 275