Finding optimal pairs of patterns

Authors
Bannai, H [1]
Hyyrö, H
Shinohara, A
Takeda, M
Nakai, K
Miyano, S
Affiliations
[1] Univ Tokyo, Inst Med Sci, Human Genome Ctr, Tokyo 1088639, Japan
[2] Kyushu Univ 33, Dept Informat, Fukuoka 8128581, Japan
Source
ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS | 2004 / Vol. 3240
Keywords
DOI
Not available
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
We consider the problem of finding the optimal pair of string patterns for discriminating between two sets of strings, i.e. finding the pair of patterns that is best with respect to some appropriate scoring function that gives higher scores to pattern pairs which occur more in the strings of one set, but less in the other. We present an O(N^2) time algorithm for finding the optimal pair of substring patterns, where N is the total length of the strings. The algorithm looks for all possible Boolean combinations of the patterns, e.g. patterns of the form p ∧ ¬q, which indicates that the pattern pair is considered to match a given string s if p occurs in s AND q does NOT occur in s. The same algorithm can be applied to a variant of the problem where we are given a single set of sequences along with a numeric attribute assigned to each sequence, and the problem is to find the optimal pattern pair whose occurrence in the sequences is correlated with this numeric attribute. An efficient implementation based on suffix arrays is presented, and the algorithm is applied to several nucleotide sequence datasets of moderate size, combined with microarray gene expression data, aiming to find regulatory elements that cooperate, complement, or compete with each other in enhancing and/or silencing certain genomic functions.
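To make the problem statement concrete, the following is a minimal brute-force sketch of the search the abstract describes. It is not the paper's O(N^2) suffix-array algorithm: it simply enumerates every pair of substrings and a few Boolean combinations, which is far slower. The scoring function (matched strings in the first set minus matched strings in the second), the set of combinations, and all names such as `best_pattern_pair` are assumptions chosen for illustration only.

```python
def substrings(strings):
    """Collect every distinct non-empty substring of the input strings."""
    subs = set()
    for s in strings:
        for i in range(len(s)):
            for j in range(i + 1, len(s) + 1):
                subs.add(s[i:j])
    return subs


# Illustrative Boolean combinations of "p occurs in s" (a) and "q occurs in s" (b).
# The abstract's example p AND NOT q is one of them; the others are assumptions.
COMBINATIONS = {
    "p AND q": lambda a, b: a and b,
    "p AND NOT q": lambda a, b: a and not b,
    "p OR q": lambda a, b: a or b,
}


def best_pattern_pair(pos, neg):
    """Exhaustively search substring pairs (p, q) and Boolean combinations.

    Assumed score: number of strings matched in `pos` minus the number
    matched in `neg`. Returns (score, p, combination_name, q).
    """
    candidates = sorted(substrings(pos + neg))
    best = None
    for p in candidates:
        for q in candidates:
            for name, combine in COMBINATIONS.items():
                hits_pos = sum(combine(p in s, q in s) for s in pos)
                hits_neg = sum(combine(p in s, q in s) for s in neg)
                score = hits_pos - hits_neg
                if best is None or score > best[0]:
                    best = (score, p, name, q)
    return best


if __name__ == "__main__":
    # Toy data: no single substring separates the two sets,
    # but a pair with a negated pattern does.
    pos = ["acgg", "gacg"]
    neg = ["acgtt", "ggtt"]
    print(best_pattern_pair(pos, neg))
```

In the toy data, every substring shared by both strings of the first set also occurs in `acgtt`, so a single pattern cannot reach the maximum score of 2, while a pair such as `acg` AND NOT `t` can. Which of the tied pairs is returned depends on the enumeration order.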
Pages: 450-462
Number of pages: 13