Finding optimal pairs of patterns

Authors
Bannai, H [1]
Hyyrö, H
Shinohara, A
Takeda, M
Nakai, K
Miyano, S
Affiliations
[1] Univ Tokyo, Inst Med Sci, Human Genome Ctr, Tokyo 1088639, Japan
[2] Kyushu Univ 33, Dept Informat, Fukuoka 8128581, Japan
Source
ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS | 2004 / Vol. 3240
Keywords
DOI
Not available
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
We consider the problem of finding the optimal pair of string patterns for discriminating between two sets of strings, i.e., finding the pair of patterns that is best with respect to some appropriate scoring function that gives higher scores to pattern pairs which occur more often in the strings of one set and less often in the other. We present an O(N^2) time algorithm for finding the optimal pair of substring patterns, where N is the total length of the strings. The algorithm considers all possible Boolean combinations of the patterns, e.g., patterns of the form p ∧ ¬q, which indicates that the pattern pair is considered to match a given string s if p occurs in s AND q does NOT occur in s. The same algorithm can be applied to a variant of the problem where we are given a single set of sequences along with a numeric attribute assigned to each sequence, and the problem is to find the optimal pattern pair whose occurrence in the sequences is correlated with this numeric attribute. An efficient implementation based on suffix arrays is presented, and the algorithm is applied to several nucleotide sequence datasets of moderate size, combined with microarray gene expression data, aiming to find regulatory elements that cooperate, complement, or compete with each other in enhancing and/or silencing certain genomic functions.
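To make the problem statement concrete, the following is a minimal brute-force sketch of the search the abstract describes. It is not the paper's quadratic-time suffix-array algorithm: it simply enumerates short substrings, tries a few Boolean combinations of "p occurs" and "q occurs", and keeps the best-scoring pair. The function names, the `max_len` cap, the particular set of combinations, and the scoring function (difference of match fractions) are all assumptions chosen for illustration.

```python
def substrings(strings, max_len=3):
    """Collect all distinct substrings of length 1..max_len (illustrative cap)."""
    subs = set()
    for s in strings:
        for i in range(len(s)):
            for j in range(i + 1, min(len(s), i + max_len) + 1):
                subs.add(s[i:j])
    return sorted(subs)


# A few Boolean combinations of "p occurs in s" (a) and "q occurs in s" (b).
# The set considered in the paper may differ; this list is an assumption.
COMBINATIONS = {
    "p AND q": lambda a, b: a and b,
    "p AND NOT q": lambda a, b: a and not b,
    "NOT p AND q": lambda a, b: (not a) and b,
    "p OR q": lambda a, b: a or b,
}


def score(pos_hits, neg_hits, n_pos, n_neg):
    """Hypothetical scoring function: fraction matched in the first set
    minus fraction matched in the second set."""
    return pos_hits / n_pos - neg_hits / n_neg


def best_pattern_pair(positives, negatives, max_len=3):
    """Exhaustively search pattern pairs; returns (score, p, q, combination)."""
    candidates = substrings(positives + negatives, max_len)
    best = (float("-inf"), None, None, None)
    for p in candidates:
        for q in candidates:
            if p == q:
                continue
            for name, combine in COMBINATIONS.items():
                pos_hits = sum(combine(p in s, q in s) for s in positives)
                neg_hits = sum(combine(p in s, q in s) for s in negatives)
                sc = score(pos_hits, neg_hits, len(positives), len(negatives))
                if sc > best[0]:
                    best = (sc, p, q, name)
    return best


if __name__ == "__main__":
    # Toy data: the first set contains "AC" but never "GG".
    pos = ["ACGT", "TACA", "CACT"]
    neg = ["ACGG", "GGAC", "TTTT"]
    print(best_pattern_pair(pos, neg))
```

This sketch re-scans every string for every candidate pair, so its cost grows much faster than the O(N^2) bound stated in the abstract; it is only meant to show what "optimal pair under a scoring function" means on a toy input.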
Pages: 450-462
Number of pages: 13