Efficient Maximal Biclique Enumeration on Large Uncertain Bipartite Graphs

被引:3
|
作者
Wang, Jianhua [1 ]
Yang, Jianye [2 ]
Ma, Ziyi [1 ]
Zhang, Chengyuan [1 ]
Yang, Shiyu [2 ]
Zhang, Wenjie [3 ]
机构
[1] Hunan Univ, Coll Comp Sci & Elect Engn, Changsha 410012, Peoples R China
[2] Guangzhou Univ, Cyberspace Inst Adv Technol, Guangzhou 510006, Peoples R China
[3] Univ New South Wales, Sch Comp Sci & Engn, Sydney, NSW 2052, Australia
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Maximal biclique; uncertain bipartite graph; search strategy; pruning technique; CLIQUES;
D O I
10.1109/TKDE.2023.3272110
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we study the problem of maximal biclique enumeration on large uncertain bipartite graphs. Given an uncertain bipartite graph G=(U,V,E,p), a probability threshold tau, and two size constraints alpha and beta, we aim to efficiently enumerate all maximal tau-bicliques in G, where a maximal tau-biclique B(L,R) is a complete subgraph of G with (1) the probability of B is no less than tau, (2) |L| >= alpha and |R| >= beta, and (3) B is a maximal complete subgraph satisfying (1) and (2). This problem has many applications, such as biclustering of gene expression data, fraud detection, similar group identification, etc. Despite the wide range of applications, to the best of our knowledge, we note that there are no efficient and scalable solutions to this problem in the literature. This problem is computationally challenging due to its #P-completeness. In this article, we propose a competitive branch-and-bound method, namely MBEN, which explores the search space in a depth-first manner with a variety of pruning techniques. To improve the performance of MBEN, we propose several novel and efficient search processing optimizations. First, we always select the side with fewer candidates to expand the search space. With this search strategy, we have a chance to prune the fruitless branches early. Second, we devise an advanced pruning technique by considering the size pruning and probability pruning at the same time to boost the pruning capacity. Last, we implement MBEN with pre-allocated arrays and pointer maintaining techniques such that the frequent work sets creating operations can be substituted by array element switching operations. In addition, we introduce useful graph reduction techniques to further accelerate the computation. Comprehensive performance studies on 10 real datasets demonstrate that our proposals can significantly outperform the baseline methods by more than two orders of magnitude.
引用
收藏
页码:12634 / 12648
页数:15
相关论文
共 50 条
  • [31] Fast Maximal Clique Enumeration on Uncertain Graphs: A Pivot-based Approach
    Dai, Qiangqiang
    Li, Rong-Hua
    Liao, Meihao
    Chen, Hongzhi
    Wang, Guoren
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA (SIGMOD '22), 2022, : 2034 - 2047
  • [32] Maximal Energy Bipartite Graphs
    Jack H. Koolen
    Vincent Moulton
    Graphs and Combinatorics, 2003, 19 : 131 - 135
  • [33] Maximal paths in bipartite graphs
    Martin, R
    AMERICAN MATHEMATICAL MONTHLY, 2003, 110 (08): : 747 - 747
  • [34] Maximal energy bipartite graphs
    Koolen, JH
    Moulton, V
    GRAPHS AND COMBINATORICS, 2003, 19 (01) : 131 - 135
  • [35] Complexity of minimum biclique cover and minimum biclique decomposition for bipartite domino-free graphs
    Amilhastre, J
    Vilarem, M
    Janssen, P
    DISCRETE APPLIED MATHEMATICS, 1998, 86 (2-3) : 125 - 144
  • [36] A linear time algorithm for maximal clique enumeration in large sparse graphs
    Yu, Ting
    Liu, Mengchi
    INFORMATION PROCESSING LETTERS, 2017, 125 : 35 - 40
  • [37] Solving the maximum edge biclique packing problem on unbalanced bipartite graphs
    Acuna, V.
    Ferreira, C. E.
    Freire, A. S.
    Moreno, E.
    DISCRETE APPLIED MATHEMATICS, 2014, 164 : 2 - 12
  • [38] An Efficient Parallel Algorithm for Maximal Clique Enumeration in a Large Graph
    Tang X.-C.
    Zhou J.-W.
    Tian K.-F.
    Li Z.-H.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (03): : 513 - 531
  • [39] Efficient Structural Clustering in Large Uncertain Graphs
    Liang, Yongjiang
    Hu, Tingting
    Zhao, Peixiang
    2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, : 1966 - 1969
  • [40] Enumeration for spanning forests of complete bipartite graphs
    Jin, YL
    Liu, CL
    ARS COMBINATORIA, 2004, 70 : 135 - 138