ARBic: an all-round biclustering algorithm for analyzing gene expression data

被引:5
|
作者
Liu, Xiangyu [1 ]
Yu, Ting [1 ]
Zhao, Xiaoyu [1 ]
Long, Chaoyi [1 ]
Han, Renmin [1 ]
Su, Zhengchang [3 ]
Li, Guojun [1 ,2 ]
机构
[1] Shandong Univ, Res Ctr Math & Interdisciplinary Sci, Jinan 250100, Peoples R China
[2] Liaocheng Univ, Sch Math Sci, Liaocheng 252000, Peoples R China
[3] Univ North Carolina Charlotte, Dept Bioinformat & Genom, Charlotte, NC 28223 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1093/nargab/lqad009
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identifying significant biclusters of genes with specific expression patterns is an effective approach to reveal functionally correlated genes in gene expression data. However, none of existing algorithms can simultaneously identify both broader and narrower biclusters due to their failure of balancing between effectiveness and efficiency. We introduced ARBic, an algorithm which is capable of accurately identifying any significant biclusters of any shape, including broader, narrower and square, in any large scale gene expression dataset. ARBic was designed by integrating column-based and row-based strategies into a single biclustering procedure. The column-based strategy borrowed from RecBic, a recently published biclustering tool, extracts narrower biclusters, while the row-based strategy that iteratively finds the longest path in a specific directed graph, extracts broader ones. Being tested and compared to other seven salient biclustering algorithms on simulated datasets, ARBic achieves at least an average of 29% higher recovery, relevance and$\ {F}_1$ scores than the best existing tool. In addition, ARBic substantially outperforms all tools on real datasets and is more robust to noises, bicluster shapes and dataset types.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Distributed robust biclustering algorithm for gene expression analysis
    Tchagang, Alain B.
    Tewfik, Ahmed H.
    2007 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS, 2007, : 50 - +
  • [42] A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
    Chu, He-Ming
    Liu, Jin-Xing
    Zhang, Ke
    Zheng, Chun-Hou
    Wang, Juan
    Kong, Xiang-Zhen
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [43] A novel biclustering of gene expression data based on hybrid BAFS-BSA algorithm
    Cui, Yan
    Zhang, Rui
    Gao, Huacheng
    Lu, Yuanyuan
    Liu, Yinqiu
    Gao, Guangwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 14811 - 14824
  • [44] A binary biclustering algorithm based on the adjacency difference matrix for gene expression data analysis
    He-Ming Chu
    Jin-Xing Liu
    Ke Zhang
    Chun-Hou Zheng
    Juan Wang
    Xiang-Zhen Kong
    BMC Bioinformatics, 23
  • [45] A New Hybrid Cuckoo Search Algorithm for Biclustering of Microarray Gene-Expression Data
    Balamurugan, R.
    Natarajan, A. M.
    Premalatha, K.
    APPLIED ARTIFICIAL INTELLIGENCE, 2018, 32 (7-8) : 644 - 659
  • [46] UniBic: Sequential row-based biclustering algorithm for analysis of gene expression data
    Zhenjia Wang
    Guojun Li
    Robert W. Robinson
    Xiuzhen Huang
    Scientific Reports, 6
  • [47] Evolutionary Local Search Algorithm for the biclustering of gene expression data based on biological knowledge
    Maatouk, Ons
    Ayadi, Wassim
    Bouziri, Hend
    Duval, Beatrice
    APPLIED SOFT COMPUTING, 2021, 104
  • [48] Biclustering expression data using node addition algorithm
    Borah, B.
    Bhattacharyya, D. K.
    ADCOM 2007: PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2007, : 307 - 312
  • [49] UniBic: Sequential row-based biclustering algorithm for analysis of gene expression data
    Wang, Zhenjia
    Li, Guojun
    Robinson, Robert W.
    Huang, Xiuzhen
    SCIENTIFIC REPORTS, 2016, 6
  • [50] Co-evolutionary Algorithm for Analyzing Gene Expression Data
    Claver, Jimbo H.
    Ngongo, Isidore. S.
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, SIMULATION AND MODELLING, 2016, 41 : 120 - 124