A New Binary Biclustering Algorithm Based on Weight Adjacency Difference Matrix for Analyzing Gene Expression Data

被引:5
|
作者
Chu, He-Ming [1 ]
Kong, Xiang-Zhen [1 ]
Liu, Jin-Xing [1 ]
Zheng, Chun-Hou [1 ]
Zhang, Han [2 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Shandong, Peoples R China
[2] Jishou Univ, Sch Informat Sci & Engn, Jishou 416000, Hunan, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Biclustering; gene expression data; weight matrix; binary matrix; HETEROGENEITY; PATHWAYS; PATTERNS;
D O I
10.1109/TCBB.2023.3283801
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Biclustering algorithms are essential for processing gene expression data. However, to process the dataset, most biclustering algorithms require preprocessing the data matrix into a binary matrix. Regrettably, this type of preprocessing may introduce noise or cause information loss in the binary matrix, which would reduce the biclustering algorithm's ability to effectively obtain the optimal biclusters. In this paper, we propose a new preprocessing method named Mean-Standard Deviation (MSD) to resolve the problem. Additionally, we introduce a new biclustering algorithm called Weight Adjacency Difference Matrix Binary Biclustering (W-AMBB) to effectively process datasets containing overlapping biclusters. The basic idea is to create a weighted adjacency difference matrix by applying weights to a binary matrix that is derived from the data matrix. This allows us to identify genes with significant associations in sample data by efficiently identifying similar genes that respond to specific conditions. Furthermore, the performance of the W-AMBB algorithm was tested on both synthetic and real datasets and compared with other classical biclustering methods. The experiment results demonstrate that the W-AMBB algorithm is significantly more robust than the compared biclustering methods on the synthetic dataset. Additionally, the results of the GO enrichment analysis show that the W-AMBB method possesses biological significance on real datasets.
引用
收藏
页码:2802 / 2809
页数:8
相关论文
共 50 条
  • [31] A linear time biclustering algorithm for time series gene expression data
    Madeira, SC
    Oliveira, AL
    ALGORITHMS IN BIOINFORMATICS, PROCEEDINGS, 2005, 3692 : 39 - 52
  • [32] Exact biclustering algorithm for the analysis of large gene expression data sets
    Oliver Voggenreiter
    Stefan Bleuler
    Wilhelm Gruissem
    BMC Bioinformatics, 13 (Suppl 18)
  • [33] An Overlapping Control-Biclustering Algorithm from Gene Expression Data
    Nepomuceno, Juan A.
    Troncoso, Alicia
    Aguilar-Ruiz, Jesus S.
    2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 1239 - +
  • [34] KMeans Greedy Search Hybrid Algorithm for Biclustering Gene Expression Data
    Das, Shyama
    Idicula, Sumam Mary
    ADVANCES IN COMPUTATIONAL BIOLOGY, 2010, 680 : 181 - 188
  • [35] An Efficiency and Fast Algorithm Based on Gene Expressing Data Biclustering
    Liu Wei
    Qu Bo
    Yang Zhong-xue
    Chen Ling
    2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 2, PROCEEDINGS, 2009, : 41 - +
  • [36] Biclustering gene expression data by random projection based on bucketing
    Liu, Juan
    Liu, Feng
    2008 INTERNATIONAL SPECIAL TOPIC CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS IN BIOMEDICINE, VOLS 1 AND 2, 2008, : 322 - 325
  • [37] Gibbs Sampling Based Bayesian Biclustering of Gene Expression Data
    Chen, Daoyuan
    Liu, Qinyi
    Meng, Jia
    Su, Jionglong
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 790 - 795
  • [38] Uncertain maximal frequent subgraph mining algorithm based on adjacency matrix and weight
    Di Wu
    Jiadong Ren
    Long Sheng
    International Journal of Machine Learning and Cybernetics, 2018, 9 : 1445 - 1455
  • [39] Ensemble biclustering gene expression data based on the spectral clustering
    Yin, Lu
    Liu, Yongguo
    NEURAL COMPUTING & APPLICATIONS, 2018, 30 (08): : 2403 - 2416
  • [40] Ensemble biclustering gene expression data based on the spectral clustering
    Lu Yin
    Yongguo Liu
    Neural Computing and Applications, 2018, 30 : 2403 - 2416