A New Binary Biclustering Algorithm Based on Weight Adjacency Difference Matrix for Analyzing Gene Expression Data

Cited by: 5
Authors
Chu, He-Ming [1]
Kong, Xiang-Zhen [1]
Liu, Jin-Xing [1]
Zheng, Chun-Hou [1]
Zhang, Han [2]
Affiliations
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao 276826, Shandong, Peoples R China
[2] Jishou Univ, Sch Informat Sci & Engn, Jishou 416000, Hunan, Peoples R China
Funding
National Natural Science Foundation of China; US National Science Foundation
Keywords
Biclustering; gene expression data; weight matrix; binary matrix; HETEROGENEITY; PATHWAYS; PATTERNS;
DOI
10.1109/TCBB.2023.3283801
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Biclustering algorithms are essential for processing gene expression data. However, most biclustering algorithms require the data matrix to be preprocessed into a binary matrix before they can process the dataset. This type of preprocessing may introduce noise or cause information loss in the binary matrix, which reduces the biclustering algorithm's ability to obtain the optimal biclusters. In this paper, we propose a new preprocessing method named Mean-Standard Deviation (MSD) to resolve this problem. Additionally, we introduce a new biclustering algorithm called Weight Adjacency Difference Matrix Binary Biclustering (W-AMBB) to effectively process datasets containing overlapping biclusters. The basic idea is to create a weighted adjacency difference matrix by applying weights to a binary matrix that is derived from the data matrix. This makes it possible to efficiently identify similar genes that respond to specific conditions, and thus genes with significant associations in the sample data. The performance of the W-AMBB algorithm was tested on both synthetic and real datasets and compared with other classical biclustering methods. The experimental results demonstrate that the W-AMBB algorithm is significantly more robust than the compared biclustering methods on the synthetic dataset. In addition, the results of the GO enrichment analysis show that the biclusters found by the W-AMBB method on real datasets are biologically significant.
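The abstract does not give the formulas behind MSD or the weighted adjacency difference matrix, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes that binarization marks an expression value as 1 when it exceeds the gene's mean plus k standard deviations, and that the "difference" between two genes is a weighted count of the conditions where their binary profiles disagree. The function names, the parameter k, and the per-condition weights are all hypothetical.

```python
from statistics import mean, pstdev


def binarize_mean_std(matrix, k=1.0):
    """Binarize each gene (row): 1 where value > row mean + k * row std.

    Assumption: this threshold rule is illustrative only; the paper's
    MSD preprocessing may be defined differently.
    """
    binary = []
    for row in matrix:
        threshold = mean(row) + k * pstdev(row)
        binary.append([1 if value > threshold else 0 for value in row])
    return binary


def weighted_row_difference(binary, weights):
    """Pairwise gene difference: sum of condition weights where profiles differ.

    Assumption: per-condition weights are hypothetical; the abstract only
    says that weights are applied to the binary matrix.
    """
    n = len(binary)
    diff = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            diff[i][j] = sum(
                w for a, b, w in zip(binary[i], binary[j], weights) if a != b
            )
    return diff


# Toy data: 3 genes (rows) x 4 conditions (columns).
expression = [
    [1.0, 1.0, 1.0, 9.0],
    [2.0, 2.0, 2.0, 10.0],
    [5.0, 1.0, 1.0, 1.0],
]
binary = binarize_mean_std(expression, k=1.0)
difference = weighted_row_difference(binary, weights=[2.0, 1.0, 1.0, 1.0])
```

In this toy example the first two genes respond to the same condition and get a difference of zero, while the third gene responds to a different condition and is separated from them. A small difference value is what would suggest grouping genes into the same candidate bicluster under these assumptions.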
Pages: 2802-2809
Page count: 8
Related papers
50 in total
  • [41] Uncertain maximal frequent subgraph mining algorithm based on adjacency matrix and weight
    Wu, Di
    Ren, Jiadong
    Sheng, Long
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2018, 9 (09) : 1445 - 1455
  • [42] Improving an Evolutionary Multi-objective Algorithm for the Biclustering of Gene Expression Data
    Brizuela, Carlos A.
    Luna-Taylor, Jorge E.
    Martinez-Perez, Israel
    Guillen, Hugo A.
    Rodriguez, David O.
    Beltran-Verdugo, Armando
    2013 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2013, : 221 - 228
  • [43] Semantic Biclustering: A New Way to Analyze and Interpret Gene Expression Data
    Klema, Jiri
    Malinka, Frantisek
    Zelezny, Filip
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2016, 2016, 9683 : 332 - 333
  • [44] POPBic: Pathway-Based Order Preserving Biclustering Algorithm Towards the Analysis of Gene Expression Data
    Mandal, Koyel
    Sarmah, Rosy
    Bhattacharyya, Dhruba Kumar
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2659 - 2670
  • [45] Performance Analysis of Gene Expression data using Biclustering Iterative Signature Algorithm
    Vengatesan, K.
    Singh, R. P.
    Bhaskar, Mahajan Sagar
    Padmanaban, Sanjeevikumar
    Ravishankar, T. Nadana
    Ramkumar, M.
    2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING, INSTRUMENTATION AND CONTROL TECHNOLOGIES (ICICICT), 2017, : 7 - 11
  • [46] Analyzing Large Gene Expression and Methylation Data Profiles Using StatBicRM: Statistical Biclustering-Based Rule Mining
    Maulik, Ujjwal
    Mallik, Saurav
    Mukhopadhyay, Anirban
    Bandyopadhyay, Sanghamitra
    PLOS ONE, 2015, 10 (04):
  • [47] Biclustering of gene expression data based on related genes and conditions extraction
    Yan, Dechun
    Wang, Jiajun
    PATTERN RECOGNITION, 2013, 46 (04) : 1170 - 1182
  • [48] Biclustering gene expression data based on a high dimensional geometric method
    Gan, XC
    Liew, AWC
    Yan, H
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3388 - 3393
  • [49] MSR-based algorithms for biclustering of microarray gene expression data
    Balamurugan, R.
    Raja, S. P.
    CURRENT SCIENCE, 2022, 123 (04): : 530 - 541
  • [50] Pattern-Based Biclustering with Constraints for Gene Expression Data Analysis
    Henriques, Rui
    Madeira, Sara C.
    PROGRESS IN ARTIFICIAL INTELLIGENCE-BK, 2015, 9273 : 326 - 339