Integrating Biological Heuristics and Gene Expression Data for Gene Regulatory Network Inference

被引:1
|
作者
Zarnegar, Armita [1 ]
Jelinek, Herbert F. [2 ]
Vamplew, Peter [3 ]
Stranieri, Andrew [3 ]
机构
[1] Swinburne Univ Technol, Fac Sci Engn & Technol, Melbourne, Vic, Australia
[2] Macquarie Univ, Australian Sch Adv Med, Sydney, NSW, Australia
[3] Federat Univ, Sch Sci Engn & IT, Ballarat, Vic, Australia
关键词
Gene expression; Gene regulatory network; Hubs; Association function; Correlation function; ESCHERICHIA-COLI; ALGORITHMS;
D O I
10.1145/3290688.3290741
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Gene Regulatory Networks (GRNs) offer enhanced insight into the biological functions and biochemical pathways of cells associated with gene regulatory mechanisms. However, obtaining accurate GRNs that explain gene expressions and functional associations remains a difficult task. Only a few studies have incorporated heuristics into a GRN discovery process. Doing so has the potential to improve accuracy and reduce the search space and computational time. A technique for GRN discovery that integrates heuristic information into the discovery process is advanced. The approach incorporates three elements: 1) a novel 2D visualized coexpression function that measures the association between genes; 2) a post-processing step that improves detection of up, down and self-regulation and 3) the application of heuristics to generate a Hub network as the backbone of the GRN. Using available microarray and next generation sequencing data from Escherichia coli, six synthetic benchmark GRN datasets were generated with the neighborhood addition and cluster addition methods available in SynTReN. Results of the novel 2D-visualization co-expression function were compared with results obtained using Pearson's correlation and mutual information. The performance of the biological genetics-based heuristics consisting of the 2D-Visualized Co-expression function, post-processing and Hub network was then evaluated by comparing the performance to the GRNs obtained by ARACNe and CLR. The 2D-Visualized Co-expression function significantly improved gene-gene association matching compared to Pearson's correlation coefficient (t = 3.46, df = 5, p = 0.02) and Mutual Information (t = 4.42, df = 5, p = 0.007). The heuristics model gave a 60% improvement against ARACNe (p = 0.02) and CLR (p = 0.019). Analysis of Escherichia coli data suggests that the GRN discovery technique proposed is capable of identifying significant transcriptional regulatory interactions and the corresponding regulatory networks.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] PoLoBag: Polynomial Lasso Bagging for signed gene regulatory network inference from expression data
    Roy, Gourab Ghosh
    Geard, Nicholas
    Verspoor, Karin
    He, Shan
    BIOINFORMATICS, 2020, 36 (21) : 5187 - 5193
  • [32] PoLoBag: Polynomial Lasso Bagging for signed gene regulatory network inference from expression data
    Ghosh Roy, Gourab
    Geard, Nicholas
    Verspoor, Karin
    He, Shan
    BIOINFORMATICS, 2021, 36 (21) : 5187 - 5193
  • [33] LogBTF: gene regulatory network inference using Boolean threshold network model from single-cell gene expression data
    Li, Lingyu
    Sun, Liangjie
    Chen, Guangyi
    Wong, Chi-Wing
    Ching, Wai-Ki
    Liu, Zhi-Ping
    BIOINFORMATICS, 2023, 39 (05)
  • [34] GeneSPIDER - gene regulatory network inference benchmarking with controlled network and data properties
    Tjarnberg, Andreas
    Morgan, Daniel C.
    Studham, Matthew
    Nordling, Torbjorn E. M.
    Sonnhammer, Erik L. L.
    MOLECULAR BIOSYSTEMS, 2017, 13 (07) : 1304 - 1312
  • [35] Gene regulatory network inference from sparsely sampled noisy data
    Aalto, Atte
    Viitasaari, Lauri
    Ilmonen, Pauliina
    Mombaerts, Laurent
    Goncalves, Jorge
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [36] Gene regulatory network inference from sparsely sampled noisy data
    Atte Aalto
    Lauri Viitasaari
    Pauliina Ilmonen
    Laurent Mombaerts
    Jorge Gonçalves
    Nature Communications, 11
  • [37] Reconstructing Gene Regulatory Network Using Heterogeneous Biological Data
    Ahmad, Farzana Kabir
    Yusoff, Nooraini
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2013, 8271 : 97 - 107
  • [38] Integration of Steady-State and Temporal Gene Expression Data for the Inference of Gene Regulatory Networks
    Wang, Yi Kan
    Hurley, Daniel G.
    Schnell, Santiago
    Print, Cristin G.
    Crampin, Edmund J.
    PLOS ONE, 2013, 8 (08):
  • [39] Gene biomarker prediction in glioma by integrating scRNA-seq data and gene regulatory network
    Qin, Guimin
    Du, Longting
    Ma, Yuying
    Yin, Yu
    Wang, Liming
    BMC MEDICAL GENOMICS, 2021, 14 (01)
  • [40] Gene biomarker prediction in glioma by integrating scRNA-seq data and gene regulatory network
    Guimin Qin
    Longting Du
    Yuying Ma
    Yu Yin
    Liming Wang
    BMC Medical Genomics, 14