Rich Chromatin Structure Prediction from Hi-C Data

被引:13
|
作者
Malik, Laraib [1 ]
Patro, Rob [1 ]
机构
[1] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
基金
美国国家科学基金会;
关键词
Prediction algorithms; Biological cells; Frequency-domain analysis; Indexes; Tools; Genomics; Bioinformatics; Hierarchy; chromatin conformation capture; Hi-C; topologically associating domains; FUNCTIONAL-ORGANIZATION; TOPOLOGICAL DOMAINS; GENOME; PRINCIPLES; TRANSCRIPTION; ARCHITECTURE; MODEL; MAP;
D O I
10.1109/TCBB.2018.2851200
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recent studies involving the 3-dimensional conformation of chromatin have revealed the important role it has to play in different processes within the cell. These studies have also led to the discovery of densely interacting segments of the chromosome, called topologically associating domains. The accurate identification of these domains from Hi-C interaction data is an interesting and important computational problem for which numerous methods have been proposed. Unfortunately, most existing algorithms designed to identify these domains assume that they are non-overlapping whereas there is substantial evidence to believe a nested structure exists. We present a methodology to predict hierarchical chromatin domains using chromatin conformation capture data. Our method predicts domains at different resolutions, calculated using intrinsic properties of the chromatin data, and effectively clusters these to construct the hierarchy. At each individual level, the domains are non-overlapping in such a way that the intra-domain interaction frequencies are maximized. We show that our predicted structure is highly enriched for actively transcribing housekeeping genes and various chromatin markers, including CTCF, around the domain boundaries. We also show that large-scale domains, at multiple resolutions within our hierarchy, are conserved across cell types and species. We also provide comparisons against existing tools for extracting hierarchical domains. Our software, Matryoshka, is written in C++11 and licensed under GPL v3; it is available at https://github.com/COMBINE-lab/matryoshka.
引用
收藏
页码:1448 / 1458
页数:11
相关论文
共 50 条
  • [21] covNorm: An R package for coverage based normalization of Hi-C and capture Hi-C data
    Kim, Kyukwang
    Jung, Inkyung
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3149 - 3159
  • [22] Comparative study on chromatin loop callers using Hi-C data reveals their effectiveness
    H. M. A. Mohit Chowdhury
    Terrance Boult
    Oluwatosin Oluwadare
    BMC Bioinformatics, 25
  • [23] Comparative study on chromatin loop callers using Hi-C data reveals their effectiveness
    Chowdhury, H. M. A. Mohit
    Boult, Terrance
    Oluwadare, Oluwatosin
    BMC BIOINFORMATICS, 2024, 25 (01)
  • [24] Inferential Structure Determination of Chromosomes from Single-Cell Hi-C Data
    Carstens, Simeon
    Nilges, Michael
    Habeck, Michael
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (12)
  • [25] DNase Hi-C — pitch-perfect chromatin mapping?
    Linda Koch
    Nature Reviews Genetics, 2015, 16 (1) : 5 - 5
  • [26] Inferring chromosome radial organization from Hi-C data
    Das, Priyojit
    Shen, Tongye
    McCord, Rachel Patton
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [27] A fast and adaptive detection framework for genome-wide chromatin loop mapping from Hi-C data
    Chen, Siyuan
    Wang, Jiuming
    Jung, Inkyung
    Qiu, Zhaowen
    Gao, Xin
    Li, Yu
    GENOME RESEARCH, 2024, 34 (08) : 1174 - 1184
  • [28] Reference panel guided topological structure annotation of Hi-C data
    Zhang, Yanlin
    Blanchette, Mathieu
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [29] Reference panel guided topological structure annotation of Hi-C data
    Yanlin Zhang
    Mathieu Blanchette
    Nature Communications, 13
  • [30] MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments
    Alinejad-Rokny, Hamid
    Modegh, Rassa Ghavami
    Rabiee, Hamid R. R.
    Sarbandi, Ehsan Ramezani
    Rezaie, Narges
    Tam, Kin Tung
    Forrest, Alistair R. R.
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (06)