STATISTICAL CURVE MODELS FOR INFERRING 3D CHROMATIN ARCHITECTURE

Authors
Tuzhilina, Elena [1]
Hastie, Trevor [2]
Segal, Mark [3]
Affiliations
[1] Univ Toronto, Dept Stat Sci, Toronto, ON, Canada
[2] Stanford Univ, Dept Stat, Stanford, CA, USA
[3] Univ Calif Irvine, Dept Epidemiol & Biostat, Irvine, CA, USA
Source
ANNALS OF APPLIED STATISTICS | 2024, Vol. 18, No. 4
Funding
Natural Sciences and Engineering Research Council of Canada; National Institutes of Health (USA); National Science Foundation (USA)
Keywords
Spatial structure; conformation reconstruction; metric scaling; splines; REVEALS; GENOME; PRINCIPLES; REGRESSION
DOI
10.1214/24-AOAS1917
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208 ; 070103 ; 0714 ;
Abstract
Reconstructing three-dimensional (3D) chromatin structure from conformation capture assays (such as Hi-C) is a critical task in computational biology, since chromatin spatial architecture plays a vital role in numerous cellular processes and direct imaging is challenging. Most existing algorithms that operate on Hi-C contact matrices produce reconstructed 3D configurations in the form of a polygonal chain. However, none of the methods exploit the fact that the target solution is a (smooth) curve in 3D: this contiguity attribute is either ignored or indirectly addressed by imposing spatial constraints that are challenging to formulate. In this paper we develop both B-spline and smoothing spline techniques for directly capturing this potentially complex 1D curve. We subsequently combine these techniques with a Poisson model for contact counts and compare their performance on a real data example. In addition, motivated by the sparsity of Hi-C contact data, especially when obtained from single-cell assays, we appreciably extend the class of distributions used to model contact counts. We build a general distribution-based metric scaling (DBMS) framework from which we develop zero-inflated and hurdle Poisson models as well as negative binomial applications. Illustrative applications make recourse to bulk Hi-C data from IMR90 cells and single-cell Hi-C data from mouse embryonic stem cells.
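The idea of combining a B-spline curve with a Poisson model for contact counts can be illustrated with a minimal sketch. The abstract does not state the link between counts and distances, so the form used here, log(lambda_ij) = beta - ||x_i - x_j||^2, is an assumption made for illustration only, as are the function names `bspline_basis`, `pairwise_sq_dists` and `poisson_neg_loglik` and all numeric settings. The curve is written as X = H @ theta, where H is a B-spline basis matrix evaluated at the genomic bin positions and theta holds the 3D control coefficients.

```python
import numpy as np

def bspline_basis(u, n_basis, degree=3):
    """Clamped B-spline basis on [0, 1], evaluated at points u in (0, 1)
    with the Cox-de Boor recursion. Returns an array (len(u), n_basis)."""
    n_inner = n_basis - degree - 1  # number of interior knots
    inner = np.linspace(0.0, 1.0, n_inner + 2)[1:-1]
    knots = np.concatenate([np.zeros(degree + 1), inner, np.ones(degree + 1)])
    # Degree 0: indicator of each half-open knot interval.
    B = np.array([(knots[j] <= u) & (u < knots[j + 1])
                  for j in range(len(knots) - 1)], dtype=float).T
    for d in range(1, degree + 1):
        B_next = np.zeros((len(u), len(knots) - d - 1))
        for j in range(len(knots) - d - 1):
            left = knots[j + d] - knots[j]
            right = knots[j + d + 1] - knots[j + 1]
            if left > 0:
                B_next[:, j] += (u - knots[j]) / left * B[:, j]
            if right > 0:
                B_next[:, j] += (knots[j + d + 1] - u) / right * B[:, j + 1]
        B = B_next
    return B

def pairwise_sq_dists(X):
    """Squared Euclidean distances between all rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sum(diff ** 2, axis=-1)

def poisson_neg_loglik(theta, H, counts, beta):
    """Poisson negative log-likelihood (up to a constant) over bin pairs i < j.
    ASSUMED link: log(lambda_ij) = beta - ||x_i - x_j||^2, with X = H @ theta."""
    log_lam = beta - pairwise_sq_dists(H @ theta)
    iu = np.triu_indices(H.shape[0], k=1)
    return np.sum(np.exp(log_lam[iu]) - counts[iu] * log_lam[iu])

# Toy setup (all values hypothetical): 40 genomic bins, 8 basis functions.
n_bins, n_basis, beta = 40, 8, 2.0
u = (np.arange(n_bins) + 0.5) / n_bins      # bin midpoints, strictly inside (0, 1)
H = bspline_basis(u, n_basis)
rng = np.random.default_rng(0)
theta_true = rng.normal(scale=0.5, size=(n_basis, 3))

# Noise-free expected counts stand in for an observed contact matrix.
expected = np.exp(beta - pairwise_sq_dists(H @ theta_true))

nll_true = poisson_neg_loglik(theta_true, H, expected, beta)
theta_off = theta_true + 0.1 * rng.normal(size=theta_true.shape)
nll_off = poisson_neg_loglik(theta_off, H, expected, beta)
print(nll_true < nll_off)
```

Because the curve is constrained to the column space of H, the number of free parameters is 3 * n_basis rather than 3 * n_bins, which is one way a smoothness (contiguity) constraint can be imposed directly instead of through separate spatial penalties. In practice theta and beta would be estimated by minimizing the negative log-likelihood on real counts; this sketch only evaluates it.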
Pages: 2979-3006
Page count: 28