Forseti: a mechanistic and predictive model of the splicing status of scRNA-seq reads

被引:0
|
作者
He, Dongze [1 ,2 ]
Gao, Yuan [1 ,2 ]
Chan, Spencer Skylar [3 ]
Quintana-Parrilla, Natalia [4 ]
Patro, Rob [1 ,3 ]
机构
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] Univ Maryland, Program Computat Biol Bioinformat & Genom, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[4] Univ Puerto Rico, Dept Biol, Mayaguez Campus, Mayaguez, PR 00682 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btae207
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Short-read single-cell RNA-sequencing (scRNA-seq) has been used to study cellular heterogeneity, cellular fate, and transcriptional dynamics. Modeling splicing dynamics in scRNA-seq data is challenging, with inherent difficulty in even the seemingly straightforward task of elucidating the splicing status of the molecules from which sequenced fragments are drawn. This difficulty arises, in part, from the limited read length and positional biases, which substantially reduce the specificity of the sequenced fragments. As a result, the splicing status of many reads in scRNA-seq is ambiguous because of a lack of definitive evidence. We are therefore in need of methods that can recover the splicing status of ambiguous reads which, in turn, can lead to more accuracy and confidence in downstream analyses. Results: We develop Forseti, a predictive model to probabilistically assign a splicing status to scRNA-seq reads. Our model has two key components. First, we train a binding affinity model to assign a probability that a given transcriptomic site is used in fragment generation. Second, we fit a robust fragment length distribution model that generalizes well across datasets deriving from different species and tissue types. Forseti combines these two trained models to predict the splicing status of the molecule of origin of reads by scoring putative fragments that associate each alignment of sequenced reads with proximate potential priming sites. Using both simulated and experimental data, we show that our model can precisely predict the splicing status of many reads and identify the true gene origin of multi-gene mapped reads.
引用
收藏
页码:i297 / i306
页数:10
相关论文
共 50 条
  • [21] Establishment of an ovarian cancer omentum metastasis-related prognostic model by integrated analysis of scRNA-seq and bulk RNA-seq
    Dongni Zhang
    Wenping Lu
    Shasha Cui
    Heting Mei
    Xiaoqing Wu
    Zhili Zhuo
    Journal of Ovarian Research, 15
  • [22] Integration of scRNA-Seq and Bulk RNA-Seq to Analyse the Heterogeneity of Ovarian Cancer Immune Cells and Establish a Molecular Risk Model
    Liang, Leilei
    Yu, Jing
    Li, Jian
    Li, Ning
    Liu, Jing
    Xiu, Lin
    Zeng, Jia
    Wang, Tiantian
    Wu, Lingying
    FRONTIERS IN ONCOLOGY, 2021, 11
  • [23] Identification of a novel sepsis prognosis model and analysis of possible drug application prospects: Based on scRNA-seq and RNA-seq data
    He, Haihong
    Huang, Tingting
    Guo, Shixing
    Yu, Fan
    Shen, Hongwei
    Shao, Haibin
    Chen, Keyan
    Zhang, Lijun
    Wu, Yunfeng
    Tang, Xi
    Yuan, Xinhua
    Liu, Jiao
    Zhou, Yiwen
    FRONTIERS IN IMMUNOLOGY, 2022, 13
  • [24] Prognostic risk model of LIHC T-cells based on scRNA-seq and RNA-seq and the regulation of the tumor immune microenvironment
    Ding, Shoupeng
    Yi, Xiaomei
    Gao, Jinghua
    Huang, Chunxiao
    Zheng, Shouzhao
    Wu, Lixian
    Cai, Zihan
    DISCOVER ONCOLOGY, 2024, 15 (01)
  • [25] Establishment of an ovarian cancer omentum metastasis-related prognostic model by integrated analysis of scRNA-seq and bulk RNA-seq
    Zhang, Dongni
    Lu, Wenping
    Cui, Shasha
    Mei, Heting
    Wu, Xiaoqing
    Zhuo, Zhili
    JOURNAL OF OVARIAN RESEARCH, 2022, 15 (01)
  • [26] Predicting Outcomes in Esophageal Squamous Cell Carcinoma Using scRNA-Seq and Bulk RNA-Seq: A Model Development and Validation Study
    Zhang, Jiaqi
    Song, Shunzhe
    Li, Yuqing
    Gong, Aixia
    CANCER MEDICINE, 2025, 14 (02):
  • [27] Integrated Analysis of scRNA-Seq and Bulk RNA-Seq Reveals Metabolic Reprogramming of Liver Cancer and Establishes a Prognostic Risk Model
    Xiong, Zhuang
    Li, Lizhi
    Wang, Guoliang
    Guo, Lei
    Luo, Shangyi
    Liao, Xiangwen
    Liu, Jingfeng
    Teng, Wenhao
    GENES, 2024, 15 (06)
  • [28] Inferring Transcriptional Bursting Kinetics Using Gene Expression Model with Memory and Crosstalk from scRNA-seq Data
    Wang, Mengyuan
    Cao, Wenjie
    Guo, Yanbing
    Wang, Guilin
    Jiang, Jian
    Qiu, Huahai
    Zhang, Ben-gong
    JOURNAL OF COMPUTATIONAL BIOPHYSICS AND CHEMISTRY, 2024, 23 (06): : 765 - 779
  • [29] Comprehensive scRNA-seq Model Reveals Artery Endothelial Cell Heterogeneity and Metabolic Preference in Human Vascular Disease
    Liping Zeng
    Yunchang Liu
    Xiaoping Li
    Xue Gong
    Miao Tian
    Peili Yang
    Qi Cai
    Gengze Wu
    Chunyu Zeng
    Interdisciplinary Sciences: Computational Life Sciences, 2024, 16 : 104 - 122
  • [30] GOWDL: gene ontology-driven wide and deep learning model for cell typing of scRNA-seq data
    Fiannaca, Antonino
    La Rosa, Massimo
    La Paglia, Laura
    Gaglio, Salvatore
    Urso, Alfonso
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)