Forseti: a mechanistic and predictive model of the splicing status of scRNA-seq reads

被引:0
|
作者
He, Dongze [1 ,2 ]
Gao, Yuan [1 ,2 ]
Chan, Spencer Skylar [3 ]
Quintana-Parrilla, Natalia [4 ]
Patro, Rob [1 ,3 ]
机构
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] Univ Maryland, Program Computat Biol Bioinformat & Genom, College Pk, MD 20742 USA
[3] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[4] Univ Puerto Rico, Dept Biol, Mayaguez Campus, Mayaguez, PR 00682 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btae207
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Short-read single-cell RNA-sequencing (scRNA-seq) has been used to study cellular heterogeneity, cellular fate, and transcriptional dynamics. Modeling splicing dynamics in scRNA-seq data is challenging, with inherent difficulty in even the seemingly straightforward task of elucidating the splicing status of the molecules from which sequenced fragments are drawn. This difficulty arises, in part, from the limited read length and positional biases, which substantially reduce the specificity of the sequenced fragments. As a result, the splicing status of many reads in scRNA-seq is ambiguous because of a lack of definitive evidence. We are therefore in need of methods that can recover the splicing status of ambiguous reads which, in turn, can lead to more accuracy and confidence in downstream analyses. Results: We develop Forseti, a predictive model to probabilistically assign a splicing status to scRNA-seq reads. Our model has two key components. First, we train a binding affinity model to assign a probability that a given transcriptomic site is used in fragment generation. Second, we fit a robust fragment length distribution model that generalizes well across datasets deriving from different species and tissue types. Forseti combines these two trained models to predict the splicing status of the molecule of origin of reads by scoring putative fragments that associate each alignment of sequenced reads with proximate potential priming sites. Using both simulated and experimental data, we show that our model can precisely predict the splicing status of many reads and identify the true gene origin of multi-gene mapped reads.
引用
收藏
页码:i297 / i306
页数:10
相关论文
共 50 条
  • [31] A New Graph Autoencoder-Based Consensus-Guided Model for scRNA-seq Cell Type Detection
    Zhang, Dai-Jun
    Gao, Ying-Lian
    Zhao, Jing-Xiu
    Zheng, Chun-Hou
    Liu, Jin-Xing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 2473 - 2483
  • [32] Identification and Validation of a Ferroptosis Prognostic Model for Prostate Cancer Patients through Screening the TCGA and scRNA-seq Datasets
    Lyu, F.
    Gao, X.
    Shang, S.
    Ma, M.
    Li, S.
    Chen, J.
    Ren, X.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2023, 117 (02): : E412 - E412
  • [33] Comprehensive scRNA-seq Model Reveals Artery Endothelial Cell Heterogeneity and Metabolic Preference in Human Vascular Disease
    Zeng, Liping
    Liu, Yunchang
    Li, Xiaoping
    Gong, Xue
    Tian, Miao
    Yang, Peili
    Cai, Qi
    Wu, Gengze
    Zeng, Chunyu
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (01) : 104 - 122
  • [34] Development and validation of a prognostic model and gene co-expression networks for breast carcinoma based on scRNA-seq and bulk-seq data
    Ruan, Zhaohui
    Chi, Dongmei
    Wang, Qianyu
    Jiang, Jiaxin
    Quan, Qi
    Bei, Jinxin
    Peng, Roujun
    ANNALS OF TRANSLATIONAL MEDICINE, 2022, 10 (24)
  • [35] Integrative Analysis of scRNA-Seq and Bulk RNA-Seq Identifies Plasma Cell Related Genes and Constructs a Prognostic Model for Hepatocellular Carcinoma
    Tang, Mingyang
    Xu, Yuyan
    Pan, Mingxin
    JOURNAL OF HEPATOCELLULAR CARCINOMA, 2025, 12 : 427 - 444
  • [36] CaSee: A lightning transfer-learning model directly used to discriminate cancer/normal cells from scRNA-seq
    Sh, Yuan
    Zhang, Xiuli
    Yang, Zhimin
    Dong, Jierong
    Wang, Yuanzhuo
    Zhou, Ying
    Li, Xuejie
    Guo, Caixia
    Hu, Zhiyuan
    ONCOGENE, 2022, 41 (44) : 4866 - 4876
  • [37] CaSee: A lightning transfer-learning model directly used to discriminate cancer/normal cells from scRNA-seq
    Yuan Sh
    Xiuli Zhang
    Zhimin Yang
    Jierong Dong
    Yuanzhuo Wang
    Ying Zhou
    Xuejie Li
    Caixia Guo
    Zhiyuan Hu
    Oncogene, 2022, 41 : 4866 - 4876
  • [38] Comprehensive analysis of scRNA-Seq and bulk RNA-Seq reveals dynamic changes in the tumor immune microenvironment of bladder cancer and establishes a prognostic model
    Tan, Zhiyong
    Chen, Xiaorong
    Zuo, Jieming
    Fu, Shi
    Wang, Haifeng
    Wang, Jiansong
    JOURNAL OF TRANSLATIONAL MEDICINE, 2023, 21 (01)
  • [39] Comprehensive analysis of scRNA-Seq and bulk RNA-Seq reveals dynamic changes in the tumor immune microenvironment of bladder cancer and establishes a prognostic model
    Zhiyong Tan
    Xiaorong Chen
    Jieming Zuo
    Shi Fu
    Haifeng Wang
    Jiansong Wang
    Journal of Translational Medicine, 21
  • [40] scRNA-seq of diffuse-type gastric cancer mouse model reveals potential therapeutic targets on cancer microenvironment.
    Kokubo, Haruki
    Komura, Daisuke
    Kakiuchi, Miwako
    Katoh, Hiroto
    Ishikawa, Shumpei
    CANCER SCIENCE, 2022, 113 : 474 - 474