scMRA: a robust deep learning method to annotate scRNA-seq data with multiple reference datasets

被引:23
|
作者
Yuan, Musu [1 ,2 ]
Chen, Liang [1 ]
Deng, Minghua [1 ,2 ,3 ]
机构
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
[3] Peking Univ, Ctr Stat Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
CELL RNA-SEQ;
D O I
10.1093/bioinformatics/btab700
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell RNA-seq (scRNA-seq) has been widely used to resolve cellular heterogeneity. After collecting scRNA-seq data, the natural next step is to integrate the accumulated data to achieve a common ontology of cell types and states. Thus, an effective and efficient cell-type identification method is urgently needed. Meanwhile, high-quality reference data remain a necessity for precise annotation. However, such tailored reference data are always lacking in practice. To address this, we aggregated multiple datasets into a meta-dataset on which annotation is conducted. Existing supervised or semi-supervised annotation methods suffer from batch effects caused by different sequencing platforms, the effect of which increases in severity with multiple reference datasets. Results: Herein, a robust deep learning-based single-cell Multiple Reference Annotator (scMRA) is introduced. In scMRA, a knowledge graph is constructed to represent the characteristics of cell types in different datasets, and a graphic convolutional network serves as a discriminator based on this graph. scMRA keeps intra-cell-type closeness and the relative position of cell types across datasets. scMRA is remarkably powerful at transferring knowledge from multiple reference datasets, to the unlabeled target domain, thereby gaining an advantage over other state-of-theart annotation methods in multi-reference data experiments. Furthermore, scMRA can remove batch effects. To the best of our knowledge, this is the first attempt to use multiple insufficient reference datasets to annotate target data, and it is, comparatively, the best annotation method for multiple scRNA-seq datasets.
引用
收藏
页码:738 / 745
页数:8
相关论文
共 50 条
  • [1] Deep embedded clustering with multiple objectives on scRNA-seq data
    Li, Xiangtao
    Zhang, Shixiong
    Wong, Ka-Chun
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [2] FRMC: a fast and robust method for the imputation of scRNA-seq data
    Wu, Honglong
    Wang, Xuebin
    Chu, Mengtian
    Xiang, Ruizhi
    Zhou, Ke
    RNA BIOLOGY, 2021, 18 : 172 - 181
  • [3] scDeepInsight: a supervised cell-type identification method for scRNA-seq data with deep learning
    Jia, Shangru
    Lysenko, Artem
    Boroevich, Keith A.
    Sharma, Alok
    Tsunoda, Tatsuhiko
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [4] A Deep Learning Pipeline for the Automatic cell type Assignment of scRNA-seq Data
    Riva, Simone G.
    Myers, Brynelle
    Cazzaniga, Paolo
    Tangherloni, Andrea
    2022 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (IEEE CIBCB 2022), 2022, : 67 - 74
  • [5] Digitaldlsorter: Deep-Learning on scRNA-Seq to Deconvolute Gene Expression Data
    Torroja, Carlos
    Sanchez-Cabo, Fatima
    FRONTIERS IN GENETICS, 2019, 10
  • [6] scSFCL:Deep clustering of scRNA-seq data with subspace feature confidence learning
    Meng, Xiaokun
    Zhang, Yuanyuan
    Xu, Xiaoyu
    Zhang, Kaihao
    Feng, Baoming
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2025, 114
  • [7] Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data
    Gan, Yanglan
    Chen, Yuhan
    Xu, Guangwei
    Guo, Wenjing
    Zou, Guobing
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [8] Comparison of scRNA-seq data analysis method combinations
    Xu, Li
    Xue, Tong
    Ding, Weiyue
    Shen, Linshan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2022, 21 (06) : 433 - 440
  • [9] scGCL: an imputation method for scRNA-seq data based on graph contrastive learning
    Xiong, Zehao
    Luo, Jiawei
    Shi, Wanwan
    Liu, Ying
    Xu, Zhongyuan
    Wang, Bo
    BIOINFORMATICS, 2023, 39 (03)
  • [10] scVIC: deep generative modeling of heterogeneity for scRNA-seq data
    Xiong, Jiankang
    Gong, Fuzhou
    Ma, Liang
    Wan, Lin
    BIOINFORMATICS ADVANCES, 2024, 4 (01):