scMRA: a robust deep learning method to annotate scRNA-seq data with multiple reference datasets

被引:23
|
作者
Yuan, Musu [1 ,2 ]
Chen, Liang [1 ]
Deng, Minghua [1 ,2 ,3 ]
机构
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
[3] Peking Univ, Ctr Stat Sci, Beijing 100871, Peoples R China
基金
中国国家自然科学基金;
关键词
CELL RNA-SEQ;
D O I
10.1093/bioinformatics/btab700
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Single-cell RNA-seq (scRNA-seq) has been widely used to resolve cellular heterogeneity. After collecting scRNA-seq data, the natural next step is to integrate the accumulated data to achieve a common ontology of cell types and states. Thus, an effective and efficient cell-type identification method is urgently needed. Meanwhile, high-quality reference data remain a necessity for precise annotation. However, such tailored reference data are always lacking in practice. To address this, we aggregated multiple datasets into a meta-dataset on which annotation is conducted. Existing supervised or semi-supervised annotation methods suffer from batch effects caused by different sequencing platforms, the effect of which increases in severity with multiple reference datasets. Results: Herein, a robust deep learning-based single-cell Multiple Reference Annotator (scMRA) is introduced. In scMRA, a knowledge graph is constructed to represent the characteristics of cell types in different datasets, and a graphic convolutional network serves as a discriminator based on this graph. scMRA keeps intra-cell-type closeness and the relative position of cell types across datasets. scMRA is remarkably powerful at transferring knowledge from multiple reference datasets, to the unlabeled target domain, thereby gaining an advantage over other state-of-theart annotation methods in multi-reference data experiments. Furthermore, scMRA can remove batch effects. To the best of our knowledge, this is the first attempt to use multiple insufficient reference datasets to annotate target data, and it is, comparatively, the best annotation method for multiple scRNA-seq datasets.
引用
收藏
页码:738 / 745
页数:8
相关论文
共 50 条
  • [31] Digitaldlsorter: Deep-Learning on scRNA-Seq to Deconvolute Gene Expression Data (vol 10, 978, 2019)
    Torroja, Carlos
    Sanchez-Cabo, Fatima
    FRONTIERS IN GENETICS, 2020, 10
  • [32] A clustering method for small scRNA-seq data based on subspace and weighted distance
    Ning, Zilan
    Dai, Zhijun
    Zhang, Hongyan
    Chen, Yuan
    Yuan, Zheming
    PEERJ, 2023, 11 : 28 - 28
  • [33] A deep learning adversarial autoencoder with dynamic batching displays high performance in denoising and ordering scRNA-seq data
    Ko, Kyung Dae
    Sartorelli, Vittorio
    ISCIENCE, 2024, 27 (03)
  • [34] Robust scRNA-seq Cell Types Identification by Self-Guided Deep Clustering Network
    Wang, Yisong
    Wang, Mingzhi
    IEEE ACCESS, 2024, 12 : 60222 - 60233
  • [35] FSPAM: A Feature Construction Method to Identifying Cell Populations in ScRNA-seq Data
    Einipour, Amin
    Mosleh, Mohammad
    Ansari-Asl, Karim
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2020, 122 (01): : 377 - 397
  • [36] Advancing ScRNA-Seq Data Integration via a Novel Gene Selection Method
    Lazaros, Konstantinos
    Exarchos, Themis
    Maglogiannis, Ilias
    Vlamos, Panagiotis
    Vrahatis, Aristidis G.
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT I, AIAI 2024, 2024, 711 : 31 - 41
  • [37] Deep learning tackles single-cell analysis-a survey of deep learning for scRNA-seq analysis
    Flores, Mario
    Liu, Zhentao
    Zhang, Tinghe
    Hasib, Md Musaddaqui
    Chiu, Yu-Chiao
    Ye, Zhenqing
    Paniagua, Karla
    Jo, Sumin
    Zhang, Jianqiu
    Gao, Shou-Jiang
    Jin, Yu-Fang
    Chen, Yidong
    Huang, Yufei
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [38] Attention-based deep clustering method for scRNA-seq cell type identification
    Li, Shenghao
    Guo, Hui
    Zhang, Simai
    Li, Yizhou
    Li, Menglong
    PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (11)
  • [39] Supervised capacity preserving mapping: a clustering guided visualization method for scRNA-seq data
    Zhai, Zhiqian
    Lei, Yu L.
    Wang, Rongrong
    Xie, Yuying
    BIOINFORMATICS, 2022, 38 (09) : 2496 - 2503
  • [40] A rank-based marker selection method for high throughput scRNA-seq data
    Alexander H. S. Vargo
    Anna C. Gilbert
    BMC Bioinformatics, 21