CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data

被引:22
|
作者
Xu, Jing [1 ,2 ]
Zhang, Aidi [1 ]
Liu, Fang [1 ]
Chen, Liang [1 ]
Zhang, Xiujun [1 ]
机构
[1] Chinese Acad Sci, Key Lab Plant Germplasm Enhancement & Specialty Ag, Wuhan Bot Garden, Wuhan 430074, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type annotation; deep learning; Transformer; scRNA-seq; large-scale dataset; HETEROGENEITY; ATLAS;
D O I
10.1093/bib/bbad195
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at .
引用
收藏
页数:11
相关论文
共 50 条
  • [1] scBERT as a large-scale pretrained deep language model for cell type annotation of single-cell RNA-seq data
    Yang, Fan
    Wang, Wenchuan
    Wang, Fang
    Fang, Yuan
    Tang, Duyu
    Huang, Junzhou
    Lu, Hui
    Yao, Jianhua
    NATURE MACHINE INTELLIGENCE, 2022, 4 (10) : 852 - +
  • [2] scEVOLVE: cell-type incremental annotation without forgetting for single-cell RNA-seq data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [3] Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data
    Wang, Chunxiang
    Gao, Xin
    Liu, Juntao
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [4] Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data
    Chunxiang Wang
    Xin Gao
    Juntao Liu
    BMC Bioinformatics, 21
  • [5] Generalized Cell Type Annotation and Discovery for Single-Cell RNA-Seq Data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 5402 - 5410
  • [6] SCSA: A Cell Type Annotation Tool for Single-Cell RNA-seq Data
    Cao, Yinghao
    Wang, Xiaoyue
    Peng, Gongxin
    FRONTIERS IN GENETICS, 2020, 11
  • [7] Realistic Cell Type Annotation and Discovery for Single-cell RNA-seq Data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4967 - 4974
  • [8] TransCluster: A Cell-Type Identification Method for single-cell RNA-Seq data using deep learning based on transformer
    Song, Tao
    Dai, Huanhuan
    Wang, Shuang
    Wang, Gan
    Zhang, Xudong
    Zhang, Ying
    Jiao, Linfang
    FRONTIERS IN GENETICS, 2022, 13
  • [9] scGAA: a general gated axial-attention model for accurate cell-type annotation of single-cell RNA-seq data
    Kong, Tianci
    Yu, Tiancheng
    Zhao, Jiaxin
    Hu, Zhenhua
    Xiong, Neal
    Wan, Jian
    Dong, Xiaoliang
    Pan, Yi
    Zheng, Huilin
    Zhang, Lei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [10] Analyzing Large-Scale Single-Cell RNA-Seq Data Using Coreset
    Usman, Khalid
    Wan, Fangping
    Zhao, Dan
    Peng, Jian
    Zeng, Jianyang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 1784 - 1793