CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data

被引:22
|
作者
Xu, Jing [1 ,2 ]
Zhang, Aidi [1 ]
Liu, Fang [1 ]
Chen, Liang [1 ]
Zhang, Xiujun [1 ]
机构
[1] Chinese Acad Sci, Key Lab Plant Germplasm Enhancement & Specialty Ag, Wuhan Bot Garden, Wuhan 430074, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type annotation; deep learning; Transformer; scRNA-seq; large-scale dataset; HETEROGENEITY; ATLAS;
D O I
10.1093/bib/bbad195
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at .
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Single-cell RNA-seq data augmentation using generative Fourier transformer
    Nouri, Nima
    COMMUNICATIONS BIOLOGY, 2025, 8 (01)
  • [32] Latent cellular analysis robustly reveals subtle diversity in large-scale single-cell RNA-seq data
    Cheng, Changde
    Easton, John
    Rosencrance, Celeste
    Li, Yan
    Ju, Bensheng
    Williams, Justin
    Mulder, Heather L.
    Pang, Yakun
    Chen, Wenan
    Chen, Xiang
    NUCLEIC ACIDS RESEARCH, 2019, 47 (22)
  • [33] LCA: A robust and scalable algorithm to reveal subtle diversity in large-scale single-cell RNA-Seq data
    Cheng, Changde
    Easton, John
    Rosencrance, Celeste
    Li, Yan
    Ju, Bensheng
    Chen, Wenan
    Chen, Xiang
    CANCER RESEARCH, 2018, 78 (13)
  • [34] A Review of Single-Cell RNA-Seq Annotation, Integration, and Cell-Cell Communication
    Cheng, Changde
    Chen, Wenan
    Jin, Hongjian
    Chen, Xiang
    CELLS, 2023, 12 (15)
  • [35] Cell-type deconvolution for bulk RNA-seq data using single-cell reference: a comparative analysis and recommendation guideline
    Xu, Xintian
    Li, Rui
    Mo, Ouyang
    Liu, Kai
    Li, Justin
    Hao, Pei
    BRIEFINGS IN BIOINFORMATICS, 2025, 26 (01)
  • [36] An Automated Microwell Platform for Large-Scale Single Cell RNA-Seq
    Jinzhou Yuan
    Peter A. Sims
    Scientific Reports, 6
  • [37] An Automated Microwell Platform for Large-Scale Single Cell RNA-Seq
    Yuan, Jinzhou
    Sims, Peter A.
    SCIENTIFIC REPORTS, 2016, 6
  • [38] CSI-Microbes: Identifying cell-type specific intracellular microbes from single-cell RNA-seq data
    Robinson, Welles
    Schischlik, Fiorella
    Gertz, E. Michael
    Lee, Joo S.
    Zhu, Kaiyuan
    Sahinalp, S. Cenk
    Patro, Rob
    Leiserson, Mark D.
    Schaffer, Alejandro A.
    Ruppin, Eytan
    CANCER RESEARCH, 2022, 82 (12)
  • [39] scPML: pathway-based multi-view learning for cell type annotation from single-cell RNA-seq data
    Du, Zhi-Hua
    Hu, Wei-Lin
    Li, Jian-Qiang
    Shang, Xuequn
    You, Zhu-Hong
    Chen, Zhuang-zhuang
    Huang, Yu-An
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [40] scPML: pathway-based multi-view learning for cell type annotation from single-cell RNA-seq data
    Zhi-Hua Du
    Wei-Lin Hu
    Jian-Qiang Li
    Xuequn Shang
    Zhu-Hong You
    Zhuang-zhuang Chen
    Yu-An Huang
    Communications Biology, 6