CIForm as a Transformer-based model for cell-type annotation of large-scale single-cell RNA-seq data

被引:22
|
作者
Xu, Jing [1 ,2 ]
Zhang, Aidi [1 ]
Liu, Fang [1 ]
Chen, Liang [1 ]
Zhang, Xiujun [1 ]
机构
[1] Chinese Acad Sci, Key Lab Plant Germplasm Enhancement & Specialty Ag, Wuhan Bot Garden, Wuhan 430074, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
cell-type annotation; deep learning; Transformer; scRNA-seq; large-scale dataset; HETEROGENEITY; ATLAS;
D O I
10.1093/bib/bbad195
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at .
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq
    Kotliar, Dylan
    Veres, Adrian
    Nagy, M. Aurel
    Tabrizi, Shervin
    Hodis, Eran
    Melton, Douglas A.
    Sabeti, Pardis C.
    ELIFE, 2019, 8
  • [42] A probabilistic gene expression barcode for annotation of cell types from single-cell RNA-seq data
    Grabski, Isabella N.
    Irizarry, Rafael A.
    BIOSTATISTICS, 2022, 23 (04) : 1150 - 1164
  • [43] Cell-type eQTL deconvolution of bronchial epithelium through integration of single-cell and bulk RNA-seq
    Qi, Cancan
    Berg, Marijn
    Chu, Xiaojing
    Timens, Wim
    Kole, Tessa
    van den Berge, Maarten
    Xu, Cheng-Jian
    Koppelman, Gerard H.
    Nawijn, Martijn C.
    Li, Yang
    ALLERGY, 2022, 77 (12) : 3663 - 3666
  • [44] Distribution-Independent Cell Type Identification for Single-Cell RNA-seq Data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 6143 - 6151
  • [45] scReClassify: post hoc cell type classification of single-cell rNA-seq data
    Taiyun Kim
    Kitty Lo
    Thomas A. Geddes
    Hani Jieun Kim
    Jean Yee Hwa Yang
    Pengyi Yang
    BMC Genomics, 20
  • [46] scReClassify: post hoc cell type classification of single-cell rNA-seq data
    Kim, Taiyun
    Lo, Kitty
    Geddes, Thomas A.
    Kim, Hani Jieun
    Yang, Jean Yee Hwa
    Yang, Pengyi
    BMC GENOMICS, 2019, 20 (Suppl 9)
  • [47] Comparative Analysis of Supervised Cell Type Detection in Single-Cell RNA-seq Data
    Vasighizaker, Akram
    Hora, Sheena
    Trivedi, Yash
    Rueda, Luis
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, PT II, 2022, : 333 - 345
  • [48] scAnnotate: an automated cell-type annotation tool for single-cell RNA-sequencing data
    Ji, Xiangling
    Tsao, Danielle
    Bai, Kailun
    Tsao, Min
    Xing, Li
    Zhang, Xuekui
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [49] Comparison of transformations for single-cell RNA-seq data
    Constantin Ahlmann-Eltze
    Wolfgang Huber
    Nature Methods, 2023, 20 : 665 - 672
  • [50] Application of bioinformatic tools in cell type classification for single-cell RNA-seq data
    Sujana, Shah Tania Akter
    Shahjaman, Md.
    Singha, Atul Chandra
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2025, 115