cell-type annotation;
deep learning;
Transformer;
scRNA-seq;
large-scale dataset;
HETEROGENEITY;
ATLAS;
D O I:
10.1093/bib/bbad195
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
Single-cell omics technologies have made it possible to analyze the individual cells within a biological sample, providing a more detailed understanding of biological systems. Accurately determining the cell type of each cell is a crucial goal in single-cell RNA-seq (scRNA-seq) analysis. Apart from overcoming the batch effects arising from various factors, single-cell annotation methods also face the challenge of effectively processing large-scale datasets. With the availability of an increase in the scRNA-seq datasets, integrating multiple datasets and addressing batch effects originating from diverse sources are also challenges in cell-type annotation. In this work, to overcome the challenges, we developed a supervised method called CIForm based on the Transformer for cell-type annotation of large-scale scRNA-seq data. To assess the effectiveness and robustness of CIForm, we have compared it with some leading tools on benchmark datasets. Through the systematic comparisons under various cell-type annotation scenarios, we exhibit that the effectiveness of CIForm is particularly pronounced in cell-type annotation. The source code and data are available at .
机构:
Columbia Univ, Med Ctr, Dept Syst Biol, New York, NY 10032 USAColumbia Univ, Med Ctr, Dept Syst Biol, New York, NY 10032 USA
Yuan, Jinzhou
Sims, Peter A.
论文数: 0引用数: 0
h-index: 0
机构:
Columbia Univ, Med Ctr, Dept Syst Biol, New York, NY 10032 USA
Columbia Univ, Med Ctr, Sulzberger Columbia Genome Ctr, New York, NY 10032 USA
Columbia Univ, Med Ctr, Dept Biochem & Mol Biophys, New York, NY 10032 USAColumbia Univ, Med Ctr, Dept Syst Biol, New York, NY 10032 USA