scAnno: a deconvolution strategy-based automatic cell type annotation tool for single-cell RNA-sequencing data sets

被引:6
|
作者
Liu, Hongjia [1 ]
Li, Huamei [2 ]
Sharma, Amit [3 ]
Huang, Wenjuan [4 ]
Pan, Duo [1 ]
Gu, Yu [1 ]
Lin, Lu [1 ]
Sun, Xiao [1 ]
Liu, Hongde [1 ]
机构
[1] Southeast Univ, Sch Biol Sci & Med Engn, State Key Lab Digital Med Engn, Nanjing, Peoples R China
[2] Nanjing Univ, Nanjing Drum Tower Hosp, Affiliated Hosp, Med Sch,Dept Gen Surg, Nanjing, Peoples R China
[3] Univ Hosp Bonn, Bonn, Germany
[4] Southeast Univ, Southeast Univ Hosp, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
scRNA-seq data annotation; deconvolution; logistic regression; cell type-specific genes;
D O I
10.1093/bib/bbad179
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Undoubtedly, single-cell RNA sequencing (scRNA-seq) has changed the research landscape by providing insights into heterogeneous, complex and rare cell populations. Given that more such data sets will become available in the near future, their accurate assessment with compatible and robust models for cell type annotation is a prerequisite. Considering this, herein, we developed scAnno (scRNA-seq data annotation), an automated annotation tool for scRNA-seq data sets primarily based on the single-cell cluster levels, using a joint deconvolution strategy and logistic regression. We explicitly constructed a reference profile for human (30 cell types and 50 human tissues) and a reference profile for mouse (26 cell types and 50 mouse tissues) to support this novel methodology (scAnno). scAnno offers a possibility to obtain genes with high expression and specificity in a given cell type as cell type-specific genes (marker genes) by combining co-expression genes with seed genes as a core. Of importance, scAnno can accurately identify cell type-specific genes based on cell type reference expression profiles without any prior information. Particularly, in the peripheral blood mononuclear cell data set, the marker genes identified by scAnno showed cell type-specific expression, and the majority of marker genes matched exactly with those included in the CellMarker database. Besides validating the flexibility and interpretability of scAnno in identifying marker genes, we also proved its superiority in cell type annotation over other cell type annotation tools (SingleR, scPred, CHETAH and scmap-cluster) through internal validation of data sets (average annotation accuracy: 99.05%) and cross-platform data sets (average annotation accuracy: 95.56%). Taken together, we established the first novel methodology that utilizes a deconvolution strategy for automated cell typing and is capable of being a significant application in broader scRNA-seq analysis. scAnno is available at .
引用
收藏
页数:12
相关论文
共 50 条
  • [1] scAnnotate: an automated cell-type annotation tool for single-cell RNA-sequencing data
    Ji, Xiangling
    Tsao, Danielle
    Bai, Kailun
    Tsao, Min
    Xing, Li
    Zhang, Xuekui
    BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [2] Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis
    Lu Yang
    Yan Er Ng
    Haipeng Sun
    Ying Li
    Lucas C. S. Chini
    Nathan K. LeBrasseur
    Jun Chen
    Xu Zhang
    BMC Biology, 21
  • [3] Single-cell Mayo Map (scMayoMap): an easy-to-use tool for cell type annotation in single-cell RNA-sequencing data analysis
    Yang, Lu
    Ng, Yan Er
    Sun, Haipeng
    Li, Ying
    Chini, Lucas C. S.
    Lebrasseur, Nathan K.
    Chen, Jun
    Zhang, Xu
    BMC BIOLOGY, 2023, 21 (01)
  • [4] Improved deconvolution of combined bulk and single-cell RNA-sequencing data
    Lei, Haoyun
    Guo, Xiaoyan A.
    Tao, Yifeng
    Ding, Kai
    Fu, Xuecong
    Oesterreich, Steffi
    Lee, Adrian V.
    Schwartz, Russell
    CANCER RESEARCH, 2022, 82 (12)
  • [5] Automatic Cell Type Annotation Using Marker Genes for Single-Cell RNA Sequencing Data
    Chen, Yu
    Zhang, Shuqin
    BIOMOLECULES, 2022, 12 (10)
  • [6] Multi-Target Integration and Annotation of Single-Cell RNA-Sequencing Data
    Bhandari, Sapan
    Whitener, Nathan P.
    Zhao, Konghao
    Khuri, Natalia
    13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022, 2022,
  • [7] scQCEA: a framework for annotation and quality control report of single-cell RNA-sequencing data
    Nassiri, Isar
    Fairfax, Benjamin
    Lee, Angela
    Wu, Yanxia
    Buck, David
    Piazza, Paolo
    BMC GENOMICS, 2023, 24 (01)
  • [8] scQCEA: a framework for annotation and quality control report of single-cell RNA-sequencing data
    Isar Nassiri
    Benjamin Fairfax
    Angela Lee
    Yanxia Wu
    David Buck
    Paolo Piazza
    BMC Genomics, 24
  • [9] scQCEA a framework for annotation and quality control report of single-cell RNA-sequencing data
    Nassiri, Isar
    Fairfax, Benjamin
    Lee, Angela
    Wu, Yanxia
    Buck, David
    Piazza, Paolo
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 649 - 649
  • [10] scCATCH: Automatic Annotation on Cell Types of Clusters from Single-Cell RNA Sequencing Data
    Shao, Xin
    Liao, Jie
    Lu, Xiaoyan
    Xue, Rui
    Ai, Ni
    Fan, Xiaohui
    ISCIENCE, 2020, 23 (03)