FIRM: Flexible integration of single-cell RNA-sequencing data for large-scale multi-tissue cell atlas datasets

被引:10
|
作者
Ming, Jingsi [1 ,2 ]
Lin, Zhixiang [3 ]
Zhao, Jia [2 ]
Wan, Xiang [4 ,5 ]
Yang, Can [2 ]
Wu, Angela Ruohao [6 ,7 ]
机构
[1] East China Normal Univ, Acad Stat & Interdisciplinary Sci, KLATASDS MOE, Shanghai, Peoples R China
[2] Hong Kong Univ Sci & Technol, Dept Math, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Dept Stat, Hong Kong, Peoples R China
[4] Shenzhen Res Inst Big Data, Shenzhen, Peoples R China
[5] Pazhou Lab, Guangzhou, Peoples R China
[6] Hong Kong Univ Sci & Technol, Div Life Sci, Hong Kong, Peoples R China
[7] Hong Kong Univ Sci & Technol, Dept Chem & Biol Engn, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
single-cell RNA sequencing; data integration; cell atlas; bioinformatics; GENE-EXPRESSION; SEQ; LUNG;
D O I
10.1093/bib/bbac167
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA-sequencing (scRNA-seq) is being used extensively to measure the mRNA expression of individual cells from deconstructed tissues, organs and even entire organisms to generate cell atlas references, leading to discoveries of novel cell types and deeper insight into biological trajectories. These massive datasets are usually collected from many samples using different scRNA-seq technology platforms, including the popular SMART-Seq2 (SS2) and 10X platforms. Inherent heterogeneities between platforms, tissues and other batch effects make scRNA-seq data difficult to compare and integrate, especially in large-scale cell atlas efforts; yet, accurate integration is essential for gaining deeper insights into cell biology. We present FIRM, a re-scaling algorithm which accounts for the effects of cell type compositions, and achieve accurate integration of scRNA-seq datasets across multiple tissue types, platforms and experimental batches. Compared with existing state-of-the-art integration methods, FIRM provides accurate mixing of shared cell type identities and superior preservation of original structure without overcorrection, generating robust integrated datasets for downstream exploration and analysis. FIRM is also a facile way to transfer cell type labels and annotations from one dataset to another, making it a reliable and versatile tool for scRNA-seq analysis, especially for cell atlas data integration.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] SCEMENT: scalable and memory efficient integration of large-scale single-cell RNA-sequencing data
    Chockalingam, Sriram P.
    Aluru, Maneesha
    Aluru, Srinivas
    BIOINFORMATICS, 2025, 41 (02)
  • [2] schex avoids overplotting for large single-cell RNA-sequencing datasets
    Freytag, Saskia
    Lister, Ryan
    BIOINFORMATICS, 2020, 36 (07) : 2291 - 2292
  • [3] Multi-Target Integration and Annotation of Single-Cell RNA-Sequencing Data
    Bhandari, Sapan
    Whitener, Nathan P.
    Zhao, Konghao
    Khuri, Natalia
    13TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY AND HEALTH INFORMATICS, BCB 2022, 2022,
  • [4] Benchmarking principal component analysis for large-scale single-cell RNA-sequencing
    Koki Tsuyuzaki
    Hiroyuki Sato
    Kenta Sato
    Itoshi Nikaido
    Genome Biology, 21
  • [5] Benchmarking principal component analysis for large-scale single-cell RNA-sequencing
    Tsuyuzaki, Koki
    Sato, Hiroyuki
    Sato, Kenta
    Nikaido, Itoshi
    GENOME BIOLOGY, 2020, 21 (01)
  • [6] Benchmarking single-cell RNA-sequencing protocols for cell atlas projects
    Mereu, Elisabetta
    Lafzi, Atefeh
    Moutinho, Catia
    Ziegenhain, Christoph
    McCarthy, Davis J.
    Alvarez-Varela, Adrian
    Batlle, Eduard
    Sagar
    Gruen, Dominic
    Lau, Julia K.
    Boutet, Stephane C.
    Sanada, Chad
    Ooi, Aik
    Jones, Robert C.
    Kaihara, Kelly
    Brampton, Chris
    Talaga, Yasha
    Sasagawa, Yohei
    Tanaka, Kaori
    Hayashi, Tetsutaro
    Braeuning, Caroline
    Fischer, Cornelius
    Sauers, Sascha
    Trefzer, Timo
    Conrad, Christian
    Adiconis, Xian
    Nguyen, Lan T.
    Regev, Aviv
    Levin, Joshua Z.
    Parekh, Swati
    Janjic, Aleksandar
    Wange, Lucas E.
    Bagnoli, Johannes W.
    Enard, Wolfgang
    Gut, Marta
    Sandberg, Rickard
    Nikaido, Itoshi
    Gut, Ivo
    Stegle, Oliver
    Heyn, Holger
    NATURE BIOTECHNOLOGY, 2020, 38 (06) : 747 - +
  • [7] Benchmarking single-cell RNA-sequencing protocols for cell atlas projects
    Elisabetta Mereu
    Atefeh Lafzi
    Catia Moutinho
    Christoph Ziegenhain
    Davis J. McCarthy
    Adrián Álvarez-Varela
    Eduard Batlle
    Dominic Sagar
    Julia K. Grün
    Stéphane C. Lau
    Chad Boutet
    Aik Sanada
    Robert C. Ooi
    Kelly Jones
    Chris Kaihara
    Yasha Brampton
    Yohei Talaga
    Kaori Sasagawa
    Tetsutaro Tanaka
    Caroline Hayashi
    Cornelius Braeuning
    Sascha Fischer
    Timo Sauer
    Christian Trefzer
    Xian Conrad
    Lan T. Adiconis
    Aviv Nguyen
    Joshua Z. Regev
    Swati Levin
    Aleksandar Parekh
    Lucas E. Janjic
    Johannes W. Wange
    Wolfgang Bagnoli
    Marta Enard
    Rickard Gut
    Itoshi Sandberg
    Ivo Nikaido
    Oliver Gut
    Holger Stegle
    Nature Biotechnology, 2020, 38 : 747 - 755
  • [8] An Introduction to the Analysis of Single-Cell RNA-Sequencing Data
    AlJanahi, Aisha A.
    Danielsen, Mark
    Dunbar, Cynthia E.
    MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT, 2018, 10 : 189 - 196
  • [9] Single-Cell RNA-Sequencing in Glioma
    Eli Johnson
    Katherine L. Dickerson
    Ian D. Connolly
    Melanie Hayden Gephart
    Current Oncology Reports, 2018, 20
  • [10] Transcriptomics and single-cell RNA-sequencing
    Chambers, Daniel C.
    Carew, Alan M.
    Lukowski, Samuel W.
    Powell, Joseph E.
    RESPIROLOGY, 2019, 24 (01) : 29 - 36