Predicting Cell Populations in Single Cell Mass Cytometry Data

被引:41
|
作者
Abdelaal, Tamim [1 ,2 ]
van Unen, Vincent [3 ]
Hollt, Thomas [2 ,4 ]
Koning, Frits [3 ]
Reinders, Marcel J. T. [1 ,2 ]
Mahfouz, Ahmed [1 ,2 ]
机构
[1] Delft Univ Technol, Delft Bioinformat Lab, NL-2628 XE Delft, Netherlands
[2] Leiden Univ, Med Ctr, Leiden Computat Biol Ctr, Einthovenweg 20, NL-2333 ZC Leiden, Netherlands
[3] Leiden Univ, Med Ctr, Dept Immunohematol & Blood Transfus, NL-2333 ZA Leiden, Netherlands
[4] Delft Univ Technol, Comp Graph & Visualizat, NL-2628 XE Delft, Netherlands
基金
欧盟地平线“2020”;
关键词
single cell; mass cytometry; cell population prediction; machine learning; IMMUNE; FLOW; VISUALIZATION; SPACE;
D O I
10.1002/cyto.a.23738
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Mass cytometry by time-of-flight (CyTOF) is a valuable technology for high-dimensional analysis at the single cell level. Identification of different cell populations is an important task during the data analysis. Many clustering tools can perform this task, which is essential to identify "new" cell populations in explorative experiments. However, relying on clustering is laborious since it often involves manual annotation, which significantly limits the reproducibility of identifying cell-populations across different samples. The latter is particularly important in studies comparing different conditions, for example in cohort studies. Learning cell populations from an annotated set of cells solves these problems. However, currently available methods for automatic cell population identification are either complex, dependent on prior biological knowledge about the populations during the learning process, or can only identify canonical cell populations. We propose to use a linear discriminant analysis (LDA) classifier to automatically identify cell populations in CyTOF data. LDA outperforms two state-of-the-art algorithms on four benchmark datasets. Compared to more complex classifiers, LDA has substantial advantages with respect to the interpretable performance, reproducibility, and scalability to larger datasets with deeper annotations. We apply LDA to a dataset of similar to 3.5 million cells representing 57 cell populations in the Human Mucosal Immune System. LDA has high performance on abundant cell populations as well as the majority of rare cell populations, and provides accurate estimates of cell population frequencies. Further incorporating a rejection option, based on the estimated posterior probabilities, allows LDA to identify previously unknown (new) cell populations that were not encountered during training. Altogether, reproducible prediction of cell population compositions using LDA opens up possibilities to analyze large cohort studies based on CyTOF data. (C) 2019 The Authors. Cytometry Part A published by Wiley Periodicals, Inc.
引用
收藏
页码:769 / 781
页数:13
相关论文
共 50 条
  • [1] Unraveling cell populations in tumors by single-cell mass cytometry
    Di Palma, Serena
    Bodenmiller, Bernd
    CURRENT OPINION IN BIOTECHNOLOGY, 2015, 31 : 122 - 129
  • [2] The anatomy of single cell mass cytometry data
    Olsen, Lars R.
    Leipold, Michael D.
    Pedersen, Christina B.
    Maecker, Holden Terry
    CYTOMETRY PART A, 2019, 95A (02) : 156 - 172
  • [3] Characterization of adipose depot-specific stromal cell populations by single-cell mass cytometry
    Lee, Ju Hee
    Ealey, Kafi N.
    Patel, Yash
    Verma, Navkiran
    Thakkar, Nikita
    Park, So Young
    Kim, Jae-Ryong
    Sung, Hoon-Ki
    ISCIENCE, 2022, 25 (04)
  • [4] Single-cell mass cytometry adapted to measurements of the cell cycle
    Behbehani, Gregory K.
    Bendall, Sean C.
    Clutter, Matthew R.
    Fantl, Wendy J.
    Nolan, Garry P.
    CYTOMETRY PART A, 2012, 81A (07) : 552 - 566
  • [5] Performance of spectral flow cytometry and mass cytometry for the study of innate myeloid cell populations
    van der Pan, Kyra
    Khatri, Indu
    de Jager, Anniek L.
    Louis, Alesha
    Kassem, Sara
    Naber, Brigitta A. E.
    de Laat, Inge F.
    Hameetman, Marjolijn
    Comans, Suzanne E. T.
    Orfao, Alberto
    van Dongen, Jacques J. M.
    Diez, Paula
    Teodosio, Cristina
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [6] MASSIVELY MULTIPARAMETER SINGLE CELL ANALYSIS BY MASS CYTOMETRY
    Tanner, Scott
    Bandura, Dmitry
    Ornatsky, Olga
    Baranov, Vladimir
    CYTOMETRY PART B-CLINICAL CYTOMETRY, 2011, 80B (06) : 396 - 397
  • [7] DGCyTOF: Deep learning with graphic cluster visualization to predict cell types of single cell mass cytometry data
    Cheng, Lijun
    Karkhanis, Pratik
    Gokbag, Birkan
    Liu, Yueze
    Li, Lang
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (04)
  • [8] Single Cell Multiplex Mass Cytometry Reveals Differential Abundance of Specific Memory and Regulatory T Cell Populations in Human Hypertension
    Alexander, Matthew R.
    Dale, Bethany L.
    Elijovich, Fernando
    Wogsland, Cara E.
    Irish, Jonathan M.
    Madhur, Meena S.
    HYPERTENSION, 2019, 74
  • [9] Automated identification of cell populations in flow cytometry data with transformers
    Woedlinger, Matthias
    Reiter, Michael
    Weijler, Lisa
    Maurer-Granofszky, Margarita
    Schumich, Angela
    Sajaroff, Elisa O.
    Groeneveld-Krentz, Stefanie
    Rossi, Jorge G.
    Karawajew, Leonid
    Ratei, Richard
    Dworzak, Michael N.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 144
  • [10] Cell Cycle Analysis and Relevance for Single-Cell Gating in Mass Cytometry
    Rein, Idun D.
    Noto, Heidi O.
    Bostad, Monica
    Huse, Kanutte
    Stokke, Trond
    CYTOMETRY PART A, 2020, 97 (08) : 832 - 844