Single Cell Self-Paced Clustering with Transcriptome Sequencing Data

被引:0
|
作者
Zhao, Peng [1 ]
Xu, Zenglin [2 ,3 ]
Chen, Junjie [2 ]
Ren, Yazhou [1 ,4 ]
King, Irwin [5 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[2] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[3] Peng Cheng Natl Lab, Ctr Artificial Intelligence, Shenzhen 518066, Peoples R China
[4] Inst Elect & Informat Engn UESTC Guangdong, Dongguan 523808, Peoples R China
[5] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong 999077, Peoples R China
关键词
sequencing data; scRNA-seq; clustering; self-paced learning; nonnegative matrix factorization; HETEROGENEITY;
D O I
10.3390/ijms23073900
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single cell RNA sequencing (scRNA-seq) allows researchers to explore tissue heterogeneity, distinguish unusual cell identities, and find novel cellular subtypes by providing transcriptome profiling for individual cells. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the performance of existing single-cell clustering methods is extremely sensitive to the presence of noise data and outliers. Existing clustering algorithms can easily fall into local optimal solutions. There is still no consensus on the best performing method. To address this issue, we introduce a single cell self-paced clustering (scSPaC) method with F-norm based nonnegative matrix factorization (NMF) for scRNA-seq data and a sparse single cell self-paced clustering (sscSPaC) method with l(21)-norm based nonnegative matrix factorization for scRNA-seq data. We gradually add single cells from simple to complex to our model until all cells are selected. In this way, the influences of noisy data and outliers can be significantly reduced. The proposed method achieved the best performance on both simulation data and real scRNA-seq data. A case study about human clara cells and ependymal cells scRNA-seq data clustering shows that scSPaC is more advantageous near the clustering dividing line.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] SELF-PACED SUBSPACE CLUSTERING
    Liu, Youfa
    Du, Bo
    Zhang, Lefei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 350 - 355
  • [2] Self-Paced Clustering Ensemble
    Zhou, Peng
    Du, Liang
    Liu, Xinwang
    Shen, Yi-Dong
    Fan, Mingyu
    Li, Xuejun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (04) : 1497 - 1511
  • [3] Adaptive Self-Paced Deep Clustering with Data Augmentation
    Guo, Xifeng
    Liu, Xinwang
    Zhu, En
    Zhu, Xinzhong
    Li, Miaomiao
    Xu, Xin
    Yin, Jianping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (09) : 1680 - 1693
  • [4] Deep convolutional self-paced clustering
    Rui Chen
    Yongqiang Tang
    Lei Tian
    Caixia Zhang
    Wensheng Zhang
    Applied Intelligence, 2022, 52 : 4858 - 4872
  • [5] Deep convolutional self-paced clustering
    Chen, Rui
    Tang, Yongqiang
    Tian, Lei
    Zhang, Caixia
    Zhang, Wensheng
    APPLIED INTELLIGENCE, 2022, 52 (05) : 4858 - 4872
  • [6] Self-paced deep clustering with learning loss
    Zhang, Kai
    Song, Chengyun
    Qiu, Lianpeng
    PATTERN RECOGNITION LETTERS, 2023, 171 : 8 - 14
  • [7] Self-paced multi-task clustering
    Ren, Yazhou
    Que, Xiaofan
    Yao, Dezhong
    Xu, Zenglin
    NEUROCOMPUTING, 2019, 350 : 212 - 220
  • [8] Self-paced Consensus Clustering with Bipartite Graph
    Zhou, Peng
    Du, Liang
    Li, Xuejun
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2133 - 2139
  • [9] Active Clustering Ensemble With Self-Paced Learning
    Zhou, Peng
    Sun, Bicheng
    Liu, Xinwang
    Du, Liang
    Li, Xuejun
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12186 - 12200
  • [10] Self-paced Learning for Imbalanced Data
    Zieba, Maciej
    Tomczak, Jakub M.
    Swiatek, Jerzy
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2016, PT I, 2016, 9621 : 564 - 573