Boosting scRNA-seq data clustering by cluster-aware feature weighting

被引:3
|
作者
Li, Rui-Yi [1 ]
Guan, Jihong [1 ]
Zhou, Shuigeng [2 ,3 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, 4800 Caoan Rd, Shanghai 201804, Peoples R China
[2] Fudan Univ, Shanghai Key Lab Intelligent Informat Proc, 220 Handan Rd, Shanghai 200433, Peoples R China
[3] Fudan Univ, Sch Comp Sci, 220 Handan Rd, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
Single cell RNA sequencing; Feature weighting; feature selection; Clustering; MESSENGER-RNA-SEQ; CELL-TYPES; SINGLE; HETEROGENEITY; CLASSIFICATION; RECONSTRUCTION;
D O I
10.1186/s12859-021-04033-7
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background The rapid development of single-cell RNA sequencing (scRNA-seq) enables the exploration of cell heterogeneity, which is usually done by scRNA-seq data clustering. The essence of scRNA-seq data clustering is to group cells by measuring the similarities among genes/transcripts of cells. And the selection of features for cell similarity evaluation is of great importance, which will significantly impact clustering effectiveness and efficiency. Results In this paper, we propose a novel method called CaFew to select genes based on cluster-aware feature weighting. By optimizing the clustering objective function, CaFew obtains a feature weight matrix, which is further used for feature selection. The genes have large weights in at least one cluster or the genes whose weights vary greatly in different clusters are selected. Experiments on 8 real scRNA-seq datasets show that CaFew can obviously improve the clustering performance of existing scRNA-seq data clustering methods. Particularly, the combination of CaFew with SC3 achieves the state-of-art performance. Furthermore, CaFew also benefits the visualization of scRNA-seq data. Conclusion CaFew is an effective scRNA-seq data clustering method due to its gene selection mechanism based on cluster-aware feature weighting, and it is a useful tool for scRNA-seq data analysis.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Clustering Deviation Index (CDI): a robust and accurate internal measure for evaluating scRNA-seq data clustering
    Jiyuan Fang
    Cliburn Chan
    Kouros Owzar
    Liuyang Wang
    Diyuan Qin
    Qi-Jing Li
    Jichun Xie
    Genome Biology, 23
  • [42] Domain adaptation for supervised integration of scRNA-seq data
    Sun, Yutong
    Qiu, Peng
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [43] Comparison of scRNA-seq data analysis method combinations
    Xu, Li
    Xue, Tong
    Ding, Weiyue
    Shen, Linshan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2022, 21 (06) : 433 - 440
  • [44] Predicting lung aging using scRNA-Seq data
    Song, Qi
    Singh, Alex
    Mcdonough, John E.
    Adams, Taylor S.
    Vos, Robin
    De Man, Ruben
    Myers, Greg
    Ceulemans, Laurens J.
    Vanaudenaerde, Bart M.
    Wuyts, Wim A.
    Yan, Xiting
    Schuppe, Jonas
    Hagood, James S.
    Kaminski, Naftali
    Bar-Joseph, Ziv
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (12)
  • [45] Domain adaptation for supervised integration of scRNA-seq data
    Yutong Sun
    Peng Qiu
    Communications Biology, 6
  • [46] Visualizing scRNA-Seq data at population scale with GloScope
    Wang, Hao
    Torous, William
    Gong, Boying
    Purdom, Elizabeth
    GENOME BIOLOGY, 2024, 25 (01):
  • [47] Dual-GCN-based deep clustering with triplet contrast for ScRNA-seq data analysis?
    Wang, Linjie
    Li, Wei
    Xie, Weidong
    Wang, Rui
    Yu, Kun
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2023, 106
  • [48] UICPC: Centrality-based clustering for scRNA-seq data analysis without user input
    Chowdhury, Hussain Ahmed
    Bhattacharyya, Dhruba Kumar
    Kalita, Jugal Kumar
    COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 137
  • [49] Local-Global Graph Fusion to Enhance scRNA-Seq Clustering
    Du, Lin
    Han, Yehong
    IEEE ACCESS, 2024, 12 : 165371 - 165383
  • [50] Entropy subspace separation-based clustering for noise reduction (ENCORE) of scRNA-seq data
    Song, Jia
    Liu, Yao
    Zhang, Xuebing
    Wu, Qiuyue
    Gao, Juan
    Wang, Wei
    Li, Jin
    Song, Yanling
    Yang, Chaoyong
    NUCLEIC ACIDS RESEARCH, 2021, 49 (03) : E18