scLEGA: an attention-based deep clustering method with a tendency for low expression of genes on single-cell RNA-seq data

被引:4
|
作者
Liu, Zhenze [1 ]
Liang, Yingjian [2 ]
Wang, Guohua [3 ,4 ]
Zhang, Tianjiao [3 ]
机构
[1] Northeast Forestry Univ, Aulin Coll, 26 Hexing Rd, Harbin 150040, Peoples R China
[2] Harbin Med Univ, Affiliated Hosp 1, Key Lab Hepatosplen Surg, 23 Postal St, Harbin 150001, Peoples R China
[3] Northeast Forestry Univ, Coll Comp & Control Engn, 26 Hexing Rd, Harbin 150040, Peoples R China
[4] Harbin Inst Technol, Fac Comp, 92 West Dazhi St, Harbin 150006, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
scRNA-seq; multi-head attention mechanism; DAE; GAE; AUTOENCODER;
D O I
10.1093/bib/bbae371
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA sequencing (scRNA-seq) enables the exploration of biological heterogeneity among different cell types within tissues at a resolution. Inferring cell types within tissues is foundational for downstream research. Most existing methods for cell type inference based on scRNA-seq data primarily utilize highly variable genes (HVGs) with higher expression levels as clustering features, overlooking the contribution of HVGs with lower expression levels. To address this, we have designed a novel cell type inference method for scRNA-seq data, termed scLEGA. scLEGA employs a novel zero-inflated negative binomial (ZINB) loss function that fully considers the contribution of genes with lower expression levels and combines two distinct scRNA-seq clustering strategies through a multi-head attention mechanism. It utilizes a low-expression optimized denoising autoencoder, based on the novel ZINB model, to extract low-dimensional features and handle dropout events, and a GCN-based graph autoencoder (GAE) that leverages neighbor information to guide dimensionality reduction. The iterative fusion of denoising and topological embedding in scLEGA facilitates the acquisition of cluster-friendly cell representations in the hidden embedding, where similar cells are brought closer together. Compared to 12 state-of-the-art cell type inference methods on 15 scRNA-seq datasets, scLEGA demonstrates superior performance in clustering accuracy, scalability, and stability. Our scLEGA model codes are freely available at https://github.com/Masonze/scLEGA-main.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] scDFC: A deep fusion clustering method for single-cell RNA-seq data
    Hu, Dayu
    Liang, Ke
    Zhou, Sihang
    Tu, Wenxuan
    Liu, Meng
    Liu, Xinwang
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [2] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [3] Single-cell RNA-seq data clustering by deep information fusion
    Ren, Liangrui
    Wang, Jun
    Li, Wei
    Guo, Maozu
    Yu, Guoxian
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (02) : 128 - 137
  • [4] A deep matrix factorization based approach for single-cell RNA-seq data clustering
    Liang, Zhenlan
    Zheng, Ruiqing
    Chen, Siqi
    Yan, Xuhua
    Li, Min
    METHODS, 2022, 205 : 114 - 122
  • [5] A parameter-free deep embedded clustering method for single-cell RNA-seq data
    Zeng, Yuansong
    Wei, Zhuoyi
    Zhong, Fengqi
    Pan, Zixiang
    Lu, Yutong
    Yang, Yuedong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [6] Multiobjective Deep Clustering and Its Applications in Single-cell RNA-seq Data
    Wang, Yunhe
    Bian, Chuang
    Wong, Ka-Chun
    Li, Xiangtao
    Yang, Shengxiang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (08): : 5016 - 5027
  • [7] CASCC: a co-expression-assisted single-cell RNA-seq data clustering method
    Cai, Lingyi
    Anastassiou, Dimitris
    BIOINFORMATICS, 2024, 40 (05)
  • [8] Clustering single-cell RNA-seq data with a model-based deep learning approach
    Tian, Tian
    Wan, Ji
    Song, Qi
    Wei, Zhi
    NATURE MACHINE INTELLIGENCE, 2019, 1 (04) : 191 - 198
  • [9] Phenotype prediction from single-cell RNA-seq data using attention-based neural networks
    Mao, Yuzhen
    Lin, Yen-Yi
    Wong, Nelson K. Y.
    Volik, Stanislav
    Sar, Funda
    Collins, Colin
    Ester, Martin
    BIOINFORMATICS, 2024, 40 (02)
  • [10] Clustering single-cell RNA-seq data with a model-based deep learning approach
    Tian Tian
    Ji Wan
    Qi Song
    Zhi Wei
    Nature Machine Intelligence, 2019, 1 : 191 - 198