scLEGA: an attention-based deep clustering method with a tendency for low expression of genes on single-cell RNA-seq data

Cited by: 4
Authors
Liu, Zhenze [1]
Liang, Yingjian [2]
Wang, Guohua [3, 4]
Zhang, Tianjiao [3]
Affiliations
[1] Northeast Forestry Univ, Aulin Coll, 26 Hexing Rd, Harbin 150040, Peoples R China
[2] Harbin Med Univ, Affiliated Hosp 1, Key Lab Hepatosplen Surg, 23 Postal St, Harbin 150001, Peoples R China
[3] Northeast Forestry Univ, Coll Comp & Control Engn, 26 Hexing Rd, Harbin 150040, Peoples R China
[4] Harbin Inst Technol, Fac Comp, 92 West Dazhi St, Harbin 150006, Peoples R China
Funding
National Science Foundation (USA); National Natural Science Foundation of China;
Keywords
scRNA-seq; multi-head attention mechanism; DAE; GAE; AUTOENCODER;
DOI
10.1093/bib/bbae371
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Single-cell RNA sequencing (scRNA-seq) enables the exploration of biological heterogeneity among different cell types within tissues at single-cell resolution. Inferring cell types within tissues is foundational for downstream research. Most existing methods for cell type inference based on scRNA-seq data primarily utilize highly variable genes (HVGs) with higher expression levels as clustering features, overlooking the contribution of HVGs with lower expression levels. To address this, we have designed a novel cell type inference method for scRNA-seq data, termed scLEGA. scLEGA employs a novel zero-inflated negative binomial (ZINB) loss function that fully considers the contribution of genes with lower expression levels and combines two distinct scRNA-seq clustering strategies through a multi-head attention mechanism. It utilizes a low-expression optimized denoising autoencoder (DAE), based on the novel ZINB model, to extract low-dimensional features and handle dropout events, and a GCN-based graph autoencoder (GAE) that leverages neighbor information to guide dimensionality reduction. The iterative fusion of denoising and topological embedding in scLEGA facilitates the acquisition of cluster-friendly cell representations in the hidden embedding, where similar cells are brought closer together. Compared to 12 state-of-the-art cell type inference methods on 15 scRNA-seq datasets, scLEGA demonstrates superior performance in clustering accuracy, scalability, and stability. The scLEGA model code is freely available at https://github.com/Masonze/scLEGA-main.
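
The abstract refers to a ZINB loss but does not give its form. As a point of reference only, the following is a minimal sketch of the standard ZINB negative log-likelihood that such losses usually start from; it is not the low-expression-optimized variant that scLEGA introduces. The function names and the parameterization (mean `mu`, dispersion `theta`, dropout probability `pi`) are assumptions chosen for illustration.

```python
import math


def nb_log_pmf(x, mu, theta):
    """Log-probability of count x under a negative binomial
    with mean mu > 0 and dispersion theta > 0 (assumed parameterization)."""
    return (
        math.lgamma(x + theta) - math.lgamma(theta) - math.lgamma(x + 1.0)
        + theta * math.log(theta / (theta + mu))
        + x * math.log(mu / (theta + mu))
    )


def zinb_nll(x, mu, theta, pi):
    """Negative log-likelihood of one count under a standard ZINB model.

    pi (0 <= pi < 1) is the probability that the count is a structural
    zero (a dropout); otherwise the count is drawn from the negative binomial.
    """
    log_nb = nb_log_pmf(x, mu, theta)
    if x == 0:
        # A zero can come from dropout or from the negative binomial itself.
        return -math.log(pi + (1.0 - pi) * math.exp(log_nb))
    # A nonzero count can only come from the negative binomial component.
    return -(math.log(1.0 - pi) + log_nb)


def mean_zinb_nll(counts, mus, thetas, pis):
    """Average ZINB negative log-likelihood over paired lists of values."""
    losses = [zinb_nll(x, m, t, p) for x, m, t, p in zip(counts, mus, thetas, pis)]
    return sum(losses) / len(losses)
```

In an autoencoder setting, `mu`, `theta`, and `pi` would typically be produced per gene and per cell by the decoder, and the averaged loss would be minimized during training. How scLEGA reweights low-expression genes within this loss is described in the paper, not here.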
Pages: 13
Related papers
50 in total
  • [21] PhytoCluster: a generative deep learning model for clustering plant single-cell RNA-seq data
    Wang, Hao
    Fu, Xiangzheng
    Liu, Lijia
    Wang, Yi
    Hong, Jingpeng
    Pan, Bintao
    Cao, Yaning
    Chen, Yanqing
    Cao, Yongsheng
    Ma, Xiaoding
    Fang, Wei
    Yan, Shen
    ABIOTECH, 2025,
  • [22] SAFE-clustering: Single-cell Aggregated (from Ensemble) clustering for single-cell RNA-seq data
    Yang, Yuchen
    Huh, Ruth
    Culpepper, Houston W.
    Lin, Yuan
    Love, Michael I.
    Li, Yun
    BIOINFORMATICS, 2019, 35 (08) : 1269 - 1277
  • [23] FEATS: feature selection-based clustering of single-cell RNA-seq data
    Vans, Edwin
    Patil, Ashwini
    Sharma, Alok
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [24] Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data
    Chunxiang Wang
    Xin Gao
    Juntao Liu
    BMC Bioinformatics, 21
  • [25] Impact of data preprocessing on cell-type clustering based on single-cell RNA-seq data
    Wang, Chunxiang
    Gao, Xin
    Liu, Juntao
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [26] scDCCA: deep contrastive clustering for single-cell RNA-seq data based on auto-encoder network
    Wang, Jing
    Xia, Junfeng
    Wang, Haiyun
    Su, Yansen
    Zheng, Chun-Hou
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [27] scDFN: enhancing single-cell RNA-seq clustering with deep fusion networks
    Liu, Tianxiang
    Jia, Cangzhi
    Bi, Yue
    Guo, Xudong
    Zou, Quan
    Li, Fuyi
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [28] Deep Batch Integration and Denoise of Single-Cell RNA-Seq Data
    Qin, Lu
    Zhang, Guangya
    Zhang, Shaoqiang
    Chen, Yong
    ADVANCED SCIENCE, 2024, 11 (29)
  • [29] scMEB: a fast and clustering-independent method for detecting differentially expressed genes in single-cell RNA-seq data
    Jiadi Zhu
    Youlong Yang
    BMC Genomics, 24
  • [30] scMEB: a fast and clustering-independent method for detecting differentially expressed genes in single-cell RNA-seq data
    Zhu, Jiadi
    Yang, Youlong
    BMC GENOMICS, 2023, 24 (01)