scAAGA: Single cell data analysis framework using asymmetric autoencoder with gene attention

被引:64
|
作者
Meng, Rui [1 ]
Yin, Shuaidong [1 ]
Sun, Jianqiang [2 ]
Hu, Huan [3 ]
Zhao, Qi [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Peoples R China
[2] Linyi Univ, Sch Informat Sci & Engn, Linyi 276000, Peoples R China
[3] Fuzhou Univ, Inst Appl Genom, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
scRNA-seq; Deep learning; Gene attention; Data augmentation; COVID-19; RNA;
D O I
10.1016/j.compbiomed.2023.107414
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, single-cell RNA sequencing (scRNA-seq) has emerged as a powerful technique for investigating cellular heterogeneity and structure. However, analyzing scRNA-seq data remains challenging, especially in the context of COVID-19 research. Single-cell clustering is a key step in analyzing scRNA-seq data, and deep learning methods have shown great potential in this area. In this work, we propose a novel scRNA-seq analysis framework called scAAGA. Specifically, we utilize an asymmetric autoencoder with a gene attention module to learn important gene features adaptively from scRNA-seq data, with the aim of improving the clustering effect. We apply scAAGA to COVID19 peripheral blood mononuclear cell (PBMC) scRNA-seq data and compare its performance with state-of-the-art methods. Our results consistently demonstrate that scAAGA outperforms existing methods in terms of adjusted rand index (ARI), normalized mutual information (NMI), and adjusted mutual information (AMI) scores, achieving improvements ranging from 2.8% to 27.8% in NMI scores. Additionally, we discuss a data augmentation technology to expand the datasets and improve the accuracy of scAAGA. Overall, scAAGA presents a robust tool for scRNA-seq data analysis, enhancing the accuracy and reliability of clustering results in COVID-19 research.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Autoencoder-based cluster ensembles for single-cell RNA-seq data analysis
    Thomas A. Geddes
    Taiyun Kim
    Lihao Nan
    James G. Burchfield
    Jean Y. H. Yang
    Dacheng Tao
    Pengyi Yang
    BMC Bioinformatics, 20
  • [22] Autoencoder-based cluster ensembles for single-cell RNA-seq data analysis
    Geddes, Thomas A.
    Kim, Taiyun
    Nan, Lihao
    Burchfield, James G.
    Yang, Jean Y. H.
    Tao, Dacheng
    Yang, Pengyi
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [23] Algorithm for Clustering Analysis of Gene Expression Data using MapReduce Framework
    Priya, P. Packia Amutha
    Lawrance, R.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [24] scCompressSA: dual-channel self-attention based deep autoencoder model for single-cell clustering by compressing gene-gene interactions
    Zhang, Wei
    Yu, Ruochen
    Xu, Zeqi
    Li, Junnan
    Gao, Wenhao
    Jiang, Mingfeng
    Dai, Qi
    BMC GENOMICS, 2024, 25 (01)
  • [25] ScMOGAE: A Graph Convolutional Autoencoder-Based Multi-omics Data Integration Framework for Single-Cell Clustering
    Zhou, Benjie
    Jiang, Hongyang
    Wang, Yuezhu
    Gu, Yujie
    Sun, Huiyan
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT I, ISBRA 2024, 2024, 14954 : 322 - 334
  • [26] An improved hierarchical variational autoencoder for cell-cell communication estimation using single-cell RNA-seq data
    Liu, Shuhui
    Zhang, Yupei
    Peng, Jiajie
    Shang, Xuequn
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (02) : 118 - 127
  • [27] Selecting gene features for unsupervised analysis of single-cell gene expression data
    Sheng, Jie
    Li, Wei Vivian
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [28] Analysis of MicroRNA Regulation and Gene Expression Variability in Single Cell Data
    Liu, Wendao
    Shomron, Noam
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (10):
  • [29] Differential variability analysis of single-cell gene expression data
    Liu, Jiayi
    Kreimer, Anat
    Li, Wei Vivian
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [30] Recovering Gene Interactions from Single-Cell Data Using Data Diffusion
    van Dijk, David
    Sharma, Roshan
    Nainys, Juozas
    Yim, Kristina
    Kathail, Pooja
    Carr, Ambrose J.
    Burdziak, Cassandra
    Moon, Kevin R.
    Chaffer, Christine L.
    Pattabiraman, Diwakar
    Bierie, Brian
    Mazutis, Linas
    Wolf, Guy
    Krishnaswamy, Smita
    Pe'er, Dana
    CELL, 2018, 174 (03) : 716 - +