scAAGA: Single cell data analysis framework using asymmetric autoencoder with gene attention

被引:64
|
作者
Meng, Rui [1 ]
Yin, Shuaidong [1 ]
Sun, Jianqiang [2 ]
Hu, Huan [3 ]
Zhao, Qi [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Peoples R China
[2] Linyi Univ, Sch Informat Sci & Engn, Linyi 276000, Peoples R China
[3] Fuzhou Univ, Inst Appl Genom, Fuzhou 350108, Peoples R China
基金
中国国家自然科学基金;
关键词
scRNA-seq; Deep learning; Gene attention; Data augmentation; COVID-19; RNA;
D O I
10.1016/j.compbiomed.2023.107414
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, single-cell RNA sequencing (scRNA-seq) has emerged as a powerful technique for investigating cellular heterogeneity and structure. However, analyzing scRNA-seq data remains challenging, especially in the context of COVID-19 research. Single-cell clustering is a key step in analyzing scRNA-seq data, and deep learning methods have shown great potential in this area. In this work, we propose a novel scRNA-seq analysis framework called scAAGA. Specifically, we utilize an asymmetric autoencoder with a gene attention module to learn important gene features adaptively from scRNA-seq data, with the aim of improving the clustering effect. We apply scAAGA to COVID19 peripheral blood mononuclear cell (PBMC) scRNA-seq data and compare its performance with state-of-the-art methods. Our results consistently demonstrate that scAAGA outperforms existing methods in terms of adjusted rand index (ARI), normalized mutual information (NMI), and adjusted mutual information (AMI) scores, achieving improvements ranging from 2.8% to 27.8% in NMI scores. Additionally, we discuss a data augmentation technology to expand the datasets and improve the accuracy of scAAGA. Overall, scAAGA presents a robust tool for scRNA-seq data analysis, enhancing the accuracy and reliability of clustering results in COVID-19 research.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] scCAN: single-cell clustering using autoencoder and network fusion
    Bang Tran
    Duc Tran
    Hung Nguyen
    Seungil Ro
    Tin Nguyen
    Scientific Reports, 12
  • [42] Comparison Analysis of Data Augmentation using Bootstrap, GANs and Autoencoder
    Nakhwan, Mukrin
    Duangsoithong, Rakkrit
    2022-14TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SMART TECHNOLOGY (KST 2022), 2022, : 18 - 23
  • [43] A surrogate modeling framework for aircraft assembly deformation using triplet attention-enhanced conditional autoencoder
    Zhang, Yifan
    Zhang, Qiang
    Hu, Ye
    Wang, Qing
    Cheng, Liang
    Ke, Yinglin
    JOURNAL OF MANUFACTURING SYSTEMS, 2024, 77 : 708 - 729
  • [44] BERMAD: batch effect removal for single-cell RNA-seq data using a multi-layer adaptation autoencoder with dual-channel framework
    Zhan, Xiangxin
    Yin, Yanbin
    Zhang, Han
    BIOINFORMATICS, 2024, 40 (03)
  • [45] Gene knockout inference with variational graph autoencoder learning single-cell gene regulatory networks
    Yang, Yongjian
    Li, Guanxun
    Zhong, Yan
    Xu, Qian
    Chen, Bo-Jia
    Lin, Yu-Te
    Chapkin, Robert S.
    Cai, James J.
    NUCLEIC ACIDS RESEARCH, 2023, 51 (13) : 6578 - 6592
  • [46] Interactive single-cell data analysis using Cellar
    Hasanaj, Euxhen
    Wang, Jingtao
    Sarathi, Arjun
    Ding, Jun
    Bar-Joseph, Ziv
    NATURE COMMUNICATIONS, 2022, 13 (01)
  • [47] Interactive single-cell data analysis using Cellar
    Euxhen Hasanaj
    Jingtao Wang
    Arjun Sarathi
    Jun Ding
    Ziv Bar-Joseph
    Nature Communications, 13
  • [48] A framework for significance analysis of gene expression data using dimension reduction methods
    Lars Gidskehaug
    Endre Anderssen
    Arnar Flatberg
    Bjørn K Alsberg
    BMC Bioinformatics, 8
  • [49] A framework for significance analysis of gene expression data using dimension reduction methods
    Gidskehaug, Lars
    Anderssen, Endre
    Flatberg, Arnar
    Alsberg, Bjorn K.
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [50] Of Gene Expression and Cell Division Time: A Mathematical Framework for Advanced Differential Gene Expression and Data Analysis
    Baum, Katharina
    Schuchhardt, Johannes
    Wolf, Jana
    Busse, Dorothea
    CELL SYSTEMS, 2019, 9 (06) : 569 - +