AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification

被引:0
|
作者
Zhu, Xiaoshu [1 ]
Meng, Shuang [2 ]
Li, Gaoshi [2 ]
Wang, Jianxin [3 ]
Peng, Xiaoqing [4 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp & Informat Secur, Guangxi Key Lab Trusted Software, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541006, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Hunan Prov Key Lab Bioinformat, Changsha 400083, Peoples R China
[4] Cent South Univ, Ctr Med Genet, Sch Life Sci, Changsha 400083, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/btae068
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Dropout events bring challenges in analyzing single-cell RNA sequencing data as they introduce noise and distort the true distributions of gene expression profiles. Recent studies focus on estimating dropout probability and imputing dropout events by leveraging information from similar cells or genes. However, the number of dropout events differs in different cells, due to the complex factors, such as different sequencing protocols, cell types, and batch effects. The dropout event differences are not fully considered in assessing the similarities between cells and genes, which compromises the reliability of downstream analysis.Results This work proposes a hybrid Generative Adversarial Network with dropouts identification to impute single-cell RNA sequencing data, named AGImpute. First, the numbers of dropout events in different cells in scRNA-seq data are differentially estimated by using a dynamic threshold estimation strategy. Next, the identified dropout events are imputed by a hybrid deep learning model, combining Autoencoder with a Generative Adversarial Network. To validate the efficiency of the AGImpute, it is compared with seven state-of-the-art dropout imputation methods on two simulated datasets and seven real single-cell RNA sequencing datasets. The results show that AGImpute imputes the least number of dropout events than other methods. Moreover, AGImpute enhances the performance of downstream analysis, including clustering performance, identifying cell-specific marker genes, and inferring trajectory in the time-course dataset.Availability and implementation The source code can be obtained from https://github.com/xszhu-lab/AGImpute.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Predicting lung aging using scRNA-Seq data
    Song, Qi
    Singh, Alex
    Mcdonough, John E.
    Adams, Taylor S.
    Vos, Robin
    De Man, Ruben
    Myers, Greg
    Ceulemans, Laurens J.
    Vanaudenaerde, Bart M.
    Wuyts, Wim A.
    Yan, Xiting
    Schuppe, Jonas
    Hagood, James S.
    Kaminski, Naftali
    Bar-Joseph, Ziv
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (12)
  • [32] Domain adaptation for supervised integration of scRNA-seq data
    Yutong Sun
    Peng Qiu
    Communications Biology, 6
  • [33] Visualizing scRNA-Seq data at population scale with GloScope
    Wang, Hao
    Torous, William
    Gong, Boying
    Purdom, Elizabeth
    GENOME BIOLOGY, 2024, 25 (01):
  • [34] scAlign: a tool for alignment, integration, and rare cell identification from scRNA-seq data
    Nelson Johansen
    Gerald Quon
    Genome Biology, 20
  • [35] scAlign: a tool for alignment, integration, and rare cell identification from scRNA-seq data
    Johansen, Nelson
    Quon, Gerald
    GENOME BIOLOGY, 2019, 20 (01):
  • [36] A clustering method for small scRNA-seq data based on subspace and weighted distance
    Ning, Zilan
    Dai, Zhijun
    Zhang, Hongyan
    Chen, Yuan
    Yuan, Zheming
    PEERJ, 2023, 11 : 28 - 28
  • [37] SCDRHA: A scRNA-Seq Data Dimensionality Reduction Algorithm Based on Hierarchical Autoencoder
    Zhao, Jianping
    Wang, Na
    Wang, Haiyun
    Zheng, Chunhou
    Su, Yansen
    FRONTIERS IN GENETICS, 2021, 12
  • [38] COTAN: scRNA-seq data analysis based on gene co-expression
    Galfre, Silvia Giulia
    Morandin, Francesco
    Pietrosanto, Marco
    Cremisi, Federico
    Helmer-Citterich, Manuela
    NAR GENOMICS AND BIOINFORMATICS, 2021, 3 (03)
  • [39] Deep enhanced constraint clustering based on contrastive learning for scRNA-seq data
    Gan, Yanglan
    Chen, Yuhan
    Xu, Guangwei
    Guo, Wenjing
    Zou, Guobing
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [40] Integration tools for scRNA-seq data and spatial transcriptomics sequencing data
    Yan, Chaorui
    Zhu, Yanxu
    Chen, Miao
    Yang, Kainan
    Cui, Feifei
    Zou, Quan
    Zhang, Zilong
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (04) : 295 - 302