AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification

被引:0
|
作者
Zhu, Xiaoshu [1 ]
Meng, Shuang [2 ]
Li, Gaoshi [2 ]
Wang, Jianxin [3 ]
Peng, Xiaoqing [4 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp & Informat Secur, Guangxi Key Lab Trusted Software, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Sch Comp Sci & Engn, Guilin 541006, Peoples R China
[3] Cent South Univ, Sch Comp Sci & Engn, Hunan Prov Key Lab Bioinformat, Changsha 400083, Peoples R China
[4] Cent South Univ, Ctr Med Genet, Sch Life Sci, Changsha 400083, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1093/bioinformatics/btae068
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Dropout events bring challenges in analyzing single-cell RNA sequencing data as they introduce noise and distort the true distributions of gene expression profiles. Recent studies focus on estimating dropout probability and imputing dropout events by leveraging information from similar cells or genes. However, the number of dropout events differs in different cells, due to the complex factors, such as different sequencing protocols, cell types, and batch effects. The dropout event differences are not fully considered in assessing the similarities between cells and genes, which compromises the reliability of downstream analysis.Results This work proposes a hybrid Generative Adversarial Network with dropouts identification to impute single-cell RNA sequencing data, named AGImpute. First, the numbers of dropout events in different cells in scRNA-seq data are differentially estimated by using a dynamic threshold estimation strategy. Next, the identified dropout events are imputed by a hybrid deep learning model, combining Autoencoder with a Generative Adversarial Network. To validate the efficiency of the AGImpute, it is compared with seven state-of-the-art dropout imputation methods on two simulated datasets and seven real single-cell RNA sequencing datasets. The results show that AGImpute imputes the least number of dropout events than other methods. Moreover, AGImpute enhances the performance of downstream analysis, including clustering performance, identifying cell-specific marker genes, and inferring trajectory in the time-course dataset.Availability and implementation The source code can be obtained from https://github.com/xszhu-lab/AGImpute.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] scCDG: A Method Based on DAE and GCN for scRNA-Seq Data Analysis
    Wang, Hai-Yun
    Zhao, Jian-Ping
    Su, Yan-Sen
    Zheng, Chun-Hou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3685 - 3694
  • [22] A statistical approach for systematic identification of transition cells from scRNA-seq data
    Wang, Yuanxin
    Dede, Merve
    Mohanty, Vakul
    Dou, Jinzhuang
    Li, Ziyi
    Chen, Ken
    CELL REPORTS METHODS, 2024, 4 (12):
  • [23] scCNC: a method based on capsule network for clustering scRNA-seq data
    Wang, Hai-Yun
    Zhao, Jian-Ping
    Zheng, Chun-Hou
    Su, Yan-Sen
    BIOINFORMATICS, 2022, 38 (15) : 3703 - 3709
  • [24] An efficient scRNA-seq dropout imputation method using graph attention network
    Chenyang Xu
    Lei Cai
    Jingyang Gao
    BMC Bioinformatics, 22
  • [25] Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation
    Ayoub Lasri
    Vahid Shahrezaei
    Marc Sturrock
    BMC Bioinformatics, 23
  • [26] Domain adaptation for supervised integration of scRNA-seq data
    Sun, Yutong
    Qiu, Peng
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [27] Recursive Clustering of Cellular Diversity in scRNA-Seq Data
    Squires, Michael
    Qiu, Peng
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2025,
  • [28] Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation
    Lasri, Ayoub
    Shahrezaei, Vahid
    Sturrock, Marc
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [29] An efficient scRNA-seq dropout imputation method using graph attention network
    Xu, Chenyang
    Cai, Lei
    Gao, Jingyang
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [30] Comparison of scRNA-seq data analysis method combinations
    Xu, Li
    Xue, Tong
    Ding, Weiyue
    Shen, Linshan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2022, 21 (06) : 433 - 440