A novel f-divergence based generative adversarial imputation method for scRNA-seq data analysis

被引:2
|
作者
Si, Tong [1 ]
Hopkins, Zackary [2 ]
Yanev, John [2 ]
Hou, Jie [2 ]
Gong, Haijun [1 ]
机构
[1] St Louis Univ, Dept Math & Stat, St Louis, MO 63103 USA
[2] St Louis Univ, Dept Comp Sci, St Louis, MO USA
来源
PLOS ONE | 2023年 / 18卷 / 11期
基金
美国国家卫生研究院;
关键词
SINGLE; GENES;
D O I
10.1371/journal.pone.0292792
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Comprehensive analysis of single-cell RNA sequencing (scRNA-seq) data can enhance our understanding of cellular diversity and aid in the development of personalized therapies for individuals. The abundance of missing values, known as dropouts, makes the analysis of scRNA-seq data a challenging task. Most traditional methods made assumptions about specific distributions for missing values, which limit their capability to capture the intricacy of high-dimensional scRNA-seq data. Moreover, the imputation performance of traditional methods decreases with higher missing rates. We propose a novel f-divergence based generative adversarial imputation method, called sc-fGAIN, for the scRNA-seq data imputation. Our studies identify four f-divergence functions, namely cross-entropy, Kullback-Leibler (KL), reverse KL, and Jensen-Shannon, that can be effectively integrated with the generative adversarial imputation network to generate imputed values without any assumptions, and mathematically prove that the distribution of imputed data using sc-fGAIN algorithm is same as the distribution of original data. Real scRNA-seq data analysis has shown that, compared to many traditional methods, the imputed values generated by sc-fGAIN algorithm have a smaller root-mean-square error, and it is robust to varying missing rates, moreover, it can reduce imputation variability. The flexibility offered by the f-divergence allows the sc-fGAIN method to accommodate various types of data, making it a more universal approach for imputing missing values of scRNA-seq data.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Imputation in Scrna-seq Data Using Supervised Deep Generative Networks
    Tang, Jianxiong
    Zou, Jianxiao
    Fan, Mei
    Tian, Qi
    Fan, Shicai
    2021 8TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS RESEARCH AND APPLICATIONS, ICBRA 2021, 2021, : 1 - 7
  • [2] FRMC: a fast and robust method for the imputation of scRNA-seq data
    Wu, Honglong
    Wang, Xuebin
    Chu, Mengtian
    Xiang, Ruizhi
    Zhou, Ke
    RNA BIOLOGY, 2021, 18 : 172 - 181
  • [3] scGCL: an imputation method for scRNA-seq data based on graph contrastive learning
    Xiong, Zehao
    Luo, Jiawei
    Shi, Wanwan
    Liu, Ying
    Xu, Zhongyuan
    Wang, Bo
    BIOINFORMATICS, 2023, 39 (03)
  • [4] A novel batch-effect correction method for scRNA-seq data based on Adversarial Information Factorization
    Monnier, Lily
    Cournede, Paul-Henry
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (02)
  • [5] Comparison of scRNA-seq data analysis method combinations
    Xu, Li
    Xue, Tong
    Ding, Weiyue
    Shen, Linshan
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2022, 21 (06) : 433 - 440
  • [6] scCDG: A Method Based on DAE and GCN for scRNA-Seq Data Analysis
    Wang, Hai-Yun
    Zhao, Jian-Ping
    Su, Yan-Sen
    Zheng, Chun-Hou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3685 - 3694
  • [7] Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation
    Ayoub Lasri
    Vahid Shahrezaei
    Marc Sturrock
    BMC Bioinformatics, 23
  • [8] Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation
    Lasri, Ayoub
    Shahrezaei, Vahid
    Sturrock, Marc
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [9] SCC: an accurate imputation method for scRNA-seq dropouts based on a mixture model
    Zheng, Yan
    Zhong, Yuanke
    Hu, Jialu
    Shang, Xuequn
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [10] AGImpute: imputation of scRNA-seq data based on a hybrid GAN with dropouts identification
    Zhu, Xiaoshu
    Meng, Shuang
    Li, Gaoshi
    Wang, Jianxin
    Peng, Xiaoqing
    BIOINFORMATICS, 2024, 40 (02)