ARTIFICIAL BANDWIDTH EXTENSION USING CONDITIONAL VARIATIONAL AUTO-ENCODERS AND ADVERSARIAL LEARNING

被引:0
|
作者
Bachhav, Pramod [1 ]
Todisco, Massimiliano [1 ]
Evans, Nicholas [1 ]
机构
[1] EURECOM, Sophia Antipolis, France
关键词
variational auto-encoder; generative adversarial network; latent variable; artificial bandwidth extension; speech quality; NETWORK;
D O I
10.1109/icassp40776.2020.9053737
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Artificial bandwidth extension (ABE) algorithms have been developed to estimate missing highband frequency components (4-8kHz) to improve quality of narrowband (0-4kHz) telephone calls. Most ABE solutions employ deep neural networks (DNNs) due to their well-known ability to model highly complex, non-linear relationship between narrowband and highband features. Generative models such as conditional variational auto-encoders (CVAEs) are capable of modelling complex data distributions via latent representation learning. This paper reports their application to ABE. CVAEs, form of directed, graphical models, are exploited to model the probability distribution of highband features conditioned on narrowband features. While CVAEs are trained with the standard mean square criterion (MSE), their combination with adversarial learning give further improvements. When compared to results obtained with the baseline approach, the wideband PESQ is improved significantly by 0.21 points. The performance is also compared on an automatic speech recognition (ASR) task on the TIMIT dataset where word error rate (WER) is decreased by an absolute value of 0.3%.
引用
收藏
页码:6924 / 6928
页数:5
相关论文
共 50 条
  • [31] Discriminative regularization of the latent manifold of variational auto-encoders
    Kossyk, Ingo
    Marton, Zoltan-Csaba
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 61 : 121 - 129
  • [32] Unsupervised Blind Source Separation with Variational Auto-Encoders
    Neri, Julian
    Badeau, Roland
    Depalle, Philippe
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 311 - 315
  • [33] Semi-Implicit Graph Variational Auto-Encoders
    Hasanzadeh, Arman
    Hajiramezanali, Ehsan
    Duffield, Nick
    Narayanan, Krishna
    Zhou, Mingyuan
    Qian, Xiaoning
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [34] Deep variational auto-encoders for unsupervised glomerular classification
    Lutnick, Brendon
    Yacoub, Rabi
    Jen, Kuang-Yu
    Tomaszewski, John E.
    Jain, Sanjay
    Sarder, Pinaki
    MEDICAL IMAGING 2018: DIGITAL PATHOLOGY, 2018, 10581
  • [35] Return of the normal distribution: Flexible deep continual learning with variational auto-encoders
    Hong Y.
    Mundt M.
    Park S.
    Uh Y.
    Byun H.
    Neural Networks, 2022, 154 : 397 - 412
  • [36] Robust and Unsupervised KPI Anomaly Detection Based on Highly Sensitive Conditional Variational Auto-Encoders
    Yan, Shili
    Tang, Bing
    Yang, Qing
    He, Yijia
    Zhang, Xiaoyuan
    2022 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING, ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM, 2022, : 597 - 604
  • [37] Continuous Hierarchical Representations with Poincare Variational Auto-Encoders
    Mathieu, Emile
    Le Lan, Charline
    Maddison, Chris J.
    Tomioka, Ryota
    Teh, Yee Whye
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [38] A comprehensive investigation of variational auto-encoders for population synthesis
    Sane, Abdoul Razac
    Vandanjon, Pierre-Olivier
    Belaroussi, Rachid
    Hankach, Pierre
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2025, 8 (01):
  • [39] ARTIFICIAL BANDWIDTH EXTENSION USING A CONDITIONAL GENERATIVE ADVERSARIAL NETWORK WITH DISCRIMINATIVE TRAINING
    Sautter, Jonas
    Faubel, Friedrich
    Buck, Markus
    Schmidt, Gerhard
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7005 - 7009
  • [40] Learning to Design Constellation for AWGN Channel Using Auto-Encoders
    Huang, Qisheng
    Jiang, Ming
    Zhao, Chunming
    PROCEEDINGS OF THE 2019 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2019), 2019, : 154 - 159