ARTIFICIAL BANDWIDTH EXTENSION USING CONDITIONAL VARIATIONAL AUTO-ENCODERS AND ADVERSARIAL LEARNING

被引:0
|
作者
Bachhav, Pramod [1 ]
Todisco, Massimiliano [1 ]
Evans, Nicholas [1 ]
机构
[1] EURECOM, Sophia Antipolis, France
关键词
variational auto-encoder; generative adversarial network; latent variable; artificial bandwidth extension; speech quality; NETWORK;
D O I
10.1109/icassp40776.2020.9053737
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Artificial bandwidth extension (ABE) algorithms have been developed to estimate missing highband frequency components (4-8kHz) to improve quality of narrowband (0-4kHz) telephone calls. Most ABE solutions employ deep neural networks (DNNs) due to their well-known ability to model highly complex, non-linear relationship between narrowband and highband features. Generative models such as conditional variational auto-encoders (CVAEs) are capable of modelling complex data distributions via latent representation learning. This paper reports their application to ABE. CVAEs, form of directed, graphical models, are exploited to model the probability distribution of highband features conditioned on narrowband features. While CVAEs are trained with the standard mean square criterion (MSE), their combination with adversarial learning give further improvements. When compared to results obtained with the baseline approach, the wideband PESQ is improved significantly by 0.21 points. The performance is also compared on an automatic speech recognition (ASR) task on the TIMIT dataset where word error rate (WER) is decreased by an absolute value of 0.3%.
引用
收藏
页码:6924 / 6928
页数:5
相关论文
共 50 条
  • [41] 3D reconstruction of digital cores based on a model using generative adversarial networks and variational auto-encoders
    Zhang, Ting
    Xia, Pengfei
    Lu, Fangfang
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2021, 207
  • [42] An effective deep learning model for grading abnormalities in retinal fundus images using variational auto-encoders
    Sundar, Sumod
    Sumathy, Subramanian
    INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2023, 33 (01) : 92 - 107
  • [43] Reconstruction probability-based anomaly detection using variational auto-encoders
    Iqbal T.
    Qureshi S.
    International Journal of Computers and Applications, 2023, 45 (03) : 231 - 237
  • [44] A hybrid learning model based on auto-encoders
    Zhou, Ju
    Ju, Li
    Zhang, Xiaolong
    PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 522 - 528
  • [45] On the Quality of Deep Representations for Kepler Light Curves Using Variational Auto-Encoders
    Mena, Francisco
    Olivares, Patricio
    Bugueno, Margarita
    Molina, Gabriel
    Araya, Mauricio
    SIGNALS, 2021, 2 (04): : 706 - 728
  • [46] Preliminary Unknown Appliance Detection using Convolutional Variational Auto-Encoders for AAL
    de Diego-Oton, Laura
    Fuentes, David
    Pizarro, Daniel
    Hernandez, Alvaro
    Mari, Simone
    Nieto, Ruben
    2024 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS, COINS 2024, 2024, : 289 - 292
  • [47] Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-encoders
    Qi, Jinzi
    Van Hamme, Hugo
    INTERSPEECH 2021, 2021, : 1917 - 1921
  • [48] Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond
    Fei, Zhengcong
    Fan, Mingyuan
    Zhu, Li
    Huang, Junshi
    Wei, Xiaoming
    Wei, Xiaolin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24449 - 24459
  • [49] Interpretable and effective hashing via Bernoulli variational auto-encoders
    Mena, Francisco
    Nanculef, Ricardo
    Valle, Carlos
    INTELLIGENT DATA ANALYSIS, 2020, 24 (24) : S141 - S166
  • [50] Graph Auto-Encoders for Learning Edge Representations
    Rennard, Virgile
    Nikolentzos, Giannis
    Vazirgiannis, Michalis
    COMPLEX NETWORKS & THEIR APPLICATIONS IX, VOL 2, COMPLEX NETWORKS 2020, 2021, 944 : 117 - 129