ARTIFICIAL BANDWIDTH EXTENSION USING CONDITIONAL VARIATIONAL AUTO-ENCODERS AND ADVERSARIAL LEARNING

被引：0

作者：

Bachhav, Pramod ^{[1
]}

Todisco, Massimiliano ^{[1
]}

Evans, Nicholas ^{[1
]}

机构：

[1] EURECOM, Sophia Antipolis, France

来源：

2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年

关键词：

variational auto-encoder; generative adversarial network; latent variable; artificial bandwidth extension; speech quality; NETWORK;

D O I：

10.1109/icassp40776.2020.9053737

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Artificial bandwidth extension (ABE) algorithms have been developed to estimate missing highband frequency components (4-8kHz) to improve quality of narrowband (0-4kHz) telephone calls. Most ABE solutions employ deep neural networks (DNNs) due to their well-known ability to model highly complex, non-linear relationship between narrowband and highband features. Generative models such as conditional variational auto-encoders (CVAEs) are capable of modelling complex data distributions via latent representation learning. This paper reports their application to ABE. CVAEs, form of directed, graphical models, are exploited to model the probability distribution of highband features conditioned on narrowband features. While CVAEs are trained with the standard mean square criterion (MSE), their combination with adversarial learning give further improvements. When compared to results obtained with the baseline approach, the wideband PESQ is improved significantly by 0.21 points. The performance is also compared on an automatic speech recognition (ASR) task on the TIMIT dataset where word error rate (WER) is decreased by an absolute value of 0.3%.

引用

页码：6924 / 6928

页数：5

共 50 条

[41] 3D reconstruction of digital cores based on a model using generative adversarial networks and variational auto-encoders
Zhang, Ting
Xia, Pengfei
Lu, Fangfang
JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2021, 207
[42] An effective deep learning model for grading abnormalities in retinal fundus images using variational auto-encoders
Sundar, Sumod
Sumathy, Subramanian
INTERNATIONAL JOURNAL OF IMAGING SYSTEMS AND TECHNOLOGY, 2023, 33 (01) : 92 - 107
[43] Reconstruction probability-based anomaly detection using variational auto-encoders
Iqbal T.
Qureshi S.
International Journal of Computers and Applications, 2023, 45 (03) : 231 - 237
[44] A hybrid learning model based on auto-encoders
Zhou, Ju
Ju, Li
Zhang, Xiaolong
PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 522 - 528
[45] On the Quality of Deep Representations for Kepler Light Curves Using Variational Auto-Encoders
Mena, Francisco
Olivares, Patricio
Bugueno, Margarita
Molina, Gabriel
Araya, Mauricio
SIGNALS, 2021, 2 (04): : 706 - 728
[46] Preliminary Unknown Appliance Detection using Convolutional Variational Auto-Encoders for AAL
de Diego-Oton, Laura
Fuentes, David
Pizarro, Daniel
Hernandez, Alvaro
Mari, Simone
Nieto, Ruben
2024 IEEE INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS, COINS 2024, 2024, : 289 - 292
[47] Speech Disorder Classification Using Extended Factorized Hierarchical Variational Auto-encoders
Qi, Jinzi
Van Hamme, Hugo
INTERSPEECH 2021, 2021, : 1917 - 1921
[48] Masked Auto-Encoders Meet Generative Adversarial Networks and Beyond
Fei, Zhengcong
Fan, Mingyuan
Zhu, Li
Huang, Junshi
Wei, Xiaoming
Wei, Xiaolin
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 24449 - 24459
[49] Interpretable and effective hashing via Bernoulli variational auto-encoders
Mena, Francisco
Nanculef, Ricardo
Valle, Carlos
INTELLIGENT DATA ANALYSIS, 2020, 24 (24) : S141 - S166
[50] Graph Auto-Encoders for Learning Edge Representations
Rennard, Virgile
Nikolentzos, Giannis
Vazirgiannis, Michalis
COMPLEX NETWORKS & THEIR APPLICATIONS IX, VOL 2, COMPLEX NETWORKS 2020, 2021, 944 : 117 - 129

← 1 2 3 4 5 →