ATTENTION-BASED WAVENET AUTOENCODER FOR UNIVERSAL VOICE CONVERSION

被引:0
|
作者
Polyak, Adam [1 ]
Wolf, Lior
机构
[1] Facebook AI Res, Cambridge, MA 02142 USA
关键词
D O I
10.1109/icassp.2019.8682589
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a method for converting any voice to a target voice. The method is based on a WaveNet autoencoder, with the addition of a novel attention component that supports the modification of timing between the input and the output samples. Training the attention is done in an unsupervised way, by teaching the neural network to recover the original timing from an artificially modified one. Adding a generic voice robot, which we convert to the target voice, we present a robust Text To Speech pipeline that is able to train without any transcript. Our experiments show that the proposed method is able to recover the timing of the speaker and that the proposed pipeline provides a competitive Text To Speech method.
引用
收藏
页码:6800 / 6804
页数:5
相关论文
共 50 条
  • [41] Learning Normal Patterns via Adversarial Attention-Based Autoencoder for Abnormal Event Detection in Videos
    Song, Hao
    Sun, Che
    Wu, Xinxiao
    Chen, Mei
    Jia, Yunde
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (08) : 2138 - 2148
  • [42] Attention-Based Variational Autoencoder Models for Human-Human Interaction Recognition via Generation
    Banerjee, Bonny
    Baruah, Murchana
    SENSORS, 2024, 24 (12)
  • [43] Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion
    Akuzawa, Kei
    Onishi, Kotaro
    Takiguchi, Keisuke
    Mametani, Kohki
    Mori, Koichiro
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 808 - 813
  • [44] An Attention-Based ConvLSTM Autoencoder with Dynamic Thresholding for Unsupervised Anomaly Detection in Multivariate Time Series
    Tayeh, Tareq
    Aburakhia, Sulaiman
    Myers, Ryan
    Shami, Abdallah
    MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2022, 4 (02): : 350 - 370
  • [45] A3N: Attention-based adversarial autoencoder network for detecting anomalies in video sequence
    Aslam, Nazia
    Rai, Prateek Kumar
    Kolekar, Maheshkumar H.
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 87
  • [46] RUL Prediction Using a Fusion of Attention-Based Convolutional Variational AutoEncoder and Ensemble Learning Classifier
    Remadna, Ikram
    Terrissa, Labib Sadek
    Al Masry, Zeina
    Zerhouni, Noureddine
    IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (01) : 106 - 124
  • [47] Attention-Based Convolutional Denoising Autoencoder for Two-Lead ECG Denoising and Arrhythmia Classification
    Singh, Prateek
    Sharma, Ambalika
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [48] An Automatic Grading Model for Semantic Complexity of English Texts Using Bidirectional Attention-Based Autoencoder
    Chen, Ruo Han
    Ng, Boon Sim
    Paramasivam, Shamala
    Ren, Li
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024,
  • [49] Attention-based similarity
    Stentiford, Fred
    PATTERN RECOGNITION, 2007, 40 (03) : 771 - 783
  • [50] Real-Time GNSS Spoofing Detection for Autonomous Vehicles: An Attention-Based Autoencoder Approach
    Yang, Huan
    Liu, Guoqiang
    Zhao, Chunyang
    Wen, Mingxing
    Wang, Yuanzhe
    2024 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, ICARCV, 2024, : 232 - 237