Investigating accuracy of pitch-accent annotations in neural network-based

被引:0
|
作者
Luong, Hieu-Thi [1 ]
Wang, Xin [1 ]
Yamagishi, Junichi [1 ]
Nishizawa, Nobuyuki [2 ]
机构
[1] Natl Inst Informat, Tokyo, Japan
[2] KDDI Res Inc, Saitama, Japan
关键词
speech synthesis; deep neural network; Japanese prosody; WaveNet;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigated the impact of noisy linguistic features on the performance of a Japanese speech synthesis system based on neural network that uses WaveNet vocoder. We compared an ideal system that uses manually corrected linguistic features including phoneme and prosodic information in training and test sets against a few other systems that use corrupted linguistic features. Both subjective and objective results demonstrate that corrupted linguistic features, especially those in the test set, affected the ideal system's performance significantly in a statistical sense due to a mismatched condition between the training and test sets. Interestingly, while an utterance-level Turing test showed that listeners had a difficult time differentiating synthetic speech from natural speech, it further indicated that adding noise to the linguistic features in the training set can partially reduce the effect of the mismatch, regularize the model, and help the system perform better when linguistic features of the test set are noisy.
引用
收藏
页码:37 / 41
页数:5
相关论文
共 50 条
  • [31] Beyond-Accuracy Perspectives on Graph Neural Network-Based Models for Behavioural User Profiling
    Purificato, Erasmo
    PROCEEDINGS OF THE 30TH ACM CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION, UMAP 2022, 2022, : 311 - 315
  • [32] A neural network-based prediction model for fine pitch stencil-printing quality in surface mount assembly
    Yang, T
    Tsai, TN
    Yeh, JW
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2005, 18 (03) : 335 - 341
  • [33] Neural network-based control of an ultrafast laser
    Aslam, A.
    Biedron, S. G.
    Ma, Y.
    Murphy, J.
    Burger, M.
    Nees, J.
    Thomas, A. G. R.
    Krushelnick, K.
    Martinez-Ramon, M.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2023, 1053
  • [34] Neural Network-based Taxonomic Clustering for Metagenomics
    Essinger, Steven D.
    Polikar, Robi
    Rosen, Gail L.
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [35] A neural network-based ionospheric model for Arecibo
    Friedrich, M.
    Fankhauser, M.
    Oyeyemi, E.
    McKinnell, L. A.
    ADVANCES IN SPACE RESEARCH, 2008, 42 (04) : 776 - 781
  • [36] Neural network-based blockchain decision scheme
    Abdo, Jacques Bou
    Zeadally, Sherali
    INFORMATION SECURITY JOURNAL, 2021, 30 (03): : 173 - 187
  • [37] Neural network-based vision guided robotics
    Natl Research Council of Canada
    Proc IEEE Int Conf Rob Autom, (281-286):
  • [38] Neural Network-Based Limiter with Transfer Learning
    Abgrall, Remi
    Han Veiga, Maria
    COMMUNICATIONS ON APPLIED MATHEMATICS AND COMPUTATION, 2020,
  • [39] Neural Network-Based Estimation for OFDM Channels
    Cheng, Chia-Hsin
    Huang, Yung-Fa
    Huang, Yao-Hung
    Chen, Hsing-Chung
    Yao, Tsung-Yu
    2015 IEEE 29th International Conference on Advanced Information Networking and Applications (IEEE AINA 2015), 2015, : 600 - 604
  • [40] Artificial neural network-based environmental models
    Boznar, MZ
    Mlakar, P
    AIR POLLUTION MODELING AND ITS APPLICATION XIV, 2001, : 483 - 492