Artificial neural network approach to the modelling of prosody in the speech synthesizer of the Czech language

被引:0
|
作者
Tuckova, Jana [1 ]
Sebesta, Vaclav [2 ]
机构
[1] Czech Tech Univ, Fac Elect Engn, Prague, Czech Republic
[2] Czech Tech Univ, Acad Sci Czech Republ, Inst Comp Sci, Prague, Czech Republic
关键词
neural networks; prosody modelling; pruning method;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this contribution we try to describe the optimal choice of phonetic and phonologic parameters, which are necessary for prosody modelling. The rule-based approach [5] or the artificial neural networks (ANN) can be used for prosody control. According to our experience ANNs are able to achieve better results. If the prosody of the speech synthesizer is controlled by an artificial neural network (ANN), an optimisation of the ANN topology is one of the most important problems. The application of a supervised neural network has been used for prosody parameters determination in the process of prosody modelling. The pruning of neural networks based on the GUHA method [10] or the utilization of the synaptic weights sensitivities can be suitable tools for the minimization of the number of input parameters, and for the reduction of the neural network structure redundancy. The automatic system, designed for the preprocessing of written text, training the ANN by the speech of suitable speaker and prosody modelling are the main goals of our research. The ANN dedicated for prosody control is able to model prosodic parameters in a quality, which may be comparable with natural speech. The specific attributes of national languages must be taken into account. From this point of view the Czech, similarly as the other Slavonic languages, is more difficult than English or German.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 50 条
  • [31] An Artificial Neural Network Approach for Sentence Boundary Disambiguation in Urdu Language Text
    Raj, Shazia
    Rehman, Zobia
    Rauf, Sonia
    Siddique, Rehana
    Anwar, Muhammad
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2015, 12 (04) : 395 - 400
  • [32] Real time conversion of sign language to speech and prediction of gestures using Artificial Neural Network
    Abraham, Abey
    Rohini, V
    8TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATIONS (ICACC-2018), 2018, 143 : 587 - 594
  • [33] An optimized neural network based prosody model of Chinese speech synthesis system
    Tao, JH
    Cai, LH
    Tropf, H
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 477 - 480
  • [34] Neural network approach to speech pathology
    Salvatore, AP
    Thorne, NA
    Gross, CM
    Cannito, MP
    42ND MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS, VOLS 1 AND 2, 1999, : 439 - 442
  • [35] Neural network approach to speech recognition
    Lee, Y.C.
    Chen, H.H.
    Sun, G.Z.
    Neural Networks, 1988, 1 (1 SUPPL)
  • [36] Is artificial neural network an ideal modelling technique?
    Ozden, Sabri
    Saylam, Baris
    Tez, Mesut
    JOURNAL OF CRITICAL CARE, 2017, 40 : 292 - 292
  • [37] Modelling the SOFC behaviours by artificial neural network
    Milewski, Jaroslaw
    Swirski, Konrad
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2009, 34 (13) : 5546 - 5553
  • [38] Artificial Neural Network modelling of sorption chillers
    Frey, Patrick
    Fischer, Stephan
    Drueck, Harald
    SOLAR ENERGY, 2014, 108 : 525 - 537
  • [39] Artificial neural network for modelling thermal decompositions
    Conesa, JA
    Caballero, JA
    Reyes-Labarta, JA
    JOURNAL OF ANALYTICAL AND APPLIED PYROLYSIS, 2004, 71 (01) : 343 - 352
  • [40] A Recurrent Neural Network-Based Approach to Automatic Language Identification from Speech
    Mukherjee, Himadri
    Dhar, Ankita
    Obaidullah, Sk Md
    Santosh, K. C.
    Phadikar, Santanu
    Roy, Kaushik
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMMUNICATION, DEVICES AND COMPUTING, 2020, 602 : 441 - 450