Development of robotic voice conversion for RIBO using text-to-speech synthesis

Authors
Hossain, Md. Jakir [1]
Al Amin, Sayed Mahmud [1]
Islam, Md. Saiful [1]
Marium-E-Jannat [1]
Affiliation
[1] Shahjalal Univ Sci & Technol Sylhet, Dept Comp Sci & Engn, Sylhet, Bangladesh
Keywords
TTS; RIBO; Diode; Ring modulator; VCA; Transformer; RF
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
RIBO is the first social interaction robot in Bangladesh. It was designed and developed by the 'ROBO SUST' team of Shahjalal University of Science and Technology. RIBO can move its hands and eyes up and down, walk very slowly, and speak some recorded Bengali sentences. The 'ROBO SUST' team is now developing RIBO further so that it can communicate with humans. One part of this communication is converting Bengali text to Bengali speech in a robotic voice. In this article, we propose a method that converts Bengali text to speech in a robotic voice using the Google text-to-speech (TTS) system and a ring modulator. Several existing TTS synthesizers can convert Bengali text to Bengali speech; among them, the Google TTS system performs better for Bengali. Hence, we use the Google TTS system to produce Bengali speech from any written Bengali text. The Google TTS synthesizer produces speech as an audio object file, which can be converted to an .mp3 file. We then modify this .mp3 file using the characteristics of a diode and the ring modulator concept to obtain a machine voice. After changing the pitch and speed of this machine voice, we obtain the final robotic voice, which will be used as RIBO's voice.
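The post-processing stage described in the abstract (diode-based ring modulation followed by a pitch and speed change) might look roughly like the sketch below. This is not the authors' implementation. The diode curve parameters (`vb`, `vl`, `h`), the 30 Hz carrier, and the function names `diode`, `ring_modulate`, and `resample` are all assumptions chosen for illustration. The sketch also assumes the TTS output has already been decoded from .mp3 into a list of float samples in the range -1 to 1; the TTS call and the decoding step are not shown.

```python
import math


def diode(v, vb=0.2, vl=0.4, h=1.0):
    """Simplified diode curve (assumed parameters): off below vb,
    quadratic knee between vb and vl, linear with slope h above vl."""
    if v <= vb:
        return 0.0
    knee = h * (vl - vb) ** 2 / (2.0 * vl - 2.0 * vb)
    if v <= vl:
        return h * (v - vb) ** 2 / (2.0 * vl - 2.0 * vb)
    return h * v - h * vl + knee


def ring_modulate(samples, sample_rate, carrier_hz=30.0):
    """Pass a voice signal through a four-diode ring driven by a sine carrier.
    carrier_hz is a hypothetical setting, not a value from the paper."""
    out = []
    for n, x in enumerate(samples):
        c = math.sin(2.0 * math.pi * carrier_hz * n / sample_rate)
        a = c + 0.5 * x  # carrier plus half the input
        b = c - 0.5 * x  # carrier minus half the input
        # Each branch feeds two opposed diodes; the branches are subtracted.
        out.append(diode(a) + diode(-a) - diode(b) - diode(-b))
    return out


def resample(samples, factor):
    """Linear-interpolation resampling. Played back at the original sample
    rate, factor > 1 raises pitch and speed together; factor < 1 lowers both."""
    if not samples:
        return []
    n_out = max(1, int(len(samples) / factor))
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        pos = i * factor
        lo = min(int(pos), last)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append((1.0 - frac) * samples[lo] + frac * samples[hi])
    return out


if __name__ == "__main__":
    rate = 16000
    # Stand-in for decoded TTS speech: one second of a 220 Hz tone.
    voice = [0.8 * math.sin(2.0 * math.pi * 220.0 * n / rate) for n in range(rate)]
    machine = ring_modulate(voice, rate)
    robotic = resample(machine, 1.25)
    print(len(voice), len(machine), len(robotic))
```

Resampling changes pitch and speed together. Adjusting them independently, if that is what the paper does, would need a time-stretching method that this sketch does not cover.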
Pages: 422-425
Number of pages: 4