Development of robotic voice conversion for RIBO using text-to-speech synthesis

被引:0
|
作者
Hossain, Md. Jakir [1 ]
Al Amin, Sayed Mahmud [1 ]
Islam, Md. Saiful [1 ]
Marium-E-Jannat [1 ]
机构
[1] Shahjalal Univ Sci & Technol Sylhet, Dept Comp Sci & Engn, Sylhet, Bangladesh
关键词
TTS; RIBO; Diode; Ring modulator; VCA; Transformer; RF;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
RIBO is the first social interaction robot in Bangladesh. This robot is designed and developed by 'ROBO SUST' team of Shahjalal University of Science and Technology. RIBO is able to hands and eyes ups and downs, can walk very slowly and can speak some Bengali recorded sentences. Now the 'ROBO SUST' team is trying to develop the RIBO so that it can communicate with human. One of the parts to communicate with human is convert bengali text to bengali speech in robotic voice. In this article, we propose a method which will convert bengali text to speech in robotic voice using google text to speech system and ring modulator. There are existed some text to speech synthesizer system which can convert bengali text to bengali speech. Among these TTS synthesizer system google TTS system for bengali is better. Hence, we use google text to speech system to produce bengali speech from any bengali written text. Google TTS synthesizer system produces speech as audio object file which can be converted to .mp3 file. Then we modify this .mp3 file using the characteristics of diode and ring modulator concept to get machine voice. After changing pitch and speed of this machine voice we get our final robotic voice which will be used in RIBO as his voice.
引用
收藏
页码:422 / 425
页数:4
相关论文
共 50 条
  • [31] Multilingual text analysis for text-to-speech synthesis
    Bell Lab, Murray Hill, United States
    International Conference on Spoken Language Processing, ICSLP, Proceedings, 1996, 3 : 1365 - 1368
  • [32] Multilingual text analysis for text-to-speech synthesis
    Sproat, R
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1365 - 1368
  • [33] ERROR DETECTION OF GRAPHEME-TO-PHONEME CONVERSION IN TEXT-TO-SPEECH SYNTHESIS USING SPEECH SIGNAL AND LEXICAL CONTEXT
    Vythelingum, Kevin
    Esteve, Yannick
    Rosec, Olivier
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 692 - 697
  • [34] TEXT-TO-SPEECH CONVERSION ON A PERSONAL-COMPUTER
    ELIMAM, YA
    BANAT, K
    IEEE MICRO, 1990, 10 (04) : 62 - 74
  • [35] ACCENTUATION RULES FOR JAPANESE TEXT-TO-SPEECH CONVERSION
    SAGISAKA, Y
    SATO, H
    REVIEW OF THE ELECTRICAL COMMUNICATIONS LABORATORIES, 1984, 32 (02): : 188 - 199
  • [36] Text-to-speech conversion system for Brazilian Portuguese using a formant-based synthesis technique
    Gomes, LDT
    Nagle, EJ
    Chiquito, JG
    ITS '98 PROCEEDINGS - SBT/IEEE INTERNATIONAL TELECOMMUNICATIONS SYMPOSIUM, VOLS 1 AND 2, 1998, : 219 - 224
  • [37] Voice Builder: A Tool for Building Text-To-Speech Voices
    De Silva, Pasindu
    Wattanavekin, Theeraphol
    Hao, Tang
    Pipatsrisawat, Knot
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 2241 - 2245
  • [38] Development of an automatic phonetization system for Arabic text-to-speech synthesis
    Imedjdouben, Faycal
    Houacine, Amrane
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (04) : 417 - 426
  • [39] MULTI VOICE TEXT TO SPEECH SYNTHESIS BASED ON THE INSTANTANEOUS PARAMETRIC VOICE CONVERSION
    Azarov, Elias
    Petrovsky, Alexander
    Zubrycki, Piotr
    SPA 2010: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2010, : 78 - 82
  • [40] BOOTSTRAPPING NON-PARALLEL VOICE CONVERSION FROM SPEAKER-ADAPTIVE TEXT-TO-SPEECH
    Luong, Hieu-Thi
    Yamagishi, Junichi
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 200 - 207