Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam Language

Cited by: 0
Authors
Jasir, M. P. [1 ]
Balakrishnan, Kannan [1 ]
Affiliations
[1] Cochin Univ Sci & Technol, Dept Comp Applicat, Artificial Intelligence Res Lab, Kochi 682022, Kerala, India
Keywords
Text to speech synthesis; TTS literature review; Indian language TTS; Malayalam TTS; INDIAN LANGUAGES; SYNTHESIS SYSTEM; NEURAL-NETWORKS; DURATION; ENGLISH; NORMALIZATION; CONSONANTS; TRANSFORMATION; EXTRACTION; CONVERSION;
DOI
10.1145/3501397
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Text-to-Speech Synthesis (TTS) is an active area of research that aims to generate synthetic speech from underlying text. The identified syllables are uttered with proper duration and prosody characteristics to emulate natural speech. TTS falls under the category of Natural Language Processing (NLP), which aims to bridge the communication gap between humans and machines. As far as Western languages like English are concerned, research to produce intelligent and natural synthetic speech has advanced considerably. But in a multilingual state like India, many regional languages, such as Malayalam, remain underexplored when it comes to NLP. In this article, we bring together the major research works in TTS for English and the prominent Indian languages, with a special emphasis on the South Indian language Malayalam. This review intends to provide the right direction for TTS research activities in the language.
Pages: 56
Related papers
50 items in total
  • [21] An Improved Syllabification for a Better Malay Language Text-to-Speech Synthesis (TTS)
    Ramlia, Izzad
    Jamil, Nursuriati
    Seman, Noraini
    Ardi, Norizah
    2015 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTICS AND INTELLIGENT SENSORS (IEEE IRIS2015), 2015, 76 : 417 - 424
  • [22] Implementation of a Text-to-Speech System for Kurdish Language
    Daneshfar, Fatemeh
    Barkhoda, Wafa
    Azami, Bahram Zahir
    ICDT: 2009 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL TELECOMMUNICATIONS, 2009, : 117 - 120
  • [23] Text-To-Speech technology for Arabic language learners
    Oumaima, Zine
    Abdelouafi, Meziane
    Meryem, El Hadi
    2018 IEEE 5TH INTERNATIONAL CONGRESS ON INFORMATION SCIENCE AND TECHNOLOGY (IEEE CIST'18), 2018, : 432 - 436
  • [24] A Prosodic Text-to-Speech System for Yoruba Language
    Akinwonmi, Akintoba Emmanuel
    Alese, Boniface Kayode
    2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 630 - 635
  • [25] A Complete Croatian Language Text-to-Speech System
    Krekovic, Gordan
    Prenner, Vladimir
    PROCEEDINGS ELMAR-2010, 2010, : 351 - 354
  • [26] REVIEW OF TEXT-TO-SPEECH CONVERSION FOR ENGLISH
    HERTZ, SR
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (03): : 1097 - 1099
  • [27] REVIEW OF TEXT-TO-SPEECH CONVERSION FOR ENGLISH
    KLATT, DH
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 82 (03): : 737 - 793
  • [28] A hybrid model for text-to-speech synthesis
    Violaro, F
    Boeffard, O
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (05): : 426 - 434
  • [29] Environment Aware Text-to-Speech Synthesis
    Tan, Daxin
    Zhang, Guangyan
    Lee, Tan
    INTERSPEECH 2022, 2022, : 481 - 485
  • [30] Duration Modeling for Text to Speech Synthesis System using Festival Speech Engine Developed for Malayalam Language
    Rajan, Bindhu K.
    Rijoy, V
    Gopinath, Deepa P.
    George, Nimmy
    2015 INTERNATIONAL CONFERENCE ON CIRCUITS, POWER AND COMPUTING TECHNOLOGIES (ICCPCT-2015), 2015,