Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam Language

被引:0
|
作者
Jasir, M. P. [1 ]
Balakrishnan, Kannan [1 ]
机构
[1] Cochin Univ Sci & Technol, Dept Comp Applicat, Artificial Intelligence Res Lab, Kochi 682022, Kerala, India
关键词
Text to speech synthesis; TTS literature review; Indian language TTS; Malayalam TTS; INDIAN LANGUAGES; SYNTHESIS SYSTEM; NEURAL-NETWORKS; DURATION; ENGLISH; NORMALIZATION; CONSONANTS; TRANSFORMATION; EXTRACTION; CONVERSION;
D O I
10.1145/3501397
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text-to-Speech Synthesis (TTS) is an active area of research to generate synthetic speech from underlying text. The identified syllables are uttered with proper duration and prosody characteristics to emulate natural speech. It falls under the category of Natural Language Processing (NLP), which aims to bridge the gap in communication between human and machine. So far as Western languages like English are concerned, the research to produce intelligent and natural synthetic speech has advanced considerably. But in a multilingual state like India, many regional languages viz. Malayalam is underexplored when it comes to NLP. In this article, we try to amalgamate the major research works performed in the area of TTS in English and the prominent Indian languages, with a special emphasis on the South Indian language, Malayalam. This review intends to provide right direction to the research activities in the language, in the area of TTS.
引用
收藏
页数:56
相关论文
共 50 条
  • [31] Text-to-speech synthesis integrated circuit
    Baskaya, IF
    Aktan, O
    Dündar, G
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 653 - 656
  • [32] PHONETIC KNOWLEDGE IN TEXT-TO-SPEECH SYNTHESIS
    van Santen, Jan P. H.
    INTEGRATION OF PHONETIC KNOWLEDGE IN SPEECH TECHNOLOGY, 2005, 25 : 149 - 166
  • [33] CLUSTERING OF DURATION PATTERNS IN SPEECH FOR TEXT-TO-SPEECH SYNTHESIS
    Sreelekshmi, K. S.
    Gopinath, Deepa P.
    2012 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2012, : 1122 - 1127
  • [34] A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems
    Panda, Soumya Priyadarsini
    Nayak, Ajit Kumar
    INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, 2015, 309 : 523 - 531
  • [35] Slovenian Text-to-Speech Synthesis for Speech User Interfaces
    Gros, Jerneja Zganec
    Mihelic, Ales
    Pavesic, Nikola
    Zganec, Mario
    Gruden, Stanislav
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 5, 2005, 5 : 216 - 220
  • [36] Emotional Intelligence in Text-To-Speech Synthesis in Pali Language Using Fuzzy Logic
    Mache, Suhas
    Dabhade, Siddharth
    JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2024, 6 (03): : 179 - 192
  • [37] Text to Speech Synthesis System for English to Malayalam Translation
    Anto, Ancy
    Nisha, K. K.
    IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGICAL TRENDS IN COMPUTING, COMMUNICATIONS AND ELECTRICAL ENGINEERING (ICETT), 2016,
  • [38] Is text-to-speech synthesis ready for use in computer-assisted language learning?
    Handley, Zoee
    SPEECH COMMUNICATION, 2009, 51 (10) : 906 - 919
  • [39] MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language
    Mishev, Kostadin
    Karovska Ristovska, Aleksandra
    Trajanov, Dimitar
    Eftimov, Tome
    Simjanoska, Monika
    APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 14
  • [40] An Approach to Building Language-Independent Text-to-Speech Synthesis for Indian Languages
    Prakash, Anusha
    Reddy, M. Ramasubba
    Nagarajan, T.
    Murthy, Hema A.
    2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,