Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam Language

被引：0

作者：

Jasir, M. P. ^{[1
]}

Balakrishnan, Kannan ^{[1
]}

机构：

[1] Cochin Univ Sci & Technol, Dept Comp Applicat, Artificial Intelligence Res Lab, Kochi 682022, Kerala, India

来源：

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2022年 / 21卷 / 04期

关键词：

Text to speech synthesis; TTS literature review; Indian language TTS; Malayalam TTS; INDIAN LANGUAGES; SYNTHESIS SYSTEM; NEURAL-NETWORKS; DURATION; ENGLISH; NORMALIZATION; CONSONANTS; TRANSFORMATION; EXTRACTION; CONVERSION;

D O I：

10.1145/3501397

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Text-to-Speech Synthesis (TTS) is an active area of research to generate synthetic speech from underlying text. The identified syllables are uttered with proper duration and prosody characteristics to emulate natural speech. It falls under the category of Natural Language Processing (NLP), which aims to bridge the gap in communication between human and machine. So far as Western languages like English are concerned, the research to produce intelligent and natural synthetic speech has advanced considerably. But in a multilingual state like India, many regional languages viz. Malayalam is underexplored when it comes to NLP. In this article, we try to amalgamate the major research works performed in the area of TTS in English and the prominent Indian languages, with a special emphasis on the South Indian language, Malayalam. This review intends to provide right direction to the research activities in the language, in the area of TTS.

引用

页数：56

共 50 条

[31] Text-to-speech synthesis integrated circuit
Baskaya, IF
Aktan, O
Dündar, G
PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 653 - 656
[32] PHONETIC KNOWLEDGE IN TEXT-TO-SPEECH SYNTHESIS
van Santen, Jan P. H.
INTEGRATION OF PHONETIC KNOWLEDGE IN SPEECH TECHNOLOGY, 2005, 25 : 149 - 166
[33] CLUSTERING OF DURATION PATTERNS IN SPEECH FOR TEXT-TO-SPEECH SYNTHESIS
Sreelekshmi, K. S.
Gopinath, Deepa P.
2012 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2012, : 1122 - 1127
[34] A Rule-Based Concatenative Approach to Speech Synthesis in Indian Language Text-to-Speech Systems
Panda, Soumya Priyadarsini
Nayak, Ajit Kumar
INTELLIGENT COMPUTING, COMMUNICATION AND DEVICES, 2015, 309 : 523 - 531
[35] Slovenian Text-to-Speech Synthesis for Speech User Interfaces
Gros, Jerneja Zganec
Mihelic, Ales
Pavesic, Nikola
Zganec, Mario
Gruden, Stanislav
PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 5, 2005, 5 : 216 - 220
[36] Emotional Intelligence in Text-To-Speech Synthesis in Pali Language Using Fuzzy Logic
Mache, Suhas
Dabhade, Siddharth
JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2024, 6 (03): : 179 - 192
[37] Text to Speech Synthesis System for English to Malayalam Translation
Anto, Ancy
Nisha, K. K.
IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGICAL TRENDS IN COMPUTING, COMMUNICATIONS AND ELECTRICAL ENGINEERING (ICETT), 2016,
[38] Is text-to-speech synthesis ready for use in computer-assisted language learning?
Handley, Zoee
SPEECH COMMUNICATION, 2009, 51 (10) : 906 - 919
[39] MAKEDONKA: Applied Deep Learning Model for Text-to-Speech Synthesis in Macedonian Language
Mishev, Kostadin
Karovska Ristovska, Aleksandra
Trajanov, Dimitar
Eftimov, Tome
Simjanoska, Monika
APPLIED SCIENCES-BASEL, 2020, 10 (19): : 1 - 14
[40] An Approach to Building Language-Independent Text-to-Speech Synthesis for Indian Languages
Prakash, Anusha
Reddy, M. Ramasubba
Nagarajan, T.
Murthy, Hema A.
2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,

← 1 2 3 4 5 →