Development of multi-lingual speech recognition and text-to speech synthesis for automotive applications

被引:0
|
作者
Deguchi, Y. [1 ]
Kagoshima, T. [1 ]
Hirabayashi, G. [1 ]
Kanazawa, H. [1 ]
Hogenhout, M. [2 ]
机构
[1] Research and Development Center, Toshiba Corporation, Kawasaki-shi, Japan
[2] Cambridge Research Laboratory, Toshiba Research Europe Limited, Cambridge, United Kingdom
来源
VDI Berichte | 2003年 / 1789期
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Speech recognition
引用
收藏
页码:3081 / 3088
相关论文
共 50 条
  • [31] MULTI-LINGUAL SPEECH RECOGNITION WITH LOW-RANK MULTI-TASK DEEP NEURAL NETWORKS
    Mohan, Aanchan
    Rose, Richard
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4994 - 4998
  • [32] A comprehensive review on detection of hate speech for multi-lingual data
    Narula, Rachna
    Chaudhary, Poonam
    SOCIAL NETWORK ANALYSIS AND MINING, 2025, 14 (01)
  • [33] Automatic learning of numeral grammars for multi-lingual speech synthesizers
    Flach, G
    Holzapfel, M
    Just, C
    Wachtler, A
    Wolff, M
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1291 - 1294
  • [34] Design and Development of a Multi-lingual Speech Corpora (TaMaR-EmoDB) for Emotion Analysis
    Rajan, Rajeev
    Haritha, U. G.
    Sujitha, A. C.
    Rejisha, T. M.
    INTERSPEECH 2019, 2019, : 3267 - 3271
  • [35] CATALIST: CAmera TrAnsformations for Multi-LIngual Scene Text Recognition
    Sood, Shivam
    Saluja, Rohit
    Ramakrishnan, Ganesh
    Chaudhuri, Parag
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021 WORKSHOPS, PT I, 2021, 12916 : 213 - 228
  • [36] Multi-Lingual Depression-Level Assessment from Conversational Speech Using Acoustic and Text Features
    Ozkanca, Yasin
    Demiroglu, Cenk
    Besirli, Ash
    Celik, Selime
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3398 - 3402
  • [37] Optimal trained ensemble of classification model for speech emotion recognition: Considering cross-lingual and multi-lingual scenarios
    Rupali Ramdas Kawade
    Sonal K. Jagtap
    Multimedia Tools and Applications, 2024, 83 : 54331 - 54365
  • [38] Optimal trained ensemble of classification model for speech emotion recognition: Considering cross-lingual and multi-lingual scenarios
    Kawade, Rupali Ramdas
    Jagtap, Sonal K.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54331 - 54365
  • [39] MULTI-LINGUAL MULTI-TASK SPEECH EMOTION RECOGNITION USING WAV2VEC 2.0
    Sharma, Mayank
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6907 - 6911
  • [40] Mono- and multi-lingual depression prediction based on speech processing
    Kiss G.
    Vicsi K.
    International Journal of Speech Technology, 2017, 20 (04) : 919 - 935