Advanced Rich Transcription System for Estonian Speech

被引:23
|
作者
Alumae, Tanel [1 ]
Tilk, Ottokar [1 ]
Asadullah [1 ]
机构
[1] Tallinn Univ Technol, Lab Language Technol, Tallinn, Estonia
关键词
Speech recognition; Estonian; punctuation recovery; speaker identification;
D O I
10.3233/978-1-61499-912-6-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes the current TTU speech transcription system for Estonian speech. The system is designed to handle semi-spontaneous speech, such as broadcast conversations, lecture recordings and interviews recorded in diverse acoustic conditions. The system is based on the Kaldi toolkit. Multi-condition training using background noise profiles extracted automatically from untranscribed data is used to improve the robustness of the system. Out-of-vocabulary words are recovered using a phoneme n-gram based decoding subgraph and a FST-based phoneme-to-grapheme model. The system achieves a word error rate of 8.1% on a test set of broadcast conversations. The system also performs punctuation recovery and speaker identification. Speaker identification models are trained using a recently proposed weakly supervised training method.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [31] Estonian: Some Findings for Modelling Speech Rhythmicity and Perception of Speech Rate
    Kalvik, Mari-Liis
    Mihkla, Meelis
    Kiissel, Indrek
    Hein, Indrek
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 314 - 321
  • [32] Advanced Speech Communication System for Deaf People
    San-Segundo, R.
    Lopez, V.
    Martin, R.
    Lufti, S.
    Ferreiros, J.
    Cordoba, R.
    Pardo, J. M.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 250 - 253
  • [33] Paradigmatic and Syntagmatic Effects in Estonian Spontaneous Speech
    Loo, Kaidi
    Tomaschek, Fabian
    Lippus, Partel
    Tucker, Benjamin, V
    LANGUAGE AND SPEECH, 2023, 66 (02) : 474 - 499
  • [34] The voice system of Estonian
    Torn-Leesik, Reeli
    STUF-LANGUAGE TYPOLOGY AND UNIVERSALS, 2009, 62 (1-2) : 72 - 90
  • [35] Limited-vocabulary Estonian continuous speech recognition system using hidden Markov models
    Alumäe, T
    Vohandu, L
    INFORMATICA, 2004, 15 (03) : 303 - 314
  • [36] CRIM'S FRENCH SPEECH TRANSCRIPTION SYSTEM FOR ETAPE 2011
    Gupta, Vishwa
    Boulianne, Gilles
    Osterrath, Frederic
    Ouellet, Pierre
    2013 8TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNAL PROCESSING AND THEIR APPLICATIONS (WOSSPA), 2013, : 351 - 356
  • [37] The IBM 2007 speech transcription system for European parliamentary speeches
    Ramabhadran, Bhuvana
    Siohan, Olivier
    Sethy, Abhinav
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 472 - +
  • [38] The RACAI Speech Translation System Challenges of morphologically rich languages
    Tufis, Dan
    Boros, Tiberiu
    Dumitrescu, Stefan Daniel
    2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN - COMPUTER DIALOGUE (SPED), 2013,
  • [39] System for speech transcription and post-editing in Microsoft Word
    Salimbajevs, Askars
    Ikauniece, Indra
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 825 - 826
  • [40] PROFOUNDLY DEAF BUSINESSMANS VIEWS ON THE PALANTYPE SPEECH TRANSCRIPTION SYSTEM
    HAYWARD, G
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1979, 11 (06): : 711 - 715