Advanced Rich Transcription System for Estonian Speech

被引：23

作者：

Alumae, Tanel ^{[1
]}

Tilk, Ottokar ^{[1
]}

Asadullah ^{[1
]}

机构：

[1] Tallinn Univ Technol, Lab Language Technol, Tallinn, Estonia

来源：

HUMAN LANGUAGE TECHNOLOGIES - THE BALTIC PERSPECTIVE, BALTIC HLT 2018 | 2018年 / 307卷

关键词：

Speech recognition; Estonian; punctuation recovery; speaker identification;

D O I：

10.3233/978-1-61499-912-6-1

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper describes the current TTU speech transcription system for Estonian speech. The system is designed to handle semi-spontaneous speech, such as broadcast conversations, lecture recordings and interviews recorded in diverse acoustic conditions. The system is based on the Kaldi toolkit. Multi-condition training using background noise profiles extracted automatically from untranscribed data is used to improve the robustness of the system. Out-of-vocabulary words are recovered using a phoneme n-gram based decoding subgraph and a FST-based phoneme-to-grapheme model. The system achieves a word error rate of 8.1% on a test set of broadcast conversations. The system also performs punctuation recovery and speaker identification. Speaker identification models are trained using a recently proposed weakly supervised training method.

引用

页码：1 / 8

页数：8

共 50 条

[31] Estonian: Some Findings for Modelling Speech Rhythmicity and Perception of Speech Rate
Kalvik, Mari-Liis
Mihkla, Meelis
Kiissel, Indrek
Hein, Indrek
TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 314 - 321
[32] Advanced Speech Communication System for Deaf People
San-Segundo, R.
Lopez, V.
Martin, R.
Lufti, S.
Ferreiros, J.
Cordoba, R.
Pardo, J. M.
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 250 - 253
[33] Paradigmatic and Syntagmatic Effects in Estonian Spontaneous Speech
Loo, Kaidi
Tomaschek, Fabian
Lippus, Partel
Tucker, Benjamin, V
LANGUAGE AND SPEECH, 2023, 66 (02) : 474 - 499
[34] The voice system of Estonian
Torn-Leesik, Reeli
STUF-LANGUAGE TYPOLOGY AND UNIVERSALS, 2009, 62 (1-2) : 72 - 90
[35] Limited-vocabulary Estonian continuous speech recognition system using hidden Markov models
Alumäe, T
Vohandu, L
INFORMATICA, 2004, 15 (03) : 303 - 314
[36] CRIM'S FRENCH SPEECH TRANSCRIPTION SYSTEM FOR ETAPE 2011
Gupta, Vishwa
Boulianne, Gilles
Osterrath, Frederic
Ouellet, Pierre
2013 8TH INTERNATIONAL WORKSHOP ON SYSTEMS, SIGNAL PROCESSING AND THEIR APPLICATIONS (WOSSPA), 2013, : 351 - 356
[37] The IBM 2007 speech transcription system for European parliamentary speeches
Ramabhadran, Bhuvana
Siohan, Olivier
Sethy, Abhinav
2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 472 - +
[38] The RACAI Speech Translation System Challenges of morphologically rich languages
Tufis, Dan
Boros, Tiberiu
Dumitrescu, Stefan Daniel
2013 7TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN - COMPUTER DIALOGUE (SPED), 2013,
[39] System for speech transcription and post-editing in Microsoft Word
Salimbajevs, Askars
Ikauniece, Indra
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 825 - 826
[40] PROFOUNDLY DEAF BUSINESSMANS VIEWS ON THE PALANTYPE SPEECH TRANSCRIPTION SYSTEM
HAYWARD, G
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1979, 11 (06): : 711 - 715

← 1 2 3 4 5 →