Transcription System for Semi-Spontaneous Estonian Speech

Cited by: 2

Author:

Alumaee, Tanel [1]

Affiliation:

[1] Tallinn Univ Technol, Inst Cybernet, EE-19086 Tallinn, Estonia

Source:

HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE | 2012 / Vol. 247

Keywords:

Estonian; speech recognition; compound words; RECOGNITION;

DOI:

10.3233/978-1-61499-133-5-10

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper describes a speech-to-text system for semi-spontaneous Estonian speech. The system is trained on about 100 hours of manually transcribed speech and a 300M-word text corpus. Compound words are split before building the language model and reconstructed from the recognizer output using a hidden-event N-gram model. We use a three-pass transcription strategy with unsupervised speaker adaptation between individual passes. The system achieves a word error rate of 34.6% on conference speeches and 25.6% on radio talk shows.


Pages: 10-17

Page count: 8

