Transcription System for Semi-Spontaneous Estonian Speech

被引:2
|
作者
Alumaee, Tanel [1 ]
机构
[1] Tallinn Univ Technol, Inst Cybernet, EE-19086 Tallinn, Estonia
关键词
Estonian; speech recognition; compound words; RECOGNITION;
D O I
10.3233/978-1-61499-133-5-10
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a speech-to-text system for semi-spontaneous Estonian speech. The system is trained on about 100 hours of manually transcribed speech and a 300M word text corpus. Compound words are split before building the language model and reconstructed from recognizer output using a hidden event N-gram model. We use a three pass transcription strategy with unsupervised speaker adaptation between individual passes. The system achieves a word error rate of 34.6% on conference speeches and 25.6% on radio talk shows.
引用
收藏
页码:10 / 17
页数:8
相关论文
共 50 条
  • [11] Semi-spontaneous oral text production: Measurements in clinical practice
    Lind, Marianne
    Kristoffersen, Kristian Emil
    Moen, Inger
    Simonsen, Hanne Gram
    CLINICAL LINGUISTICS & PHONETICS, 2009, 23 (12) : 872 - 886
  • [12] Some acoustic indicators of verbal irony in semi-spontaneous contexts
    Hernandez, Diana Martinez
    LOQUENS, 2021, 8 (1-2):
  • [13] A Sociolinguistic Examination of the Dual Usted in Medellin, Colombia: Evidence from Semi-spontaneous Speech and Implicit Language Attitudes
    Denbaum-Restrepo, Nofiya
    Restrepo-Ramos, Falcon
    HISPANIA-A JOURNAL DEVOTED TO THE TEACHING OF SPANISH AND PORTUGUESE, 2024, 107 (01): : 87 - 105
  • [14] Estonian Speech Recognition and Transcription Editing Service
    Olev, Aivo
    Alumae, Tanel
    BALTIC JOURNAL OF MODERN COMPUTING, 2022, 10 (03): : 409 - 421
  • [15] Open source platform for Estonian speech transcription
    Olev, Aivo
    Alumae, Tanel
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [16] Paradigmatic and Syntagmatic Effects in Estonian Spontaneous Speech
    Loo, Kaidi
    Tomaschek, Fabian
    Lippus, Partel
    Tucker, Benjamin, V
    LANGUAGE AND SPEECH, 2023, 66 (02) : 474 - 499
  • [17] Intelligent transcription system based on spontaneous speech processing
    Kawahara, Tatsuya
    ICKS 2007: SECOND INTERNATIONAL CONFERENCE ON INFORMATICS RESEARCH FOR DEVELOPMENT OF KNOWLEDGE SOCIETY INFRASTRUCTURE, PROCEEDINGS, 2007, : 19 - 26
  • [18] Semi-spontaneous temporal evolution of relief/fluorescence hybrid gratings for holographic encryption
    Tao, Yuxin
    Liu, Hongfang
    Wang, Xiuli
    Liu, Zhong
    Li, Xin
    Miao, Jingying
    Fu, Shencheng
    Zhang, Xintong
    OPTICS LETTERS, 2023, 48 (23) : 6308 - 6311
  • [19] The spontaneous speech: transcription and treatment
    Bazillon, Thierry
    Jousse, Vincent
    Bechet, Frederic
    Esteve, Yannick
    Linares, Georges
    Luzzati, Daniel
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2008, 49 (03): : 47 - 76
  • [20] Enhancement of catansome formation by means of cosolvent effect: Semi-spontaneous preparation method
    Wu, Kuo-Chang
    Huang, Zheng-Lin
    Yang, Yu-Min
    Chang, Chien-Hsiang
    Chou, Tzung-Han
    COLLOIDS AND SURFACES A-PHYSICOCHEMICAL AND ENGINEERING ASPECTS, 2007, 302 (1-3) : 599 - 607