Open source platform for Estonian speech transcription

Authors
Olev, Aivo [1]
Alumae, Tanel [1]
Affiliations
[1] Tallinn Univ Technol, Tallinn, Estonia
Keywords
Automatic speech recognition; Speaker identification; NLP platform; Workflow management system; Transcription editing; Reproducibility
DOI
10.1007/s10579-024-09777-1
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This paper presents our progress in developing and maintaining a public speech and speaker recognition platform for the Estonian language. The platform consists of a speech processing pipeline and a web-based user interface for end-users, offering transcript post-editing functionality. It is offered for free as a public service and is in active use. The service provides significantly higher speech recognition accuracy than commercial alternatives. We discuss the switch to a workflow management system and how it has improved the core speech processing pipeline. The core systems behind the platform have been made available as open-source code and deployed internally by multiple public and private institutions.
Pages: 18