Full-duplex Speech-to-text System for Estonian

被引:7
|
作者
Alumaee, Tanel [1 ]
机构
[1] Tallinn Univ Technol, Inst Cybernet, EE-19086 Tallinn, Estonia
关键词
Speech recognition; Estonian; radiology; client-server; open source;
D O I
10.3233/978-1-61499-442-8-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper describes a distributed online speech-to-text system. The main features of the system are real-time speech recognition and full-duplex user experience, meaning that the partially recognized utterance is progressively displayed to the user during speaking. Other benefits include easy client-server communication protocol and system scalability to many concurrent user sessions. The paper also describes two Estonian speech-to-text applications based on the developed framework: a general-domain dictation application with an estimated word error rate of 26.4% and a radiology report dictation system with a word error rate of 13.7%. The system is open-source and based on free software.
引用
收藏
页码:3 / 10
页数:8
相关论文
共 50 条
  • [1] Full-duplex speech for HF radio systems
    Serinken, N
    Gagnon, B
    Erogul, O
    SEVENTH INTERNATIONAL CONFERENCE ON HF RADIO SYSTEMS AND TECHNIQUES, 1997, (441): : 281 - 284
  • [2] Tracked Speech-To-Text Display: Enhancing Accessibility and Readability of Speech-To-Text
    Kushalnagar, Raja S.
    Behm, Gary W.
    Kelstone, Aaron W.
    Ali, Shareef S.
    ASSETS'15: PROCEEDINGS OF THE 17TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS & ACCESSIBILITY, 2015, : 223 - 230
  • [3] RAPID DEVELOPMENT OF A LATVIAN SPEECH-TO-TEXT SYSTEM
    Oparin, Ilya
    Lamel, Lori
    Gauvain, Jean-Luc
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7309 - 7313
  • [4] The 2010 CMU GALE Speech-to-Text System
    Metze, Florian
    Hsiao, Roger
    Jin, Qin
    Nallasamy, Udhyakumar
    Schultz, Tanja
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1501 - 1504
  • [5] An adaptive FIR echo canceller in analog full-duplex speech scrambling system
    Zoran, BD
    Kovacevic, BD
    Milosavljevic, MM
    Veinovic, MD
    TELSIKS 2001, VOL 1 & 2, PROCEEDINGS, 2001, : 245 - 248
  • [6] ROBUST FULL-DUPLEX ROF SYSTEM
    Thomas, D. H.
    de Faria, G. Vilela
    von der Weid, J. P.
    MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 2010, 52 (05) : 1009 - 1013
  • [7] Analyzing a Full-Duplex Cellular System
    Goyal, Sanjay
    Liu, Pei
    Hua, Sha
    Panwar, Shivendra
    2013 47TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2013,
  • [8] The Performance Analysis of Full-Duplex System
    Wu, Linjun
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ELECTROMECHANICAL CONTROL TECHNOLOGY AND TRANSPORTATION, 2015, 41 : 164 - 169
  • [9] The ISL RT-07 speech-to-text system
    Woelfel, Matthias
    Stueker, Sebastian
    Kraft, Florian
    MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 464 - 474
  • [10] Local echo canceler with optimal input for true full-duplex speech scrambling system
    Banjac, ZD
    Kovacevic, BD
    Milosavljevic, MM
    Veinovic, MD
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (08) : 1877 - 1882