Speech recognition engine for real-time broadcast news captioning

Authors
Imai, Toru [1]
Kobayashi, Akio [1]
Sato, Shoei [1]
Tanaka, Hideki [1]
Ando, Akio [1]
Affiliation
[1] Human Science Research Division
Source
NHK Laboratories Note | 2000 / Issue 464
Keywords
Broadcasting - Decision making - Decoding - Degradation - Error detection - Parameter estimation - Real time systems
DOI
Not available
Abstract
This paper describes a speech recognition engine that progressively outputs the latest available word results for real-time closed captioning of Japanese broadcast news. The search engine, called a progressive 2-pass decoder, practically eliminates the main disadvantage of conventional multiple-pass decoders, which delay their decision until the end of a sentence. During the first pass of the search, the proposed decoder periodically executes the second pass on the input received up to that time and detects part of the final word sequence. This method is not theoretically optimal, but it makes quick decisions with a negligible increase in word errors. In a recognition experiment on Japanese broadcast news, the decoder operated with an average decision delay of 554 ms per word and degraded word accuracy by only 0.22%.
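The abstract does not say how the decoder decides which words from a periodic second pass can be treated as final. As a rough illustration only, the sketch below assumes one plausible rule: a word is committed once two consecutive second-pass hypotheses agree on it as part of their common prefix. The function names (`common_prefix_len`, `progressive_commit`) and the stable-prefix rule itself are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def common_prefix_len(a: Sequence[str], b: Sequence[str]) -> int:
    """Return the number of leading words on which a and b agree."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def progressive_commit(snapshots: List[List[str]]) -> List[Tuple[int, List[str]]]:
    """Emit words early from a series of periodic second-pass hypotheses.

    snapshots[i] is the (hypothetical) best word sequence returned by the
    second pass at the i-th periodic checkpoint of the first pass.
    ASSUMPTION: a word is committed once two consecutive snapshots share it
    in their common prefix. Committed words are never revised, which is why
    a scheme like this is fast but not theoretically optimal.
    """
    committed: List[str] = []
    emitted: List[Tuple[int, List[str]]] = []
    prev = None
    for step, hyp in enumerate(snapshots):
        if prev is not None:
            stable = common_prefix_len(prev, hyp)
            if stable > len(committed):
                new_words = hyp[len(committed):stable]
                committed.extend(new_words)
                emitted.append((step, new_words))
        prev = hyp
    # At the end of the sentence, flush whatever the last hypothesis still holds.
    if snapshots:
        rest = snapshots[-1][len(committed):]
        if rest:
            emitted.append((len(snapshots), rest))
    return emitted


if __name__ == "__main__":
    demo = [
        ["the"],
        ["the", "prime"],
        ["the", "prime", "minister"],
        ["the", "prime", "minister", "said"],
    ]
    for step, words in progressive_commit(demo):
        print(step, words)
```

In this toy setup each word is emitted one checkpoint after it first appears, rather than at the end of the sentence, which mirrors the trade-off the abstract describes: a short per-word decision delay in exchange for giving up the option to revise earlier words.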
Related papers
50 in total
  • [1] REAL-TIME SPEECH RECOGNITION CAPTIONING OF EVENTS AND MEETINGS
    Boulianne, Gilles
    Boisvert, Maryse
    Osterrath, Frederic
    2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 197 - 200
  • [2] Real-time recognition of broadcast radio speech
    Cook, GD
    Christie, JD
    Clarkson, PR
    Hochberg, MM
    Logan, BT
    Robinson, AJ
    Seymour, CW
    1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 141 - 144
  • [3] A real-time Japanese broadcast news closed-captioning system
    Bell Labs - Lucent Technologies, 600 Mountain Ave, Murray Hill, NJ 07974, United States
    Unknown, 157-8510, Japan
    EUROSPEECH - SCANDINAVIA - Euro. Conf. Speech Commun. Technol., 1600, (495-498):
  • [4] Real-time captioning for live broadcast
    NHK Science and Technology Research Laboratories, Tokyo, Japan
    Kyokai Joho Imeji Zasshi, 4 : 300 - 304
  • [5] Progressive 2-pass decoder for real-time broadcast news captioning
    Imai, T
    Kobayashi, A
    Sato, S
    Tanaka, H
    Ando, A
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1559 - 1562
  • [6] Using Speech Recognition for Real-Time Captioning of Multiple Speakers
    Wald, Mike
    Bain, Keith
    IEEE MULTIMEDIA, 2008, 15 (04) : 56 - 57
  • [7] Online speech detection and dual-gender speech recognition for captioning broadcast news
    Imai, Toru
    Sato, Shoei
    Homma, Shinichi
    Onoe, Kazuo
    Kobayashi, Akio
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2007, E90D (08) : 1286 - 1291
  • [8] Online Speech Detection and Dual-Gender Speech Recognition for Captioning Broadcast News
    Imai, Toru
    Sato, Shoei
    Kobayashi, Akio
    Onoe, Kazuo
    Homma, Shinichi
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1602 - 1605
  • [9] New real-time closed-captioning system for Japanese broadcast news programs
    Homma, Shinichi
    Kobayashi, Akio
    Oku, Takahiro
    Sato, Shoei
    Imai, Toru
    Takagi, Tohru
    COMPUTERS HELPING PEOPLE WITH SPECIAL NEEDS, PROCEEDINGS, 2008, 5105 : 651 - 654
  • [10] Using Speech Recognition for Real-Time Captioning and Lecture Transcription in the Classroom
    Ranchal, Rohit
    Taber-Doughty, Teresa
    Guo, Yiren
    Bain, Keith
    Martin, Heather
    Robinson, J. Paul
    Duerstock, Bradley S.
    IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2013, 6 (04) : 299 - 311