THE IBM 2008 GALE ARABIC SPEECH TRANSCRIPTION SYSTEM

被引:9
|
作者
Saon, George [1 ]
Soltau, Hagen [1 ]
Chaudhari, Upendra [1 ]
Chu, Stephen [1 ]
Kingsbury, Brian [1 ]
Kuo, Hong-Kwang [1 ]
Mangu, Lidia [1 ]
Povey, Daniel [2 ]
机构
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Microsoft Res, Redmond, WA USA
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Speech recognition;
D O I
10.1109/ICASSP.2010.5495640
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes the Arabic broadcast transcription system fielded by IBM in the GALE Phase 3.5 machine translation evaluation. Key advances compared to our Phase 2.5 system include improved discriminative training, the use of Subspace Gaussian Mixture Models (SGMM), neural network acoustic features, variable frame rate decoding, training data partitioning experiments, unpruned n-gram language models and neural network language models. These advances were instrumental in achieving a word error rate of 8.9% on the evaluation test set.
引用
收藏
页码:4378 / 4381
页数:4
相关论文
共 50 条
  • [31] Arabic Speech Recognition System based on CMUSphinx
    Satori, H.
    Harti, M.
    Chenfour, N.
    ISCIII '07: 3RD INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, PROCEEDINGS, 2007, : 31 - +
  • [32] AN UNRESTRICTED VOCABULARY ARABIC SPEECH SYNTHESIS SYSTEM
    ELIMAM, YA
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (12): : 1829 - 1845
  • [33] TOWARD AN ARABIC TEXT-TO-SPEECH SYSTEM
    AHMED, ME
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1991, 16 (04): : 565 - 583
  • [34] Arabic Speech Synthesis System Based on HMM
    Amrouche, Aissa
    Abed, Ahcene
    Falek, Leila
    2019 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2019), 2019, : 73 - 78
  • [35] A MANUAL SYSTEM TO SEGMENT AND TRANSCRIBE ARABIC SPEECH
    Alghamdi, M.
    El Hadj, Y. O. Mohamed
    Alkanhal, M.
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 233 - +
  • [36] An automatic transcription system for arabic numerals in Korean
    Yoon, A
    Kwon, HC
    Lee, MH
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 221 - 226
  • [37] The IBM rich transcription 2007 speech-to-text systems for lecture meetings
    Huang, Jing
    Marcheret, Etienne
    Visweswariah, Karthik
    Libal, Vit
    Potamianos, Gerasimos
    MULTIMODAL TECHNOLOGIES FOR PERCEPTION OF HUMANS, 2008, 4625 : 429 - 441
  • [38] Recent improvements to the IBM trainable speech synthesis system
    Eide, E
    Aaron, A
    Bakis, R
    Cohen, P
    Donovan, R
    Hamza, W
    Mathes, T
    Picheny, M
    Polkosky, M
    Smith, M
    Viswanathan, M
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 708 - 711
  • [39] A mandarin lecture speech transcription system for speech summarization
    Chan, Ho Yin
    Zhang, Justin Jian
    Fung, Pascale
    Cao, Lu
    2007 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, VOLS 1 AND 2, 2007, : 467 - 471
  • [40] The IBM 2004 conversational telephony system for rich transcription
    Soltau, H
    Kingsbury, B
    Mangu, L
    Povey, D
    Saon, G
    Zweig, G
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 205 - 208