The IBM 2006 Speech Transcription System for European Parliamentary Speeches

Authors
Ramabhadran, B. [1]
Siohan, O. [1]
Mangu, L. [1]
Zweig, G. [1]
Westphal, M. [2]
Schulz, H. [2]
Soneiro, A. [2]
Affiliations
[1] IBM TJ Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] IBM Germany, EMEA Voice Technol Dev, Munich, Germany
Source
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006
Keywords
speech recognition; automatic segmentation; cross-adaptation; randomized decision trees; TC-STAR
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
TC-STAR is a European Union-funded speech-to-speech translation project to transcribe, translate, and synthesize European Parliamentary Plenary Speeches (EPPS). This paper describes IBM's English and Spanish speech recognition systems submitted to the TC-STAR 2006 Evaluation. The technical advances in this submission include two different algorithms for automatic segmentation and speaker clustering of the input audio; a system architecture based on cross-adaptation across these two segmentation schemes and on system combination through an ensemble of systems generated with randomized decision tree state-tying; automatic punctuation of the speech recognition output; and the incorporation of an additional 35 hours of in-domain EPPS acoustic training data. On the 2006 English development test set, these advances reduced the error rate by 30% relative to the best-performing system in the TC-STAR 2005 Evaluation, and they produced one of the best-performing systems in the 2006 English evaluation, with a word error rate of 8.3%.
Pages: 1225 / +
Page count: 2