The 2005 AMI system for the transcription of speech in meetings

被引:0
|
作者
Hain, T [1 ]
Burget, L
Dines, J
Garau, G
Karafiat, M
Lincoln, M
McCowan, I
Moore, D
Wan, V
Ordelman, R
Renals, S
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
[2] Brno Univ Technol, Fac Informat Engn, Brno 61266, Czech Republic
[3] IDIAP Res Inst, CH-1920 Martigny, Switzerland
[4] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland
[5] Univ Twente, Dept Elect Engn, NL-7500 AE Enschede, Netherlands
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we describe the 2005 AMI system for the transcription of speech in meetings used in the 2005 NIST RT evaluations. The system was designed for participation in the speech to text part of the evaluations, in particular for transcription of speech recorded with multiple distant microphones and independent headset microphones. System performance was tested on both conference room and lecture style meetings. Although input sources are processed using different front-ends, the recognition process is based on a unified system architecture. The system operates in multiple passes and makes use of state of the art technologies such as discriminative training, vocal tract length normalisation, heteroscedastic linear discriminant analysis, speaker adaptation with maximum likelihood linear regression and minimum word error rate decoding. In this paper we describe the system performance on the official development and test sets for the NIST RT05s evaluations. The system was jointly developed in less than 10 months by a multi-site team and was shown to achieve competitive performance.
引用
收藏
页码:450 / 462
页数:13
相关论文
共 50 条
  • [41] PROFOUNDLY DEAF BUSINESSMANS VIEWS ON THE PALANTYPE SPEECH TRANSCRIPTION SYSTEM
    HAYWARD, G
    INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1979, 11 (06): : 711 - 715
  • [42] PHONEMIA A PHONEME TRANSCRIPTION SYSTEM FOR SPEECH SYNTHESIS IN MODERN GREEK
    BAKAMIDIS, S
    CARAYANNIS, G
    SPEECH COMMUNICATION, 1987, 6 (02) : 159 - 169
  • [43] The IBM 2006 Speech Transcription System for European Parliamentary Speeches
    Ramabhadran, B.
    Siohan, O.
    Mangu, L.
    Zweig, G.
    Westphal, M.
    Schulz, H.
    Soneiro, A.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1225 - +
  • [44] Development of Transcription System Using Speech Recognition for Program Production
    Mishima T.
    Hagiwara A.
    Ito H.
    Komori T.
    Horikawa D.
    Kawase N.
    Sato S.
    1600, Inst. of Image Information and Television Engineers (74): : 729 - 735
  • [45] Prεεch: A System for Privacy-Preserving Speech Transcription
    Ahmed, Shimaa
    Chowdhury, Amrita Roy
    Fawaz, Kassem
    Ramanathan, Parmesh
    PROCEEDINGS OF THE 29TH USENIX SECURITY SYMPOSIUM, 2020, : 2703 - 2720
  • [46] More to Meetings: Challenges in Using Speech-Based Technology to Support Meetings
    McGregor, Moira
    Tang, John C.
    CSCW'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, : 2208 - 2220
  • [47] A Speech Recognizer for Frisian/Dutch Council Meetings
    Bentum, Martijn
    ten Bosch, Louis
    van den Heuvel, Henk
    Wills, Simone
    van der Niet, Domenique
    Dijkstra, Jelske
    Van de Velde, Hans
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 1009 - 1015
  • [48] Transcription table: Text support during meetings
    van Gelder, J
    van Peer, I
    Aliakseyeu, D
    HUMAN-COMPUTER INTERACTION - INTERACT 2005, PROCEEDINGS, 2005, 3585 : 1002 - 1005
  • [49] Speech recognition and transcription
    Benton, C
    ACADEMIC RADIOLOGY, 2001, 8 (05) : 427 - 429
  • [50] AUTOMATIC SPEAKER ROLE LABELING IN AMI MEETINGS: RECOGNITION OF FORMAL AND SOCIAL ROLES
    Sapru, Ashtosh
    Valente, Fabio
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5057 - 5060