The 2005 AMI system for the transcription of speech in meetings

被引：0

作者：

Hain, T ^{[1
]}

Burget, L

Dines, J

Garau, G

Karafiat, M

Lincoln, M

McCowan, I

Moore, D

Wan, V

Ordelman, R

Renals, S

机构：

[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England

[2] Brno Univ Technol, Fac Informat Engn, Brno 61266, Czech Republic

[3] IDIAP Res Inst, CH-1920 Martigny, Switzerland

[4] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland

[5] Univ Twente, Dept Elect Engn, NL-7500 AE Enschede, Netherlands

来源：

MACHINE LEARNING FOR MULTIMODAL INTERACTION | 2005年 / 3869卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we describe the 2005 AMI system for the transcription of speech in meetings used in the 2005 NIST RT evaluations. The system was designed for participation in the speech to text part of the evaluations, in particular for transcription of speech recorded with multiple distant microphones and independent headset microphones. System performance was tested on both conference room and lecture style meetings. Although input sources are processed using different front-ends, the recognition process is based on a unified system architecture. The system operates in multiple passes and makes use of state of the art technologies such as discriminative training, vocal tract length normalisation, heteroscedastic linear discriminant analysis, speaker adaptation with maximum likelihood linear regression and minimum word error rate decoding. In this paper we describe the system performance on the official development and test sets for the NIST RT05s evaluations. The system was jointly developed in less than 10 months by a multi-site team and was shown to achieve competitive performance.

引用

页码：450 / 462

页数：13

共 50 条

[21] Forthcoming Meetings February 2005
Supportive Care in Cancer, 2005, 13 (2) : 133 - 133
[22] Forthcoming Meetings March 2005
Supportive Care in Cancer, 2005, 13 (3) : 202 - 202
[23] Forthcoming Meetings January 2005
Supportive Care in Cancer, 2005, 13 (1) : 75 - 75
[24] Forthcoming Meetings August 2005
Supportive Care in Cancer, 2005, 13 (8) : 668 - 668
[25] Meetings about meetings: Research at ICSI on speech in multiparty conversations
Morgan, N
Baron, D
Bhagat, S
Carvey, H
Dhillon, R
Edwards, J
Gelbart, D
Janin, A
Krupski, A
Peskin, B
Pfau, T
Shriberg, E
Stolcke, A
Wooters, C
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PROCEEDINGS: SIGNAL PROCESSING FOR COMMUNICATIONS SPECIAL SESSIONS, 2003, : 740 - 743
[26] 1998 HTK system for transcription of conversational telephone speech
Hain, T.
Woodland, P.C.
Niesler, T.R.
Whittaker, E.W.D.
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 57 - 60
[27] Transcription System for Semi-Spontaneous Estonian Speech
Alumaee, Tanel
HUMAN LANGUAGE TECHNOLOGIES: THE BALTIC PERSPECTIVE, 2012, 247 : 10 - 17
[28] Intelligent transcription system based on spontaneous speech processing
Kawahara, Tatsuya
ICKS 2007: SECOND INTERNATIONAL CONFERENCE ON INFORMATICS RESEARCH FOR DEVELOPMENT OF KNOWLEDGE SOCIETY INFRASTRUCTURE, PROCEEDINGS, 2007, : 19 - 26
[29] THE IBM 2009 GALE ARABIC SPEECH TRANSCRIPTION SYSTEM
Kingsbury, Brian
Soltau, Hagen
Saon, George
Chu, Stephen
Kuo, Hong-Kwang
Mangu, Lidia
Ravuri, Suman
Morgan, Nelson
Janin, Adam
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4672 - 4675
[30] Slovak Broadcast News Speech Recognition and Transcription System
Lojka, Martin
Viszlay, Peter
Stas, Jan
Hladek, Daniel
Juhar, Jozef
ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2018, 2019, 22 : 385 - 394

← 1 2 3 4 5 →