TRANSCRIPTION OF MULTI-GENRE MEDIA ARCHIVES USING OUT-OF-DOMAIN DATA

被引:0
|
作者
Bell, P. J. [1 ]
Gales, M. J. F.
Lanchantin, P.
Liu, X.
Long, Y.
Renals, S. [1 ]
Swietojanski, P. [1 ]
Woodland, P. C.
机构
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9AB, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
speech recognition; tandem; cross-domain adaptation; media archives;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We describe our work on developing a speech recognition system for multi-genre media archives. The high diversity of the data makes this a challenging recognition task, which may benefit from systems trained on a combination of in-domain and out-of-domain data. Working with tandem HMMs, we present Multi-level Adaptive Networks (MLAN), a novel technique for incorporating information from out-of-domain posterior features using deep neural networks. We show that it provides a substantial reduction in WER over other systems, with relative WER reductions of 15% over a PLP baseline, 9% over in-domain tandem features and 8% over the best out-of-domain tandem features.
引用
收藏
页码:324 / 329
页数:6
相关论文
共 50 条
  • [41] Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness
    Gokhale, Tejas
    Mishra, Swaroop
    Luo, Man
    Sachdeva, Bhavdeep Singh
    Baral, Chitta
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 2705 - 2718
  • [42] Neural sentence embedding using only in-domain sentences for out-of-domain sentence detection in dialog systems
    Ryu, Seonghan
    Kim, Seokhwan
    Choi, Junhwi
    Yu, Hwanjo
    Lee, Gary Geunbae
    PATTERN RECOGNITION LETTERS, 2017, 88 : 26 - 32
  • [43] Luke, I am Your Father: Dealing with Out-of-Domain Requests by Using Movies Subtitles
    Ameixa, David
    Coheur, Luisa
    Fialho, Pedro
    Quaresma, Paulo
    INTELLIGENT VIRTUAL AGENTS, IVA 2014, 2014, 8637 : 13 - 21
  • [44] Investigations into out-of-domain performance of a two-step ATR based on a fusion of thermal and environmental data
    Bragdon, Sophia P.
    Truong, Vuong H.
    Trautz, Andrew C.
    Bray, Matthew D.
    Clausen, Jay L.
    AUTOMATIC TARGET RECOGNITION XXXIV, 2024, 13039
  • [45] Dialogue Act Recognition for Chinese Out-of-Domain Utterances using Hybrid CNN-RF
    Wang, Jundong
    Huang, Peijie
    Huang, Qiangjia
    Ke, Zixuan
    Lin, Piyuan
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 14 - 17
  • [46] Evaluating RETFound for out-of-domain generalization in multi-disease detection from color fundus photographs
    Matta, Sarah
    Lamard, Mathieu
    Le Guilcher, Alexandre
    Borderie, Laurent
    Massin, Pascale
    Rottier, Jean-Bernard
    Cochener, Beatrice
    Quellec, Gwenole
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [47] Semi-supervised Training of Acoustic Models Leveraging Knowledge Transferred from Out-of-Domain Data
    Lo, Tien-Hong
    Chen, Berlin
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1400 - 1404
  • [48] Glioma subtype classification from histopathological images using in-domain and out-of-domain transfer learning: An experimental study
    Despotovic, Vladimir
    Kim, Sang-Yoon
    Hau, Ann-Christin
    Kakoichankava, Aliaksandra
    Klamminger, Gilbert Georg
    Borgmann, Felix Bruno Kleine
    Frauenknecht, Katrin B. M.
    Mittelbronn, Michel
    Nazarov, Petr, V
    HELIYON, 2024, 10 (05)
  • [49] Consistency-Guided Temperature Scaling Using Style and Content Information for Out-of-Domain Calibration
    Choi, Wonjeong
    Park, Jungwuk
    Han, Dong-Jun
    Park, Younghyun
    Moon, Jaekyun
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11588 - 11596
  • [50] A Gender Identification of Text Author in Mixture of Russian Multi-genre Texts with Distortions on Base of Data-driven Approach using Machine Learning Models
    Sboev, Alexander
    Gudovskikh, Dmitry
    Moloshnikov, Ivan
    Rybka, Roman
    INTERNATIONAL CONFERENCE ON NUMERICAL ANALYSIS AND APPLIED MATHEMATICS (ICNAAM-2018), 2019, 2116