Improving the Training Recipe for a Robust Conformer-based Hybrid Model

Cited by: 2
Authors
Zeineldeen, Mohammad [1, 2]
Xu, Jingjing [1]
Luescher, Christoph [1, 2]
Schlueter, Ralf [1, 2]
Ney, Hermann [1, 2]
Affiliations
[1] RWTH Aachen University, Department of Computer Science, Human Language Technology and Pattern Recognition, D-52074 Aachen, Germany
[2] AppTek GmbH, D-52062 Aachen, Germany
Keywords
speech recognition; conformer acoustic model; speaker adaptation; NEURAL-NETWORKS; SPEAKER;
DOI
10.21437/Interspeech.2022-10723
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206; 082403;
Abstract
Speaker adaptation is important for building robust automatic speech recognition (ASR) systems. In this work, we investigate various methods for speaker adaptive training (SAT) based on feature-space approaches for a conformer-based acoustic model (AM) on the Switchboard 300h dataset. We propose a method, called Weighted-Simple-Add, which adds weighted speaker information vectors to the input of the multi-head self-attention module of the conformer AM. Using this method for SAT, we achieve 3.5% and 4.5% relative improvement in terms of word error rate (WER) on the CallHome part of Hub5'00 and Hub5'01, respectively. Moreover, we build on our previous work, in which we proposed a novel and competitive training recipe for a conformer-based hybrid AM. We extend and improve this recipe, achieving an 11% relative improvement in WER on the Switchboard 300h Hub5'00 dataset. We also make this recipe more efficient by reducing the total number of parameters by 34% relative.
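The abstract describes Weighted-Simple-Add only at a high level: a weighted speaker information vector is added to the input of the multi-head self-attention module. The following is a minimal NumPy sketch of that idea, not the authors' implementation. The function name `weighted_simple_add`, the linear projection `proj` that maps the speaker vector to the model dimension, the scalar `weight`, and all dimensions are assumptions made for illustration; the abstract does not state how the weight is chosen or whether a projection is used.

```python
import numpy as np


def weighted_simple_add(x, speaker_vec, proj, weight):
    """Add a weighted, projected speaker vector to every frame of x.

    x:           (T, d_model) input to the self-attention module
    speaker_vec: (d_spk,) speaker information vector (e.g. an i-vector)
    proj:        (d_spk, d_model) hypothetical projection to the model dimension
    weight:      scalar weight on the speaker information (assumed, not from the source)
    """
    spk = speaker_vec @ proj      # shape (d_model,)
    return x + weight * spk       # broadcast over the T frames


# Toy dimensions, chosen only for the example.
rng = np.random.default_rng(0)
T, d_model, d_spk = 4, 8, 3
x = rng.standard_normal((T, d_model))
s = rng.standard_normal(d_spk)
W = rng.standard_normal((d_spk, d_model))

out = weighted_simple_add(x, s, W, weight=0.5)
print(out.shape)
```

With `weight=0.0` the function returns the input unchanged, so the weight can be read as controlling how strongly the speaker information perturbs the attention input.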
Pages: 1036-1040
Number of pages: 5