SEMI-SUPERVISED TRAINING FOR END-TO-END MODELS VIA WEAK DISTILLATION

Authors
Li, Bo [1]
Sainath, Tara N. [1]
Pang, Ruoming [1]
Wu, Zelin [1]
Affiliations
[1] Google LLC, Mountain View, CA 94043 USA
Keywords
semi-supervised training; sequence to sequence
DOI
10.1109/icassp.2019.8682172
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
End-to-end (E2E) models are a promising research direction in speech recognition, as a single all-neural E2E system offers a much simpler and more compact solution than a conventional model, which has separate acoustic (AM), pronunciation (PM), and language (LM) models. However, it has been noted that E2E models perform poorly on tail words and proper nouns, likely because end-to-end optimization requires paired audio-text data and does not take advantage of the additional lexicons and large amounts of text-only data used to train the LMs in conventional models. There have been numerous efforts to train an RNN-LM on text-only data and fuse it into the end-to-end model. In this work, we contrast this approach with training the E2E model on audio-text pairs generated from unsupervised speech data. To target the proper noun issue specifically, we adopt a Part-of-Speech (POS) tagger to filter the unsupervised data and use only those utterances that contain proper nouns. We show that training with filtered unsupervised data provides up to a 13% relative reduction in word error rate (WER), and when used in conjunction with a cold-fusion RNN-LM, up to a 17% relative improvement.
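The proper-noun filtering step and the "relative reduction" metric mentioned in the abstract can be illustrated with a minimal Python sketch. It assumes the machine-generated transcripts have already been POS-tagged with Penn Treebank tags (where NNP and NNPS mark proper nouns); the function names, the utterance IDs, and the input layout are hypothetical and are not taken from the paper.

```python
from typing import Iterable, List, Tuple

# Penn Treebank tags for singular and plural proper nouns.
PROPER_NOUN_TAGS = {"NNP", "NNPS"}

# Hypothetical layout: (utterance id, [(word, POS tag), ...]).
TaggedUtterance = Tuple[str, List[Tuple[str, str]]]


def has_proper_noun(tagged_words: Iterable[Tuple[str, str]]) -> bool:
    """Return True if at least one word carries a proper-noun tag."""
    return any(tag in PROPER_NOUN_TAGS for _, tag in tagged_words)


def filter_proper_noun_utterances(
    utterances: Iterable[TaggedUtterance],
) -> List[str]:
    """Keep only the ids of utterances whose transcript has a proper noun."""
    return [utt_id for utt_id, tagged in utterances if has_proper_noun(tagged)]


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Relative WER reduction: (baseline - new) / baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


# Made-up example utterances, for illustration only.
example = [
    ("utt1", [("call", "VB"), ("Alice", "NNP")]),
    ("utt2", [("play", "VB"), ("some", "DT"), ("music", "NN")]),
]
kept = filter_proper_noun_utterances(example)  # only "utt1" has an NNP tag
```

In practice the tags would come from whatever POS tagger is available; the sketch only shows the selection rule, not how the transcripts or tags are produced.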
Pages: 2837-2841
Number of pages: 5