Data selection by sequence summarizing neural network in mismatch condition training

Cited by: 2
Authors
Zmolikova, Katerina [1, 2]
Karafiat, Martin [1, 2]
Vesely, Karel [1, 2]
Delcroix, Marc [3]
Watanabe, Shinji [4]
Burget, Lukas [1, 2]
Cernocky, Jan Honza [1, 2]
Affiliations
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
[2] IT4I Ctr Excellence, Brno, Czech Republic
[3] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
[4] MERL, Cambridge, MA USA
Keywords
Automatic speech recognition; Data augmentation; Data selection; Mismatch training condition; Sequence summarization
DOI
10.21437/Interspeech.2016-741
Chinese Library Classification
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Data augmentation is a simple and efficient technique to improve the robustness of a speech recognizer when deployed in mismatched training-test conditions. Our paper proposes a new approach for selecting data with respect to similarity of acoustic conditions. The similarity is computed using a sequence summarizing neural network, which extracts vectors containing an acoustic summary (e.g. noise and reverberation characteristics) of an utterance. Several configurations of this network and different methods of selecting data using these "summary-vectors" were explored. Results are reported for a mismatched condition, using the AMI training set with the proposed data selection and the CHiME3 test set.
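The selection idea in the abstract can be illustrated with a minimal sketch. The abstract does not state which similarity measure or selection rule was used, so everything here is an assumption for illustration: summary-vectors are taken as given plain lists of floats, similarity is cosine similarity to the centroid of the test-condition vectors, and the names `mean_vector`, `cosine`, and `select_similar` are hypothetical.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def select_similar(train_vectors, test_vectors, n_select):
    """Rank training utterances by similarity to the test condition.

    Assumption: the test condition is represented by the centroid of its
    summary-vectors, and the n_select most similar training utterances
    are kept. The paper may use a different rule.
    """
    target = mean_vector(test_vectors)
    scores = [cosine(v, target) for v in train_vectors]
    # sorted() is stable, so ties keep their original order
    ranked = sorted(range(len(scores)), key=lambda i: -scores[i])
    return ranked[:n_select], scores


# Toy 2-dimensional summary-vectors (made-up values, not real data)
train = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [-1.0, 0.0]]
test = [[1.0, 0.1], [1.0, -0.1]]
picked, scores = select_similar(train, test, n_select=2)
print(picked)  # [0, 2]
```

In this toy case the test centroid points along the first axis, so training utterances 0 and 2 are selected. A real system would obtain the vectors from the trained summarizing network rather than from hand-written lists.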
Pages: 2354 - 2358
Page count: 5
Related papers
50 in total
  • [31] A NEURAL NETWORK BASED SUMMARIZING METHOD OF PERIODIC IMAGE SEQUENCES
    Berkane, Mohamed
    Clarysse, Patrick
    Njiwa, Josiane Yankam
    Zhu, Yue Min
    Magnin, Isabelle E.
    NEURAL NETWORK WORLD, 2010, 20 (06) : 687 - 703
  • [32] Selection of minimum training data for generalization and on-line training by multilayer neural networks
    Hara, K
    Nakayama, K
    ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 436 - 441
  • [33] MaxUp: Lightweight Adversarial Training with Data Augmentation Improves Neural Network Training
    Gong, Chengyue
    Ren, Tongzheng
    Ye, Mao
    Liu, Qiang
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2474 - 2483
  • [34] On Training Data Selection in Condition Monitoring Applications-Case Azimuth Thrusters
    Nikula, Riku-Pekka
    Ruusunen, Mika
    Bohme, Stephan Andre
    APPLIED SCIENCES-BASEL, 2022, 12 (08):
  • [35] Crucial Data Selection based on Random Weight Neural Network
    Ji, Jie
    Jiang, Hongcheng
    Zhao, Bin
    Zhai, Peng
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1017 - 1022
  • [36] The Application of Data Mining and BP Neural Network in Supplier Selection
    Shi Cheng-dong
    Chen Ju-hong
    Sun Qi-xia
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 947 - +
  • [37] Test Case Selection for Neural Network via Data Mutation
    Cao, Xue-Jie
    Chen, Jun-Jie
    Yan, Ming
    You, Han-Mo
    Wu, Zhuo
    Wang, Zan
    Ruan Jian Xue Bao/Journal of Software, 2024, 35 (11) : 4973 - 4992
  • [38] Evaluation of Stratified Validation in Neural Network Training with Imbalanced Data
    Takase, Tomoumi
    Oyama, Satoshi
    Kurihara, Masahito
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 379 - 382
  • [39] Editing training data for kNN classifiers with neural network ensemble
    Jiang, Y
    Zhou, ZH
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 356 - 361
  • [40] Data-scaling problems in neural-network training
    Koprinkova, P
    Petrova, M
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 1999, 12 (03) : 281 - 296