Data selection by sequence summarizing neural network in mismatch condition training

被引:2
|
作者
Zmolikova, Katerina [1 ,2 ]
Karafiat, Martin [1 ,2 ]
Vesely, Karel [1 ,2 ]
Delcroix, Marc [3 ]
Watanabe, Shinji [4 ]
Burget, Lukas [1 ,2 ]
Cernocky, Jan Honza [1 ,2 ]
机构
[1] Brno Univ Technol, Speech FIT, Brno, Czech Republic
[2] IT4I Ctr Excellence, Brno, Czech Republic
[3] NTT Corp, NTT Commun Sci Labs, Kyoto, Japan
[4] MERL, Cambridge, MA USA
关键词
Automatic speech recognition; Data augmentation; Data selection; Mismatch training condition; Sequence summarization;
D O I
10.21437/Interspeech.2016-741
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Data augmentation is a simple and efficient technique to improve the robustness of a speech recognizer when deployed in mismatched training-test conditions. Our paper proposes a new approach for selecting data with respect to similarity of acoustic conditions. The similarity is computed based on a sequence summarizing neural network which extracts vectors containing acoustic summary (e.g. noise and reverberation characteristics) of an utterance. Several configurations of this network and different methods of selecting data using these "summary-vectors" were explored. The results are reported on a mismatched condition using AMI training set with the proposed data selection and CHiME3 test set.
引用
收藏
页码:2354 / 2358
页数:5
相关论文
共 50 条
  • [1] SEQUENCE SUMMARIZING NEURAL NETWORK FOR SPEAKER ADAPTATION
    Vesely, Karel
    Watanabe, Shinji
    Zmolikova, Katerina
    Karafiat, Martin
    Burget, Lukas
    Cernocky, Jan Honza
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5315 - 5319
  • [2] Selection of Training Data for Locally Recurrent Neural Network
    Patan, Krzysztof
    Patan, Maciej
    ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT II, 2010, 6353 : 134 - 137
  • [3] Instance Selection Optimization for Neural Network Training
    Kordos, Miroslaw
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2016, 2016, 9692 : 610 - 620
  • [4] Performance of a Neural Network Receiver under Mismatch of Channel Training Samples
    de Souza, Pedro H. C.
    Mendes, Luciano L.
    Souza, Richard Demo
    2022 IEEE FUTURE NETWORKS WORLD FORUM, FNWF, 2022, : 532 - 537
  • [5] Sequence Summarizing Neural Networks for Spoken Language Recognition
    Pesan, Jan
    Burget, Lukas
    Cernocky, Jan Honza
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3285 - 3288
  • [6] PROTOTYPE SELECTION-RULES FOR NEURAL NETWORK TRAINING
    GUPTA, L
    WANG, JS
    CHARLES, A
    KISATSKY, P
    PATTERN RECOGNITION, 1992, 25 (11) : 1401 - 1408
  • [7] Effective training data selection in tool condition monitoring system
    Sun, J
    Hong, GS
    Wong, YS
    Rahman, M
    Wang, ZG
    INTERNATIONAL JOURNAL OF MACHINE TOOLS & MANUFACTURE, 2006, 46 (02): : 218 - 224
  • [8] Data conflict elimination in neural network training
    Raggad, BG
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 1996, 36 (04) : 92 - 98
  • [9] Graph Neural Network Training with Data Tiering
    Min, Seung Won
    Wu, Kun
    Hidayetoglu, Mert
    Xiong, Jinjun
    Song, Xiang
    Hwu, Wen-mei
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3555 - 3565
  • [10] Effect of data standardization on neural network training
    Shanker, M
    Hu, MY
    Hung, MS
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1996, 24 (04): : 385 - 397