Training augmentation with TANDEM acoustic modelling in Punjabi adult speech recognition system

被引:0
|
作者
Virender Kadyan
Shashi Bala
Puneet Bawa
机构
[1] University of Petroleum & Energy Studies (UPES),Department of Informatics, School of Computer Science
[2] Chitkara University Institute of Engineering and Technology,Centre of Excellence for Speech and Multimodal Laboratory
[3] Chitkara University,undefined
关键词
Tandem-NN; Data augmentation; Bottleneck features; Punjabi ASR; DNN-HMM;
D O I
暂无
中图分类号
学科分类号
摘要
Processing of low resource pre and post acoustic signals always faced the challenge of data scarcity in its training module. It’s difficult to obtain high system accuracy with limited corpora in train set which results into extraction of large discriminative feature vector. These vectors information are distorted due to acoustic mismatch occurs because of real environment and inter speaker variations. In this paper, context independent information of an input speech signal is pre-processed using bottleneck features and later in modeling phase Tandem-NN model has been employ to enhance system accuracy. Later to fulfill the requirement of train data issues, in-domain training augmentation is perform using fusion of original clean and artificially created modified train noisy data and to further boost this training data, tempo modification of input speech signal is perform with maintenance of its spectral envelope and pitch in corresponding input audio signal. Experimental result shows that a relative improvement of 13.53% is achieved in clean and 32.43% in noisy conditions with Tandem-NN system in comparison to that of baseline system respectively.
引用
收藏
页码:473 / 481
页数:8
相关论文
共 50 条
  • [1] Training augmentation with TANDEM acoustic modelling in Punjabi adult speech recognition system
    Kadyan, Virender
    Bala, Shashi
    Bawa, Puneet
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 473 - 481
  • [2] In domain training data augmentation on noise robust Punjabi Children speech recognition
    Kadyan, Virender
    Bawa, Puneet
    Hasija, Taniya
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 13 (5) : 2705 - 2721
  • [3] In domain training data augmentation on noise robust Punjabi Children speech recognition
    Virender Kadyan
    Puneet Bawa
    Taniya Hasija
    Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2705 - 2721
  • [4] An automatic speech recognition system for spontaneous Punjabi speech corpus
    Kumar Y.
    Singh N.
    International Journal of Speech Technology, 2017, 20 (2) : 297 - 303
  • [5] Multi-setting acoustic feature training for data augmentation of speech recognition
    Ueno, Sei
    Lee, Akinobu
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2024, 45 (04) : 195 - 203
  • [6] Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions
    Bhardwaj, Vivek
    Kukreja, Vinay
    APPLIED ACOUSTICS, 2021, 177
  • [7] Effect of pitch enhancement in Punjabi children's speech recognition system under disparate acoustic conditions
    Bhardwaj, Vivek
    Kukreja, Vinay
    Applied Acoustics, 2021, 177
  • [8] Implementation of Phonetic Level Speech Recognition System for Punjabi Language
    Mittal, Shama
    Kaur, Rupinderdeep
    2016 1ST INDIA INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING (IICIP), 2016,
  • [9] Acoustic training system for speaker independent continuous arabic speech recognition system
    Nofal, M
    Abdel-Raheem, E
    El Henawy, H
    Kader, NA
    Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 200 - 203
  • [10] Speech Recognition System of the Punjabi Language for Multi-Resolution Speech Analysis
    Guglani, Jyoti
    Mishra, A.N.
    SSRN, 1600,