Transfer learning through perturbation-based in-domain spectrogram augmentation for adult speech recognition

Cited by: 0
Authors
Kadyan, Virender [1]
Bawa, Puneet [2]
Affiliations
[1] Speech and Language Research Centre, School of Computer Science, University of Petroleum & Energy Studies (UPES), Energy Acres, Bidholi, Uttarakhand, Dehradun, 248007, India
[2] Centre of Excellence for Speech and Multimodal Laboratory, Chitkara University Institute of Engineering & Technology, Chitkara University, Punjab, Rajpura, India
Source
Neural Computing and Applications | 2022 / Vol. 34 / Issue 23
Keywords
Automatic speech recognition system - Data augmentation - Data scarcity - Learning techniques - Overfitting - Pedagogical practices - Punjabi speech recognition - Spectrogram augmentation - Spectrograms - Transfer learning
DOI
Not available
Pages: 21015-21033
Related papers
50 in total
  • [41] LEARNING NOISE INVARIANT FEATURES THROUGH TRANSFER LEARNING FOR ROBUST END-TO-END SPEECH RECOGNITION
    Zhang, Shucong
    Do, Cong-Thanh
    Doddipatla, Rama
    Renals, Steve
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7024 - 7028
  • [42] Thermal error model of machine tool spindle based on in-domain alignment and transfer learning under variable working conditions
    Zheng Y.
    Fu G.
    Lei G.
    Zhou L.
    Zhu S.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2023, 44 (05) : 33 - 43
  • [43] Sparse Autoencoder-based Feature Transfer Learning for Speech Emotion Recognition
    Deng, Jun
    Zhang, Zixing
    Marchi, Erik
    Schuller, Bjoern
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 511 - 516
  • [44] Research on automatic speech recognition based on a DL-T and transfer learning
    Zhang W.
    Liu C.
    Fei H.-B.
    Li W.
    Yu J.-H.
    Cao Y.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2021, 43 (03) : 433 - 441
  • [45] Helicopter cockpit speech recognition method based on transfer learning and context biasing
    Wang, Guotao
    Wang, Jiaqi
    Wang, Shicheng
    Wu, Qianyu
    Teng, Yuru
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (03):
  • [46] SENet-based speech emotion recognition using synthesis-style transfer data augmentation
    Rajan R.
    Hridya Raj T.V.
    International Journal of Speech Technology, 2023, 26 (04) : 1017 - 1030
  • [47] Unmanned Aerial Vehicle Control through Domain-Based Automatic Speech Recognition
    Contreras, Ruben
    Ayala, Angel
    Cruz, Francisco
    COMPUTERS, 2020, 9 (03) : 1 - 15
  • [48] High-order similarity learning based domain adaptation for speech emotion recognition
    Wang, Hao
    Ji, Yixuan
    Song, Peng
    Liu, Zhaowei
    APPLIED ACOUSTICS, 2025, 231
  • [49] Language dialect based speech emotion recognition through deep learning techniques
    Sukumar Rajendran
    Sandeep Kumar Mathivanan
    Prabhu Jayagopal
    Maheshwari Venkatasen
    Thanapal Pandi
    Manivannan Sorakaya Somanathan
    Muthamilselvan Thangaval
    Prasanna Mani
    International Journal of Speech Technology, 2021, 24 : 625 - 635
  • [50] Interventions in STEM Education Through Speech Recognition-Based Learning Analysis
    Lin, Chia-Ju
    Wang, Wei-Sheng
    Lee, Hsin-Yu
    Huang, Yueh-Min
    Wu, Ting-Ting
    JOURNAL OF EDUCATIONAL COMPUTING RESEARCH, 2025, 63 (02) : 311 - 335