Transfer learning through perturbation-based in-domain spectrogram augmentation for adult speech recognition

被引:0
|
作者
Kadyan, Virender [1 ]
Bawa, Puneet [2 ]
机构
[1] Speech and Language Research Centre, School of Computer Science, University of Petroleum & Energy Studies (UPES), Energy Acres, Bidholi, Uttarakhand, Dehradun,248007, India
[2] Centre of Excellence for Speech and Multimodal Laboratory, Chitkara University Institute of Engineering & Technology, Chitkara University, Punjab, Rajpura, India
来源
Neural Computing and Applications | 2022年 / 34卷 / 23期
关键词
Compilation and indexing terms; Copyright 2024 Elsevier Inc;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic speech recognition system - Data augmentation - Data scarcity - Learning techniques - Overfitting - Pedagogical practices - Punjabi speech recognition - Spectrogram augmentation - Spectrograms - Transfer learning
引用
收藏
页码:21015 / 21033
相关论文
共 50 条
  • [1] Transfer learning through perturbation-based in-domain spectrogram augmentation for adult speech recognition
    Kadyan, Virender
    Bawa, Puneet
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (23): : 21015 - 21033
  • [2] Retraction Note: Transfer learning through perturbation-based in-domain spectrogram augmentation for adult speech recognition
    Virender Kadyan
    Puneet Bawa
    Neural Computing and Applications, 2024, 36 (24) : 15235 - 15235
  • [3] Landmark perturbation-based data augmentation for unconstrained face recognition
    Lv, Jiang-Jing
    Cheng, Cheng
    Tian, Guo-Dong
    Zhou, Xiang-Dong
    Zhou, Xi
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 465 - 475
  • [4] Data Augmentation Techniques for Transfer Learning-Based Continuous Dysarthric Speech Recognition
    Celin, T. A. Mariya
    Vijayalakshmi, P.
    Nagarajan, T.
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (01) : 601 - 622
  • [5] Data Augmentation Techniques for Transfer Learning-Based Continuous Dysarthric Speech Recognition
    T. A. Mariya Celin
    P. Vijayalakshmi
    T. Nagarajan
    Circuits, Systems, and Signal Processing, 2023, 42 : 601 - 622
  • [6] Perturbation-Based Two-Stage Multi-Domain Active Learning
    He, Rui
    Dai, Zeyu
    He, Shan
    Tang, Ke
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 3933 - 3937
  • [7] Speech emotion recognition based on meta-transfer learning with domain adaption
    Liu, Zhen -Tao
    Wu, Bao-Han
    Han, Meng -Ting
    Cao, Wei -Hua
    Wu, Min
    APPLIED SOFT COMPUTING, 2023, 147
  • [8] Vocal Tract Length Perturbation-based Pseudo-Speaker Augmentation for Speaker Embedding Learning
    Wakamatsu, Tomoka
    Shiota, Sayaka
    Kiya, Hitoshi
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2228 - 2232
  • [9] Image Perturbation-Based Deep Learning for Face Recognition Utilizing Discrete Cosine Transform
    Park, Jaehun
    Kim, Kwangsu
    ELECTRONICS, 2022, 11 (01)
  • [10] Category-based and Target-based Data Augmentation for Dysarthric Speech Recognition Using Transfer Learning
    Nawroly, Sarkhell Sirwan
    Popescu, Decebal
    Antony, Mariya Celin T. H. E. K. E. K. A. R. A.
    STUDIES IN INFORMATICS AND CONTROL, 2024, 33 (04):