Parameter Tuning-Free Missing-Feature Reconstruction for Robust Sound Recognition

Cited by: 4
Authors
Liu, Qi [1]
Wu, Jibin [1]
Affiliations
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 119077, Singapore
Funding
National Research Foundation, Singapore;
Keywords
Spectrogram; Matrix decomposition; Acoustics; Task analysis; Computational modeling; Speech recognition; Tuning; Missing-feature reconstruction; matrix factorization; deep neural networks (DNNs); automatic speech recognition (ASR); environmental sound classification; AUTOMATIC SPEECH RECOGNITION; MATRIX COMPLETION; FEATURE-EXTRACTION; ALGORITHM; RECOVERY; OPTIMIZATION;
DOI
10.1109/JSTSP.2020.3038054
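Chinese Library Classification number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Discipline classification code
0808; 0809;
Abstract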
With the advent of deep neural networks, automatic speech recognition (ASR) has improved significantly in recent years. However, ASR performance degrades rapidly when the acoustic environment, such as the communication channel or background noise, differs from that of the training data. In the missing-feature approach to speech processing, unreliable feature components are identified and reconstructed to overcome signal degradation and acoustic-environment mismatch. To reduce model dependency, we investigate matrix completion techniques for missing-feature reconstruction. However, most matrix completion techniques require parameters to be tuned a priori, e.g., the target rank, which is hard to determine in practice. In this work, we propose a matrix completion method based on matrix factorization for the missing-feature reconstruction task that requires neither model training nor parameter tuning. Experiments show superior feature reconstruction performance and computational efficiency in both speech recognition and environmental sound classification tasks.
Pages: 78-89
Page count: 12
Related papers
50 in total
  • [21] Tuning-Free, Low Memory Robust Estimator to Mitigate GPS Spoofing Attacks
    Lee, Junhwan
    Taha, Ahmad F.
    Gatsis, Nikolaos
    Akopian, David
    IEEE CONTROL SYSTEMS LETTERS, 2020, 4 (01): 145 - 150
  • [22] Reconstruction of missing features for robust speech recognition
    Raj, B
    Seltzer, ML
    Stern, RM
    SPEECH COMMUNICATION, 2004, 43 (04) : 275 - 296
  • [23] InterAug: A Tuning-Free Augmentation Policy for Data-Efficient and Robust Object Detection
    Thopalli, Kowshik
    Devi, S.
    Thiagarajan, Jayaraman J.
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023: 253 - 261
  • [24] Comment on "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Fan, Jianqing
    Ma, Cong
    Wang, Kaizheng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1720 - 1725
  • [25] Discussion of "A Tuning-Free Robust and Efficient Approach to High-Dimensional Regression"
    Li, Xiudi
    Shojaie, Ali
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1717 - 1719
  • [26] MISSING FEATURE RECONSTRUCTION METHODS FOR ROBUST SPEAKER IDENTIFICATION
    Zhang, Xueliang
    Zhang, Hui
    Gao, Guanglai
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014: 1482 - 1486
  • [27] Square Root Principal Component Pursuit: Tuning-Free Noisy Robust Matrix Recovery
    Zhang, Junhui
    Yan, Jingkai
    Wright, John
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021
  • [28] Robust and Tuning-Free Sparse Linear Regression via Square-Root Slope
    Minsker, Stanislav
    Ndaoud, Mohamed
    Wang, Lang
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2024, 6 (02): 428 - 453
  • [29] Using Blob Detection in Missing Feature Linear-Frequency Cepstral Coefficients for Robust Sound Event Recognition
    Leng, Yi Ren
    Huy Dat Tran
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012: 2505 - 2508
  • [30] Missing-feature based speech recognition for two simultaneous speech signals separated by ICA with a pair of humanoid ears
    Takeda, Ryu
    Yamamoto, Shun'ichi
    Komatani, Kazunori
    Ogata, Tetsuya
    Okuno, Hiroshi G.
    2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 878 - +