Matching Pursuit and Sparse Coding for Auditory Representation

被引:6
|
作者
Tran, Dung Kim [1 ]
Unoki, Masashi [1 ]
机构
[1] Japan Adv Inst Sci & Technol, Grad Sch Adv Sci & Technol, Nomi, Ishikawa 9231292, Japan
关键词
Kernel; Time-frequency analysis; Spectrogram; Matching pursuit algorithms; Bandwidth; Dictionaries; Psychoacoustics; Auditory filterbank; equivalent rectangular bandwidth; gammatone; gammachirp; masking effect; matching pursuit; perceptual features; sparse coding; spectrogram; spikegram; RECOGNITION; FILTER; DOMAIN;
D O I
10.1109/ACCESS.2021.3135011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous studies have revealed that by mimicking the neural activity patterns of the auditory periphery to obtain perceptual features of speech signals, the resultant auditory representation is beneficial to speech-coding and pattern-analysis applications in comparison with spectrogram and spikegram representations. However, current solutions use outdated techniques such as the Bark scale and gammatone basis to decompose speech signals. We propose a method of using more physiological accurate techniques such as the equivalent rectangular bandwidth scale, gammachirp basis, and auditory masking effects of gammachirp kernels. Our experimental results indicate that the auditory representation created with our proposed method requires the lowest bitrate (1066 coefficients per second on average) to achieve similar perceptual evaluation scores (0.89 PEMO-Q and 3.27 PESQ scores) compared with spectrogram and spikegram representations. The proposed method also provides the highest matching accuracy with a pattern-matching algorithm.
引用
收藏
页码:167084 / 167095
页数:12
相关论文
共 50 条
  • [31] Research of multi-focus image fusion algorithm based on sparse representation and orthogonal matching pursuit
    Li, Xuejun
    Wang, Minghui
    Communications in Computer and Information Science, 2014, 437 : 57 - 66
  • [32] Micro-Doppler Parameter Estimation via Parametric Sparse Representation and Pruned Orthogonal Matching Pursuit
    Li, Gang
    Varshney, Pramod K.
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (12) : 4937 - 4948
  • [33] Image coding by matching pursuit and perceptual pruning
    Horowitz, MJ
    Neuhoff, DL
    INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL III, 1997, : 654 - 657
  • [34] Dictionary approximation for matching pursuit video coding
    Neff, R
    Zakhor, A
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, : 828 - 831
  • [35] Color image scalable coding with matching pursuit
    Ventura, RMFI
    Vandergheynst, P
    Frossard, P
    Cavallaro, A
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 53 - 56
  • [36] Adaptive coding using matching pursuit in MRI
    Ro, YM
    Chung, JY
    MEDICAL IMAGING 1998: IMAGE PROCESSING, PTS 1 AND 2, 1998, 3338 : 1357 - 1364
  • [37] RAPID ESTIMATION OF ORTHOGONAL MATCHING PURSUIT REPRESENTATION
    Chatterjee, Ayan
    Yuen, Peter W. T.
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1315 - 1318
  • [38] OPTICAL IMPLEMENTATION OF A MATCHING PURSUIT FOR IMAGE REPRESENTATION
    CHAO, TH
    LAU, B
    MICELI, WJ
    OPTICAL ENGINEERING, 1994, 33 (07) : 2303 - 2309
  • [39] Efficiency of Orthogonal Matching Pursuit for Group Sparse Recovery
    Shao, Chunfang
    Wei, Xiujie
    Ye, Peixin
    Xing, Shuo
    AXIOMS, 2023, 12 (04)
  • [40] Active Orthogonal Matching Pursuit for Sparse Subspace Clustering
    Chen, Yanxi
    Li, Gen
    Gu, Yuantao
    IEEE SIGNAL PROCESSING LETTERS, 2018, 25 (02) : 164 - 168