Interpretable and Learnable Super-Resolution Time-Frequency Representation

被引:0
|
作者
Balestriero, Randall [1 ]
Glotin, Herve [2 ,3 ]
Baraniuk, Richard G. [1 ]
机构
[1] Rice Univ, ECE Dept, Houston, TX 77005 USA
[2] Univ Toulon & Var, Toulon, France
[3] Aix Marseille Univ, CNRS, LIS, DYNI,INPS, Marseille, France
关键词
Learnable Time-Frequency Representation; Time-Series; Cohen Class; Wigner-Ville; Spectrogram; Wavelet; Chirplet; Gabor Transform; Audio Classification; Interpretability; Explainability; Super-Resolution; Speech Recognition; Environmental Sounds; biosonar; bioacoustics; WIGNER DISTRIBUTION; COMPRESSION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a novel interpretable and learnable time-frequency representation (TFR) that produces a super-resolved quadratic signal representation for time-series analysis; the proposed TFR is a Gaussian filtering of the Wigner-Ville (WV) transform of a signal parametrized with a few interpretable parameters. Our approach has two main hallmarks. First, by varying the filters applied onto the WV, our new TFR can interpolate between known TFRs such as spectrograms, wavelet transforms, and chirplet transforms. Beyond that, our representation can also reach perfect time and frequency localization, hence super-resolution; this generalizes standard TFRs whose resolution is limited by the Heisenberg uncertainty principle. Second, our proposed TFR is interpretable thanks to an explicit low-dimensional and physical parametrization of the WV Gaussian filtering. We demonstrate that our approach enables us to learn highly adapted TFRs and is able to tackle a range of large-scale classification tasks, where we reach higher performance compared to baseline and learned TFRs. Ours is to the best of our knowledge the first learnable TFR that can continuously interpolate between super-resolution representation and commonly employed TFRs based on a few learnable parameters and which preserves full interpretability of the produced TFR, even after learning.
引用
收藏
页码:118 / 152
页数:35
相关论文
共 50 条
  • [1] Time-frequency super-resolution with superlets
    Moca, Vasile V.
    Barzan, Harald
    Nagy-Dabacan, Adriana
    Muresan, Raul C.
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [2] Time-frequency super-resolution with superlets
    Vasile V. Moca
    Harald Bârzan
    Adriana Nagy-Dăbâcan
    Raul C. Mureșan
    Nature Communications, 12
  • [3] TIME-FREQUENCY NETWORKS FOR AUDIO SUPER-RESOLUTION
    Lim, Teck Yian
    Yeh, Raymond A.
    Xu, Yijia
    Do, Minh N.
    Hasegawa-Johnson, Mark
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 646 - 650
  • [4] TIME-FREQUENCY LOSS FOR CNN BASED SPEECH SUPER-RESOLUTION
    Wang, Heming
    Wang, Deliang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 861 - 865
  • [5] Super-resolution analysis method for time-frequency signal with very small carrier frequency difference
    Yang W.
    Li D.
    Zhang J.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2020, 42 (09): : 1881 - 1889
  • [6] Dynamic learnable degradation for blind super-resolution
    Xu, Ling
    Xiao, Qingguo
    Chen, Qiaochuan
    Li, Guangyao
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [7] Global Learnable Attention for Single Image Super-Resolution
    Su, Jian-Nan
    Gan, Min
    Chen, Guang-Yong
    Yin, Jia-Li
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8453 - 8465
  • [8] Degramnet: effective audio analysis based on a fully learnable time-frequency representation
    Foggia, Pasquale
    Greco, Antonio
    Roberto, Antonio
    Saggese, Alessia
    Vento, Mario
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 20207 - 20219
  • [9] An efficient high-resolution time-frequency representation
    Nelson, D. J.
    Smith, D. C.
    ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS XVI, 2006, 6313
  • [10] Learnable Nonlocal Contrastive Network for Single Image Super-Resolution
    Xu, Binbin
    Zheng, Yuhui
    APPLIED SCIENCES-BASEL, 2023, 13 (12):