Phase-aware music super-resolution using generative adversarial networks

Cited by: 10
Authors
Hu, Shichao [1]
Zhang, Bin [1]
Liang, Beici [1]
Zhao, Ethan [1]
Lui, Simon [1]
Affiliations
[1] Tencent Music Entertainment (TME), Shenzhen 518057, People's Republic of China
Source
Keywords
Music super-resolution; Bandwidth expansion; Generative adversarial network; Phase estimation; BANDWIDTH EXTENSION; NARROW-BAND; SPEECH;
DOI
10.21437/Interspeech.2020-2605
Chinese Library Classification (CLC) number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
Audio super-resolution is a challenging task of recovering the missing high-resolution features from a low-resolution signal. To address this, generative adversarial networks (GAN) have been used to achieve promising results by training the mappings between magnitudes of the low and high-frequency components. However, phase information is not well-considered for waveform reconstruction in conventional methods. In this paper, we tackle the problem of music super-resolution and conduct a thorough investigation on the importance of phase for this task. We use GAN to predict the magnitudes of the high-frequency components. The corresponding phase information can be extracted using either a GAN-based waveform synthesis system or a modified Griffin-Lim algorithm. Experimental results show that phase information plays an important role in the improvement of the reconstructed music quality. Moreover, our proposed method significantly outperforms other state-of-the-art methods in terms of objective evaluations.
Pages: 4074 - 4078
Number of pages: 5
Related papers
50 in total
  • [41] Image super-resolution using conditional generative adversarial network
    Qiao, Jiaojiao
    Song, Huihui
    Zhang, Kaihua
    Zhang, Xiaolu
    Liu, Qingshan
    IET IMAGE PROCESSING, 2019, 13 (14) : 2673 - 2679
  • [42] Face Video Super-Resolution with Identity Guided Generative Adversarial Networks
    Li, Dingyi
    Wang, Zengfu
    COMPUTER VISION, PT II, 2017, 772 : 357 - 369
  • [43] Super-resolution generative adversarial networks of randomly-seeded fields
    Guemes, Alejandro
    Vila, Carlos Sanmiguel
    Discetti, Stefano
    NATURE MACHINE INTELLIGENCE, 2022, 4 (12) : 1165 - +
  • [44] BESRGAN: Boundary equilibrium face super-resolution generative adversarial networks
    Ren, Xinyi
    Hui, Qiang
    Zhao, Xingke
    Xiong, Jianping
    Yin, Jun
    IET IMAGE PROCESSING, 2023, 17 (06) : 1784 - 1796
  • [45] D-SRGAN: DEM Super-Resolution with Generative Adversarial Networks
    Demiray B.Z.
    Sit M.
    Demir I.
    SN Computer Science, 2021, 2 (1)
  • [46] Super-Resolution Reconstruction of Cell Images Based on Generative Adversarial Networks
    Pan, Bin
    Du, Yifeng
    Guo, Xiaoming
    IEEE ACCESS, 2024, 12 : 72252 - 72263
  • [47] Image Super-Resolution using a Improved Generative Adversarial Network
    Wang, Han
    Wu, Wei
    Su, Yang
    Duan, Yongsheng
    Wang, Pengze
    PROCEEDINGS OF 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2019), 2019, : 312 - 315
  • [48] Facial super-resolution reconstruction method based on generative adversarial networks
    Zhang, Xi
    Ku, Shao-Ping
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2025, 55 (01) : 333 - 338
  • [49] LPSRGAN: Generative adversarial networks for super-resolution of license plate image
    Pan, Yuecheng
    Tang, Jin
    Tjahjadi, Tardi
    NEUROCOMPUTING, 2024, 580
  • [50] Improving Image Super-Resolution Based on Multiscale Generative Adversarial Networks
    Yuan, Cao
    Deng, Kaidi
    Li, Chen
    Zhang, Xueting
    Li, Yaqin
    ENTROPY, 2022, 24 (08)