Generation and Analysis of Vocal Spectrograms: Combining Generative Adversarial Networks

Authors
Yang, Zhe [1]
Affiliations
[1] Weifang Engn Vocat Coll, Weifang 262500, Shandong, Peoples R China
Keywords
Generative Adversarial Network; Vocal Music Spectrum Map; Speech Enhancement; Deep Learning
DOI
10.1145/3662739.3672183
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A vocal spectrogram is a representation of sound in the frequency domain and has important applications in fields such as music and speech. With a generative adversarial network (GAN), realistic vocal spectrograms can be generated, and the generated spectrograms can then be used for analysis. This article introduces the basic principles and structure of GANs, including the design of the generator and discriminator networks; discusses data preparation and the definition of the loss function for vocal spectrogram generation; and describes in detail the steps of training a GAN, in which the generator and discriminator are trained alternately so that the generated vocal spectrograms become more realistic. It then introduces techniques and algorithms for analyzing the generated spectrograms, and studies evaluation metrics for the generated vocal spectrograms and the analysis results. The vocal spectrograms generated by the GAN perform well, with clarity reaching as high as 92%. The generation and analysis of vocal spectrograms can play an increasingly important role in audio processing and acoustic research, and can bring new breakthroughs to the development of audio technology.
Pages: 534-539
Page count: 6