Generation and Analysis of Vocal Spectrograms: Combining Generative Adversarial Networks

被引:0
|
作者
Yang, Zhe [1 ]
机构
[1] Weifang Engn Vocat Coll, Weifang 262500, Shandong, Peoples R China
关键词
Generative Adversarial Network; Vocal Music Spectrum Map; Speech Enhancement; Deep Learning;
D O I
10.1145/3662739.3672183
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vocal spectrogram is a representation of sound in the frequency domain and has important application value in fields such as music and speech. Through generative an adversarial network (GAN), realistic vocal spectrograms can be generated or analyzed using the generated spectrograms. This article introduces the basic principles and structure of GAN, including the design of generator and discriminator networks, discusses the data preparation and definition of loss function in vocal spectrogram generation, and describes in detail the steps of training GAN, including alternating training of generator and discriminator to generate more realistic vocal spectrograms. After generating vocal spectrograms, it further introduces how to use corresponding technologies and algorithms to analyze the generated spectrograms, and studies the evaluation indicators for the generated vocal spectrograms or analysis results. The vocal spectrogram generated by generative adversarial networks has high performance, with the highest clarity reaching 92%. The generation and analysis of vocal spectrograms can play an increasingly important role in audio processing and acoustic research, and bring new breakthroughs to the development of audio technology.
引用
收藏
页码:534 / 539
页数:6
相关论文
共 50 条
  • [1] Optimized Generative Adversarial Networks for Adversarial Sample Generation
    Alghazzawi, Daniyal M.
    Hasan, Syed Hamid
    Bhatia, Surbhi
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3877 - 3897
  • [2] An analysis of DOOM level generation using Generative Adversarial Networks
    Giacomello, Edoardo
    Lanzi, Pier Luca
    Loiacono, Daniele
    ENTERTAINMENT COMPUTING, 2023, 46
  • [3] Unsupervised Learning Using Generative Adversarial Networks on micro-Doppler spectrograms
    Donerty, H. Garcia
    Cifola, L.
    Harmanny, R. I. A.
    Fioranelli, F.
    2019 16TH EUROPEAN RADAR CONFERENCE (EURAD), 2019, : 197 - 200
  • [4] Geophysical model generation with generative adversarial networks
    Puzyrev, Vladimir
    Salles, Tristan
    Surma, Greg
    Elders, Chris
    GEOSCIENCE LETTERS, 2022, 9 (01)
  • [5] A review on Generative Adversarial Networks for image generation
    de Souza, Vinicius Luis Trevisan
    Marques, Bruno Augusto Dorta
    Batagelo, Harlen Costa
    Gois, Joao Paulo
    COMPUTERS & GRAPHICS-UK, 2023, 114 : 13 - 25
  • [6] Review of Generative Adversarial Networks in Image Generation
    Chi, Wanle
    Choo, Yun Huoy
    Goh, Ong Sing
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2022, 26 (01) : 3 - 7
  • [7] Icon Generation Based on Generative Adversarial Networks
    Yang, Hongyi
    Xue, Chengqi
    Yang, Xiaoying
    Yang, Han
    APPLIED SCIENCES-BASEL, 2021, 11 (17):
  • [8] Emotional Dialogue Generation with Generative Adversarial Networks
    Li, Yun
    Wu, Bin
    PROCEEDINGS OF 2020 IEEE 4TH INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2020), 2020, : 868 - 873
  • [9] Game Character Generation with Generative Adversarial Networks
    Emekligil, Ferda Gul Aydin
    Oksuz, Ilkay
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [10] Question Generation via Generative Adversarial Networks
    Liu, Dong
    Hong, Yu
    Yao, Jianmin
    Zhou, Guodong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,