Generation and Analysis of Vocal Spectrograms: Combining Generative Adversarial Networks

被引:0
|
作者
Yang, Zhe [1 ]
机构
[1] Weifang Engn Vocat Coll, Weifang 262500, Shandong, Peoples R China
关键词
Generative Adversarial Network; Vocal Music Spectrum Map; Speech Enhancement; Deep Learning;
D O I
10.1145/3662739.3672183
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Vocal spectrogram is a representation of sound in the frequency domain and has important application value in fields such as music and speech. Through generative an adversarial network (GAN), realistic vocal spectrograms can be generated or analyzed using the generated spectrograms. This article introduces the basic principles and structure of GAN, including the design of generator and discriminator networks, discusses the data preparation and definition of loss function in vocal spectrogram generation, and describes in detail the steps of training GAN, including alternating training of generator and discriminator to generate more realistic vocal spectrograms. After generating vocal spectrograms, it further introduces how to use corresponding technologies and algorithms to analyze the generated spectrograms, and studies the evaluation indicators for the generated vocal spectrograms or analysis results. The vocal spectrogram generated by generative adversarial networks has high performance, with the highest clarity reaching 92%. The generation and analysis of vocal spectrograms can play an increasingly important role in audio processing and acoustic research, and bring new breakthroughs to the development of audio technology.
引用
收藏
页码:534 / 539
页数:6
相关论文
共 50 条
  • [31] Procedural Generation of Roads with Conditional Generative Adversarial Networks
    Kelvin, Lin Ziwen
    Bhojan, Anand
    2020 IEEE SIXTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM 2020), 2020, : 277 - 281
  • [32] Generation of Synthetic Data with Conditional Generative Adversarial Networks
    Vega-Marquez, Belen
    Rubio-Escudero, Cristina
    Nepomuceno-Chamorro, Isabel
    LOGIC JOURNAL OF THE IGPL, 2022, 30 (02) : 252 - 262
  • [33] Image generation and classification via generative adversarial networks
    Mirabedini, Shirin
    Dastgerdi, Shadi Hejareh
    Kangavari, Mohammadreza
    AhmadiPanah, Mandi
    BIOSCIENCE RESEARCH, 2020, 17 (02): : 1356 - 1363
  • [34] Synthetic Traffic Generation with Wasserstein Generative Adversarial Networks
    Wu, Chao-Lun
    Chen, Yu-Ying
    Chou, Po-Yu
    Wang, Chih-Yu
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1503 - 1508
  • [35] Generative Adversarial Network-based Postfilter for STFT Spectrograms
    Kaneko, Takuhiro
    Takaki, Shinji
    Kameoka, Hirokazu
    Yamagishi, Junichi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3389 - 3393
  • [36] Combining generative adversarial networks and agricultural transfer learning for weeds identification
    Espejo-Garcia, Borja
    Mylonas, Nikos
    Athanasakos, Loukas
    Vali, Eleanna
    Fountas, Spyros
    BIOSYSTEMS ENGINEERING, 2021, 204 : 79 - 89
  • [37] Combining Residual Attention Mechanisms and Generative Adversarial Networks for Hippocampus Segmentation
    Hongxia Deng
    Yuefang Zhang
    Ran Li
    Chunxiang Hu
    Zijian Feng
    Haifang Li
    Tsinghua Science and Technology, 2022, 27 (01) : 68 - 78
  • [38] Combining Residual Attention Mechanisms and Generative Adversarial Networks for Hippocampus Segmentation
    Deng, Hongxia
    Zhang, Yuefang
    Li, Ran
    Hu, Chunxiang
    Feng, Zijian
    Li, Haifang
    TSINGHUA SCIENCE AND TECHNOLOGY, 2022, 27 (01) : 68 - 78
  • [39] A Bibliometric Analysis of Papers on Generative Adversarial Networks
    Jiao, Fangyu
    Yu, Bei
    Chen, Lang
    Chen, Dunkui
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 434 - 439
  • [40] Generative adversarial networks in EEG analysis: an overview
    Habashi, Ahmed G.
    Azab, Ahmed M.
    Eldawlatly, Seif
    Aly, Gamal M.
    JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2023, 20 (01)