Generation and Analysis of Vocal Spectrograms: Combining Generative Adversarial Networks

Cited by: 0
Author
Yang, Zhe [1]
Affiliation
[1] Weifang Engn Vocat Coll, Weifang 262500, Shandong, Peoples R China
Keywords
Generative Adversarial Network; Vocal Music Spectrum Map; Speech Enhancement; Deep Learning
DOI
10.1145/3662739.3672183
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A vocal spectrogram is a frequency-domain representation of sound and has important applications in fields such as music and speech. A generative adversarial network (GAN) can be used to generate realistic vocal spectrograms, and the generated spectrograms can then be analyzed. This article introduces the basic principles and structure of GANs, including the design of the generator and discriminator networks; discusses data preparation and the definition of the loss function for vocal spectrogram generation; and describes in detail the steps of training a GAN, including alternating training of the generator and discriminator so that the generated vocal spectrograms become more realistic. It then introduces techniques and algorithms for analyzing the generated spectrograms and examines evaluation metrics for the generated spectrograms and the analysis results. The vocal spectrograms generated by the GAN perform well, with the highest clarity reaching 92%. The generation and analysis of vocal spectrograms can play an increasingly important role in audio processing and acoustic research and can bring new breakthroughs to the development of audio technology.
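The abstract's opening definition can be illustrated with a minimal sketch of a magnitude spectrogram computed by a short-time Fourier transform. This is not the paper's pipeline: the function name `magnitude_spectrogram`, the frame length, the hop size, the sample rate, and the synthetic 440 Hz tone standing in for a vocal recording are all assumptions chosen for illustration.

```python
import numpy as np

def magnitude_spectrogram(signal, frame_len=256, hop=128):
    """Hann-windowed STFT magnitudes, shape (n_frames, frame_len // 2 + 1)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping frames and taper each one.
    frames = np.stack([
        signal[i * hop : i * hop + frame_len] * window
        for i in range(n_frames)
    ])
    # Real FFT along each frame gives the non-negative frequency bins.
    return np.abs(np.fft.rfft(frames, axis=1))

# Assumed stand-in for a sung note: one second of a 440 Hz tone at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 440.0 * t)

spec = magnitude_spectrogram(tone)
# Strongest frequency bin, averaged over time, converted to Hz.
peak_bin = int(spec.mean(axis=0).argmax())
peak_hz = peak_bin * sample_rate / 256
print(spec.shape, peak_hz)
```

Each row of `spec` is one time frame and each column one frequency bin, so an array of this kind is the sort of image-like input a GAN generator or discriminator would operate on.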
Pages: 534-539
Page count: 6