TOWARDS AUDIO TO SCENE IMAGE SYNTHESIS USING GENERATIVE ADVERSARIAL NETWORK

被引:0
|
作者
Wan, Chia-Hung [1 ]
Chuang, Shun-Po [2 ]
Lee, Hung-Yi [2 ]
机构
[1] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei, Taiwan
[2] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
关键词
conditional GANs; audio-visual; cross-modal generation;
D O I
10.1109/icassp.2019.8682383
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Humans can imagine a scene from a sound. We want machines to do so by using conditional generative adversarial networks (GANs). By applying the techniques including spectral norm, projection discriminator and auxiliary classifier, compared with naive conditional GAN, the model can generate images with better quality in terms of both subjective and objective evaluations. Almost three-fourth of people agree that our model have the ability to generate images related to sounds. By inputting different volumes of the same sound, our model output different scales of changes based on the volumes, showing that our model truly knows the relationship between sounds and images to some extent.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 50 条
  • [31] Image Inpainting Using Wasserstein Generative Adversarial Imputation Network
    Vasata, Daniel
    Halama, Tomas
    Friedjungova, Magda
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 575 - 586
  • [32] IMAGE REFLECTION REMOVAL USING THE WASSERSTEIN GENERATIVE ADVERSARIAL NETWORK
    Li, Tingtian
    Lun, Daniel P. K.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7695 - 7699
  • [33] Perceptual image quality using dual generative adversarial network
    Masoumeh Zareapoor
    Huiyu Zhou
    Jie Yang
    Neural Computing and Applications, 2020, 32 : 14521 - 14531
  • [34] Unpaired medical image colorization using generative adversarial network
    Liang, Yihuai
    Lee, Dongho
    Li, Yan
    Shin, Byeong-Seok
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (19) : 26669 - 26683
  • [35] Underwater Image Deblurring Framework Using A Generative Adversarial Network
    Li, Tengyue
    Rong, Shenghui
    He, Bo
    Chen, Long
    OCEANS 2022, 2022,
  • [36] Digital radiography image denoising using a generative adversarial network
    Sun, Yuewen
    Liu, Ximing
    Cong, Peng
    Li, Litao
    Zhao, Zhongwei
    JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2018, 26 (04) : 523 - 534
  • [37] Underwater image enhancement using a mixed generative adversarial network
    Mu, Delang
    Li, Heng
    Liu, Hui
    Dong, Ling
    Zhang, Guoyin
    IET IMAGE PROCESSING, 2023, 17 (04) : 1149 - 1160
  • [38] Realistic Sonar Image Simulation Using Generative Adversarial Network
    Sung, Minsung
    Kim, Jason
    Kim, Juhwan
    Yu, Son-Cheol
    IFAC PAPERSONLINE, 2019, 52 (21): : 291 - 296
  • [39] Single Image Haze Removal using a Generative Adversarial Network
    Raj, Bharath N.
    Venketeswaran, N.
    2020 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS SIGNAL PROCESSING AND NETWORKING (WISPNET), 2020, : 37 - 42
  • [40] Image Generation Using Different Models Of Generative Adversarial Network
    Al-qerem, Ahmad
    Alsalman, Yasmeen Shaher
    Mansour, Khalid
    2019 INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2019, : 241 - 245