TOWARDS AUDIO TO SCENE IMAGE SYNTHESIS USING GENERATIVE ADVERSARIAL NETWORK

被引:0
|
作者
Wan, Chia-Hung [1 ]
Chuang, Shun-Po [2 ]
Lee, Hung-Yi [2 ]
机构
[1] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei, Taiwan
[2] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
关键词
conditional GANs; audio-visual; cross-modal generation;
D O I
10.1109/icassp.2019.8682383
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Humans can imagine a scene from a sound. We want machines to do so by using conditional generative adversarial networks (GANs). By applying the techniques including spectral norm, projection discriminator and auxiliary classifier, compared with naive conditional GAN, the model can generate images with better quality in terms of both subjective and objective evaluations. Almost three-fourth of people agree that our model have the ability to generate images related to sounds. By inputting different volumes of the same sound, our model output different scales of changes based on the volumes, showing that our model truly knows the relationship between sounds and images to some extent.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 50 条
  • [1] Background and foreground disentangled generative adversarial network for scene image synthesis
    Ni, Jiancheng
    Zhang, Susu
    Zhou, Zili
    Hou, Lijun
    Hou, Jie
    Gao, Feng
    COMPUTERS & GRAPHICS-UK, 2021, 97 : 54 - 66
  • [2] Improved Generative Adversarial Network for Image Scene Transformation
    面向图像场景转换的改进型生成对抗网络
    Xiao, Jin-Sheng (xiaojs@whu.edu.cn), 1600, Chinese Academy of Sciences (32): : 2755 - 2768
  • [3] Generative adversarial network for image deblurring using generative adversarial constraint loss
    Ji, Y.
    Dai, Y.
    Zhao, K.
    Li, S.
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 1180 - 1187
  • [4] Image Denoising Using A Generative Adversarial Network
    Alsaiari, Abeer
    Rustagi, Ridhi
    Alhakamy, A'eshah
    Thomas, Manu Mathew
    Forbes, Angus G.
    2019 IEEE 2ND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT), 2019, : 126 - 132
  • [5] Image Synthesis with a Convolutional Capsule Generative Adversarial Network
    Bass, Cher
    Dai, Tianhong
    Billot, Benjamin
    Arulkumaran, Kai
    Creswell, Antonia
    Clopath, Claudia
    De Paola, Vincenzo
    Bharath, Anil Anthony
    INTERNATIONAL CONFERENCE ON MEDICAL IMAGING WITH DEEP LEARNING, VOL 102, 2019, 102 : 39 - 62
  • [6] TEXT TO IMAGE SYNTHESIS WITH BIDIRECTIONAL GENERATIVE ADVERSARIAL NETWORK
    Wang, Zixu
    Quan, Zhe
    Wang, Zhi-Jie
    Hu, Xinjian
    Chen, Yangyang
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [7] Multimodal Fusion Generative Adversarial Network for Image Synthesis
    Zhao, Liang
    Hu, Qinghao
    Li, Xiaoyuan
    Zhao, Jingyuan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 1865 - 1869
  • [8] Towards Unsupervised Deep Image Enhancement With Generative Adversarial Network
    Ni, Zhangkai
    Yang, Wenhan
    Wang, Shiqi
    Ma, Lin
    Kwong, Sam
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 9140 - 9151
  • [9] A Scene Images Diversity Improvement Generative Adversarial Network for Remote Sensing Image Scene Classification
    Pan, Xin
    Zhao, Jian
    Xu, Jun
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (10) : 1692 - 1696
  • [10] Comparing Representations for Audio Synthesis Using Generative Adversarial Networks
    Nistal, Javier
    Lattner, Stefan
    Richard, Gael
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 161 - 165