TOWARDS AUDIO TO SCENE IMAGE SYNTHESIS USING GENERATIVE ADVERSARIAL NETWORK

被引:0
|
作者
Wan, Chia-Hung [1 ]
Chuang, Shun-Po [2 ]
Lee, Hung-Yi [2 ]
机构
[1] Natl Taiwan Univ, Grad Inst Elect Engn, Taipei, Taiwan
[2] Natl Taiwan Univ, Grad Inst Commun Engn, Taipei, Taiwan
关键词
conditional GANs; audio-visual; cross-modal generation;
D O I
10.1109/icassp.2019.8682383
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Humans can imagine a scene from a sound. We want machines to do so by using conditional generative adversarial networks (GANs). By applying the techniques including spectral norm, projection discriminator and auxiliary classifier, compared with naive conditional GAN, the model can generate images with better quality in terms of both subjective and objective evaluations. Almost three-fourth of people agree that our model have the ability to generate images related to sounds. By inputting different volumes of the same sound, our model output different scales of changes based on the volumes, showing that our model truly knows the relationship between sounds and images to some extent.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 50 条
  • [41] Underwater image enhancement using improved generative adversarial network
    Zhang, Tingting
    Li, Yujie
    Takahashi, Shinya
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2021, 33 (22):
  • [42] Unpaired medical image colorization using generative adversarial network
    Yihuai Liang
    Dongho Lee
    Yan Li
    Byeong-Seok Shin
    Multimedia Tools and Applications, 2022, 81 : 26669 - 26683
  • [43] Anti-forensics of fake stereo audio using generative adversarial network
    Tianyun Liu
    Diqun Yan
    Nan Yan
    Gang Chen
    Multimedia Tools and Applications, 2022, 81 : 17155 - 17167
  • [44] Anti-forensics of fake stereo audio using generative adversarial network
    Liu, Tianyun
    Yan, Diqun
    Yan, Nan
    Chen, Gang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (12) : 17155 - 17167
  • [45] Anti-Forensics of Audio Source Identification Using Generative Adversarial Network
    Li, Xiaowen
    Yan, Diqun
    Dong, Li
    Wang, Rangding
    IEEE ACCESS, 2019, 7 : 184332 - 184339
  • [46] Towards Imbalanced Image Classification: A Generative Adversarial Network Ensemble Learning Method
    Huang, Yangru
    Jin, Yi
    Li, Yidong
    Lin, Zhiping
    IEEE ACCESS, 2020, 8 : 88399 - 88409
  • [47] Label-Guided Generative Adversarial Network for Realistic Image Synthesis
    Zhu, Junchen
    Gao, Lianli
    Song, Jingkuan
    Li, Yuan-Fang
    Zheng, Feng
    Li, Xuelong
    Shen, Heng Tao
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 3311 - 3328
  • [48] LEGAN: A Light and Effective Generative Adversarial Network for medical image synthesis
    Gao, Jing
    Zhao, Wenhan
    Li, Peng
    Huang, Wei
    Chen, Zhikui
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 148
  • [49] Semantic Image Synthesis via Location Aware Generative Adversarial Network
    Xu, Jiawei
    Liu, Rui
    Dong, Jing
    Yi, Pengfei
    Fan, Wanshu
    Zhou, Dongsheng
    2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN, 2022, : 791 - 796
  • [50] Instance Map Based Image Synthesis With a Denoising Generative Adversarial Network
    Zheng, Ziqiang
    Wang, Chao
    Yu, Zhibin
    Zheng, Haiyong
    Zheng, Bing
    IEEE ACCESS, 2018, 6 : 33654 - 33665