Frequency-Based Motion Representation for Video Generative Adversarial Networks

被引:0
|
作者
Hyun, Sangeek [1 ]
Lew, Jaihyun [2 ]
Chung, Jiwoo [1 ]
Kim, Euiyeon [1 ]
Heo, Jae-Pil [3 ]
机构
[1] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon 16419, South Korea
[2] Seoul Natl Univ, Interdisciplinary Program Artificial Intelligence, Seoul 08826, South Korea
[3] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon 16419, South Korea
关键词
Generative adversarial networks; video generation; sinusoidal motion representation; speed-level motion manipulation;
D O I
10.1109/TIP.2023.3293767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Videos contain motions of various speeds. For example, the motions of one's head and mouth differ in terms of speed - the head being relatively stable and the mouth moving rapidly as one speaks. Despite its diverse nature, previous video GANs generate video based on a single unified motion representation without considering the aspect of speed. In this paper, we propose a frequency-based motion representation for video GANs to realize the concept of speed in video generation process. In detail, we represent motions as continuous sinusoidal signals of various frequencies by introducing a coordinate-based motion generator. We show, in that case, frequency is highly related to the speed of motion. Based on this observation, we present frequency-aware weight modulation that enables manipulation of motions within a specific range of speed, which could not be achieved with the previous techniques. Extensive experiments validate that the proposed method outperforms state-of-the-art video GANs in terms of generation quality by its capability to model various speed of motions. Furthermore, we also show that our temporally continuous representation enables to further synthesize intermediate and future frames of generated videos.
引用
收藏
页码:3949 / 3963
页数:15
相关论文
共 50 条
  • [11] GENERATIVE ADVERSARIAL NETWORKS BASED ERROR CONCEALMENT FOR LOW RESOLUTION VIDEO
    Xiang, Chongyang
    Xu, Jiajun
    Yan, Chuan
    Peng, Qiang
    Wu, Xiao
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1827 - 1831
  • [12] Old Video Quality Enhancement Technology Based on Generative Adversarial Networks
    Su, Shao-Rui
    Hsia, Shih-Chang
    Yang, Shi-Kai
    2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 425 - 426
  • [13] Generative Adversarial Networks for Video-to-Video Domain Adaptation
    Chen, Jiawei
    Li, Yuexiang
    Ma, Kai
    Zheng, Yefeng
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3462 - 3469
  • [14] EDGAN: motion deblurring algorithm based on enhanced generative adversarial networks
    Yong Zhang
    Shao Yong Ma
    Xi Zhang
    Li Li
    Wai Hung Ip
    Kai Leung Yung
    The Journal of Supercomputing, 2020, 76 : 8922 - 8937
  • [15] EDGAN: motion deblurring algorithm based on enhanced generative adversarial networks
    Zhang, Yong
    Ma, Shao Yong
    Zhang, Xi
    Li, Li
    Ip, Wai Hung
    Yung, Kai Leung
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (11): : 8922 - 8937
  • [16] Sonar feature representation with autoencoders and generative adversarial networks
    Linhardt, Timothy
    Sen Gupta, Ananya
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03):
  • [17] Recursive Conditional Generative Adversarial Networks for Video Transformation
    Kim, San
    Suh, Doug Young
    IEEE ACCESS, 2019, 7 : 37807 - 37821
  • [18] Human Video Synthesis Using Generative Adversarial Networks
    Azeem, Abdullah
    Riaz, Waqar
    Siddique, Abubakar
    Saifullah
    Junaid, Tahir
    FIFTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2020, 11526
  • [19] WalkGAN: Network Representation Learning With Sequence-Based Generative Adversarial Networks
    Jin, Taisong
    Yang, Xixi
    Yu, Zhengtao
    Luo, Han
    Zhang, Yongmei
    Jie, Feiran
    Zeng, Xiangxiang
    Jiang, Min
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5684 - 5694
  • [20] No-reference omnidirectional video quality assessment based on generative adversarial networks
    Guo, Jiefeng
    Luo, Yao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 27531 - 27552