Frequency-Based Motion Representation for Video Generative Adversarial Networks

被引:0
|
作者
Hyun, Sangeek [1 ]
Lew, Jaihyun [2 ]
Chung, Jiwoo [1 ]
Kim, Euiyeon [1 ]
Heo, Jae-Pil [3 ]
机构
[1] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon 16419, South Korea
[2] Seoul Natl Univ, Interdisciplinary Program Artificial Intelligence, Seoul 08826, South Korea
[3] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon 16419, South Korea
关键词
Generative adversarial networks; video generation; sinusoidal motion representation; speed-level motion manipulation;
D O I
10.1109/TIP.2023.3293767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Videos contain motions of various speeds. For example, the motions of one's head and mouth differ in terms of speed - the head being relatively stable and the mouth moving rapidly as one speaks. Despite its diverse nature, previous video GANs generate video based on a single unified motion representation without considering the aspect of speed. In this paper, we propose a frequency-based motion representation for video GANs to realize the concept of speed in video generation process. In detail, we represent motions as continuous sinusoidal signals of various frequencies by introducing a coordinate-based motion generator. We show, in that case, frequency is highly related to the speed of motion. Based on this observation, we present frequency-aware weight modulation that enables manipulation of motions within a specific range of speed, which could not be achieved with the previous techniques. Extensive experiments validate that the proposed method outperforms state-of-the-art video GANs in terms of generation quality by its capability to model various speed of motions. Furthermore, we also show that our temporally continuous representation enables to further synthesize intermediate and future frames of generated videos.
引用
收藏
页码:3949 / 3963
页数:15
相关论文
共 50 条
  • [31] Spatial Frequency Bias in Convolutional Generative Adversarial Networks
    Khayatkhoei, Mahyar
    Elgammal, Ahmed
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7152 - 7159
  • [32] Information-Based Boundary Equilibrium Generative Adversarial Networks with Interpretable Representation Learning
    Hah, Junghoon
    Lee, Woojin
    Lee, Jaewook
    Park, Saerom
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [33] Recurrent generative adversarial networks for unsupervised WCE video summarization
    Lan, Libin
    Ye, Chunxiao
    KNOWLEDGE-BASED SYSTEMS, 2021, 222
  • [34] Generative Adversarial Networks for Stochastic Video Prediction With Action Control
    Hu, Zhihang
    Turki, Turki
    Wang, Jason T. L.
    IEEE ACCESS, 2020, 8 (08): : 63336 - 63348
  • [35] Unsupervised Video Summarization with Attentive Conditional Generative Adversarial Networks
    He, Xufeng
    Hua, Yang
    Song, Tao
    Zhang, Zongpu
    Xue, Zhengui
    Ma, Ruhui
    Robertson, Neil
    Guan, Haibing
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2296 - 2304
  • [36] Exploiting Images for Video Recognition with Hierarchical Generative Adversarial Networks
    Yu, Feiwu
    Wu, Xinxiao
    Sun, Yuchao
    Duan, Lixin
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 1107 - 1113
  • [37] Generative Adversarial Networks for Image and Video Synthesis: Algorithms and Applications
    Liu, Ming-Yu
    Huang, Xun
    Yu, Jiahui
    Wang, Ting-Chun
    Mallya, Arun
    PROCEEDINGS OF THE IEEE, 2021, 109 (05) : 839 - 862
  • [38] Efficient Video Frame Interpolation Using Generative Adversarial Networks
    Tran, Quang Nhat
    Yang, Shih-Hsuan
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [39] Temporal-Spatial Generative Adversarial Networks for Video Inpainting
    Yu B.
    Ding Y.
    Xie Z.
    Huang D.
    Ma L.
    Xie, Zhifeng (zhifeng_xie@shu.edu.cn), 1600, Institute of Computing Technology (32): : 769 - 779
  • [40] Convolutional Transformer based Dual Discriminator Generative Adversarial Networks for Video Anomaly Detection
    Feng, Xinyang
    Song, Dongjin
    Chen, Yuncong
    Chen, Zhengzhang
    Ni, Jingchao
    Chen, Haifeng
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5546 - 5554