Frequency-Based Motion Representation for Video Generative Adversarial Networks

Cited by: 0

Authors:

Hyun, Sangeek [1]

Lew, Jaihyun [2]

Chung, Jiwoo [1]

Kim, Euiyeon [1]

Heo, Jae-Pil [3]

Affiliations:

[1] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon 16419, South Korea

[2] Seoul Natl Univ, Interdisciplinary Program Artificial Intelligence, Seoul 08826, South Korea

[3] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon 16419, South Korea

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023 / Vol. 32

Keywords:

Generative adversarial networks; video generation; sinusoidal motion representation; speed-level motion manipulation;

DOI:

10.1109/TIP.2023.3293767

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Videos contain motions of various speeds. For example, the motions of one's head and mouth differ in speed: the head is relatively stable, while the mouth moves rapidly as one speaks. Despite this diversity, previous video GANs generate video from a single unified motion representation without considering speed. In this paper, we propose a frequency-based motion representation for video GANs to realize the concept of speed in the video generation process. Specifically, we represent motions as continuous sinusoidal signals of various frequencies by introducing a coordinate-based motion generator. We show that, in this representation, frequency is closely related to the speed of motion. Based on this observation, we present frequency-aware weight modulation, which enables manipulation of motions within a specific range of speeds, something previous techniques could not achieve. Extensive experiments validate that the proposed method outperforms state-of-the-art video GANs in generation quality, owing to its capability to model motions of various speeds. Furthermore, we show that our temporally continuous representation also enables the synthesis of intermediate and future frames of generated videos.
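The link between frequency and speed described in the abstract can be illustrated with a small sketch. This is not the authors' motion generator: the function names, the choice of frequencies, and the fixed zero phases are all assumptions made for illustration. It only shows that a sinusoid evaluated at a continuous time coordinate changes faster per step when its frequency is higher, and that such a representation can be queried at intermediate or future times.

```python
import math

def sinusoidal_motion_features(t, frequencies, phases=None):
    """Evaluate sin(2*pi*f*t + phase) for each frequency f at continuous time t.

    Hypothetical helper: the paper's actual generator is learned and is not
    reproduced here.
    """
    if phases is None:
        phases = [0.0] * len(frequencies)
    return [math.sin(2.0 * math.pi * f * t + p)
            for f, p in zip(frequencies, phases)]

def mean_abs_change(frequency, num_steps=64, duration=1.0):
    """Average absolute per-step change of one sinusoidal component."""
    times = [duration * i / num_steps for i in range(num_steps + 1)]
    values = [math.sin(2.0 * math.pi * frequency * t) for t in times]
    return sum(abs(b - a) for a, b in zip(values, values[1:])) / num_steps

# Assumed frequencies, in cycles per clip (illustrative values only).
frequencies = [0.5, 2.0, 8.0]

# Higher frequency -> larger average change between consecutive time steps.
changes = [mean_abs_change(f) for f in frequencies]

# Because time is continuous, the same function can be evaluated between
# frames (t = 0.125) or past the end of the clip (t = 1.5).
intermediate = sinusoidal_motion_features(0.125, frequencies)
future = sinusoidal_motion_features(1.5, frequencies)

print(changes)
```

In this toy setting, the low-frequency component plays the role of slow motion (such as the head in the abstract's example) and the high-frequency component the role of fast motion (such as the mouth).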

Pages: 3949-3963

Page count: 15
