Frequency-Based Motion Representation for Video Generative Adversarial Networks

被引：0

作者：

Hyun, Sangeek ^{[1
]}

Lew, Jaihyun ^{[2
]}

Chung, Jiwoo ^{[1
]}

Kim, Euiyeon ^{[1
]}

Heo, Jae-Pil ^{[3
]}

机构：

[1] Sungkyunkwan Univ, Dept Artificial Intelligence, Suwon 16419, South Korea

[2] Seoul Natl Univ, Interdisciplinary Program Artificial Intelligence, Seoul 08826, South Korea

[3] Sungkyunkwan Univ, Dept Comp Sci & Engn, Suwon 16419, South Korea

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023年 / 32卷

关键词：

Generative adversarial networks; video generation; sinusoidal motion representation; speed-level motion manipulation;

D O I：

10.1109/TIP.2023.3293767

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Videos contain motions of various speeds. For example, the motions of one's head and mouth differ in terms of speed - the head being relatively stable and the mouth moving rapidly as one speaks. Despite its diverse nature, previous video GANs generate video based on a single unified motion representation without considering the aspect of speed. In this paper, we propose a frequency-based motion representation for video GANs to realize the concept of speed in video generation process. In detail, we represent motions as continuous sinusoidal signals of various frequencies by introducing a coordinate-based motion generator. We show, in that case, frequency is highly related to the speed of motion. Based on this observation, we present frequency-aware weight modulation that enables manipulation of motions within a specific range of speed, which could not be achieved with the previous techniques. Extensive experiments validate that the proposed method outperforms state-of-the-art video GANs in terms of generation quality by its capability to model various speed of motions. Furthermore, we also show that our temporally continuous representation enables to further synthesize intermediate and future frames of generated videos.

引用

页码：3949 / 3963

页数：15

共 50 条

[11] GENERATIVE ADVERSARIAL NETWORKS BASED ERROR CONCEALMENT FOR LOW RESOLUTION VIDEO
Xiang, Chongyang
Xu, Jiajun
Yan, Chuan
Peng, Qiang
Wu, Xiao
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1827 - 1831
[12] Old Video Quality Enhancement Technology Based on Generative Adversarial Networks
Su, Shao-Rui
Hsia, Shih-Chang
Yang, Shi-Kai
2024 11TH INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN, ICCE-TAIWAN 2024, 2024, : 425 - 426
[13] Generative Adversarial Networks for Video-to-Video Domain Adaptation
Chen, Jiawei
Li, Yuexiang
Ma, Kai
Zheng, Yefeng
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 3462 - 3469
[14] EDGAN: motion deblurring algorithm based on enhanced generative adversarial networks
Yong Zhang
Shao Yong Ma
Xi Zhang
Li Li
Wai Hung Ip
Kai Leung Yung
The Journal of Supercomputing, 2020, 76 : 8922 - 8937
[15] EDGAN: motion deblurring algorithm based on enhanced generative adversarial networks
Zhang, Yong
Ma, Shao Yong
Zhang, Xi
Li, Li
Ip, Wai Hung
Yung, Kai Leung
JOURNAL OF SUPERCOMPUTING, 2020, 76 (11): : 8922 - 8937
[16] Sonar feature representation with autoencoders and generative adversarial networks
Linhardt, Timothy
Sen Gupta, Ananya
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 153 (03):
[17] Recursive Conditional Generative Adversarial Networks for Video Transformation
Kim, San
Suh, Doug Young
IEEE ACCESS, 2019, 7 : 37807 - 37821
[18] Human Video Synthesis Using Generative Adversarial Networks
Azeem, Abdullah
Riaz, Waqar
Siddique, Abubakar
Saifullah
Junaid, Tahir
FIFTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2020, 11526
[19] WalkGAN: Network Representation Learning With Sequence-Based Generative Adversarial Networks
Jin, Taisong
Yang, Xixi
Yu, Zhengtao
Luo, Han
Zhang, Yongmei
Jie, Feiran
Zeng, Xiangxiang
Jiang, Min
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5684 - 5694
[20] No-reference omnidirectional video quality assessment based on generative adversarial networks
Guo, Jiefeng
Luo, Yao
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (18) : 27531 - 27552

← 1 2 3 4 5 →