Learning Temporal Coherence via Self-Supervision for GAN-based Video Generation

被引：119

作者：

Chu, Mengyu ^{[1
]}

Xie, You ^{[1
]}

Mayer, Jonas ^{[1
]}

Leal-Taix, Laura ^{[1
]}

Thuerey, Nils ^{[1
]}

机构：

[1] Tech Univ Munich, Dept Comp Sci, Munich, Germany

来源：

ACM TRANSACTIONS ON GRAPHICS | 2020年 / 39卷 / 04期

关键词：

Generative adversarial network; temporal cycle-consistency; self-supervision; video super-resolution; unpaired video translation; MOTION;

D O I：

10.1145/3386569.3392457

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Our work explores temporal self-supervision for GAN-based video generation tasks. While adversarial training successfully yields generative models for a variety of areas, temporal relationships in the generated data are much less explored. Natural temporal changes are crucial for sequential generation tasks, e.g. video super-resolution and unpaired video translation. For the former, state-of-the-art methods often favor simpler norm losses such as L-2 over adversarial training. However, their averaging nature easily leads to temporally smooth results with an undesirable lack of spatial detail. For unpaired video translation, existing approaches modify the generator networks to form spatio-temporal cycle consistencies. In contrast, we focus on improving learning objectives and propose a temporally self-supervised algorithm. For both tasks, we show that temporal adversarial learning is key to achieving temporally coherent solutions without sacrificing spatial detail. We also propose a novel Ping-Pong loss to improve the long-term temporal consistency. It effectively prevents recurrent networks from accumulating artifacts temporally without depressing detailed features. Additionally, we propose a first set of metrics to quantitatively evaluate the accuracy as well as the perceptual quality of the temporal evolution. A series of user studies confirm the rankings computed with these metrics. Code, data, models, and results are provided at https://github.com/thunil/TecoGAN.

引用

页数：13

共 50 条

[41] Forecasting Fine-Grained Urban Flows Via Spatio-Temporal Contrastive Self-Supervision
Qu, Hao
Gong, Yongshun
Chen, Meng
Zhang, Junbo
Zheng, Yu
Yin, Yilong
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8008 - 8023
[42] Equivariant Spatio-temporal Self-supervision for LiDAR Object Detection
Hegde, Deepti
Lohit, Suhas
Peng, Kuan-Chuan
Jones, Michael J.
Patel, Vishal M.
COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 475 - 491
[43] Non-Prehensile Manipulation Learning through Self-Supervision
Gao, Ziyan
Elibol, Armagan
Chong, Nak Young
2020 FOURTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2020), 2020, : 93 - 99
[44] MetaDetector: Detecting Outliers by Learning to Learn from Self-supervision
Tan, Jeremy
Kart, Turkay
Hou, Benjamin
Batten, James
Kainz, Bernhard
BIOMEDICAL IMAGE REGISTRATION, DOMAIN GENERALISATION AND OUT-OF-DISTRIBUTION ANALYSIS, 2022, 13166 : 119 - 126
[45] Learning multi-view visual correspondences with self-supervision
Zhang, Pengcheng
Zhou, Lei
Bai, Xiao
Wang, Chen
Zhou, Jun
Zhang, Liang
Zheng, Jin
DISPLAYS, 2022, 72
[46] ContraCluster: Learning to Classify without Labels by Contrastive Self-Supervision and Prototype-Based Semi-Supervision
Joe, Seongho
Kim, Byoungjip
Kang, Hoyoung
Park, Kyoungwon
Kim, Bogun
Park, Jaeseon
Lee, Joonseok
Gwon, Youngjune
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4685 - 4692
[47] FedGL: Federated graph learning framework with global self-supervision
Chen, Chuan
Xu, Ziyue
Hu, Weibo
Zheng, Zibin
Zhang, Jie
INFORMATION SCIENCES, 2024, 657
[48] DoubleMatch: Improving Semi-Supervised Learning with Self-Supervision
Wallin, Erik
Svensson, Lennart
Kahl, Fredrik
Hammarstrand, Lars
2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2871 - 2877
[49] Learning Unsupervised Visual Grounding Through Semantic Self-Supervision
Javed, Syed Ashar
Saxena, Shreyas
Gandhi, Vineet
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 796 - 802
[50] A GAN-Based Video Intra Coding
Zhong, Guangyu
Wang, Jun
Hu, Jiyuan
Liang, Fan
ELECTRONICS, 2021, 10 (02) : 1 - 13

← 1 2 3 4 5 →