Learning Temporal Coherence via Self-Supervision for GAN-based Video Generation

被引:119
|
作者
Chu, Mengyu [1 ]
Xie, You [1 ]
Mayer, Jonas [1 ]
Leal-Taix, Laura [1 ]
Thuerey, Nils [1 ]
机构
[1] Tech Univ Munich, Dept Comp Sci, Munich, Germany
来源
ACM TRANSACTIONS ON GRAPHICS | 2020年 / 39卷 / 04期
关键词
Generative adversarial network; temporal cycle-consistency; self-supervision; video super-resolution; unpaired video translation; MOTION;
D O I
10.1145/3386569.3392457
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Our work explores temporal self-supervision for GAN-based video generation tasks. While adversarial training successfully yields generative models for a variety of areas, temporal relationships in the generated data are much less explored. Natural temporal changes are crucial for sequential generation tasks, e.g. video super-resolution and unpaired video translation. For the former, state-of-the-art methods often favor simpler norm losses such as L-2 over adversarial training. However, their averaging nature easily leads to temporally smooth results with an undesirable lack of spatial detail. For unpaired video translation, existing approaches modify the generator networks to form spatio-temporal cycle consistencies. In contrast, we focus on improving learning objectives and propose a temporally self-supervised algorithm. For both tasks, we show that temporal adversarial learning is key to achieving temporally coherent solutions without sacrificing spatial detail. We also propose a novel Ping-Pong loss to improve the long-term temporal consistency. It effectively prevents recurrent networks from accumulating artifacts temporally without depressing detailed features. Additionally, we propose a first set of metrics to quantitatively evaluate the accuracy as well as the perceptual quality of the temporal evolution. A series of user studies confirm the rankings computed with these metrics. Code, data, models, and results are provided at https://github.com/thunil/TecoGAN.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Forecasting Fine-Grained Urban Flows Via Spatio-Temporal Contrastive Self-Supervision
    Qu, Hao
    Gong, Yongshun
    Chen, Meng
    Zhang, Junbo
    Zheng, Yu
    Yin, Yilong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 8008 - 8023
  • [42] Equivariant Spatio-temporal Self-supervision for LiDAR Object Detection
    Hegde, Deepti
    Lohit, Suhas
    Peng, Kuan-Chuan
    Jones, Michael J.
    Patel, Vishal M.
    COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 475 - 491
  • [43] Non-Prehensile Manipulation Learning through Self-Supervision
    Gao, Ziyan
    Elibol, Armagan
    Chong, Nak Young
    2020 FOURTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2020), 2020, : 93 - 99
  • [44] MetaDetector: Detecting Outliers by Learning to Learn from Self-supervision
    Tan, Jeremy
    Kart, Turkay
    Hou, Benjamin
    Batten, James
    Kainz, Bernhard
    BIOMEDICAL IMAGE REGISTRATION, DOMAIN GENERALISATION AND OUT-OF-DISTRIBUTION ANALYSIS, 2022, 13166 : 119 - 126
  • [45] Learning multi-view visual correspondences with self-supervision
    Zhang, Pengcheng
    Zhou, Lei
    Bai, Xiao
    Wang, Chen
    Zhou, Jun
    Zhang, Liang
    Zheng, Jin
    DISPLAYS, 2022, 72
  • [46] ContraCluster: Learning to Classify without Labels by Contrastive Self-Supervision and Prototype-Based Semi-Supervision
    Joe, Seongho
    Kim, Byoungjip
    Kang, Hoyoung
    Park, Kyoungwon
    Kim, Bogun
    Park, Jaeseon
    Lee, Joonseok
    Gwon, Youngjune
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4685 - 4692
  • [47] FedGL: Federated graph learning framework with global self-supervision
    Chen, Chuan
    Xu, Ziyue
    Hu, Weibo
    Zheng, Zibin
    Zhang, Jie
    INFORMATION SCIENCES, 2024, 657
  • [48] DoubleMatch: Improving Semi-Supervised Learning with Self-Supervision
    Wallin, Erik
    Svensson, Lennart
    Kahl, Fredrik
    Hammarstrand, Lars
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 2871 - 2877
  • [49] Learning Unsupervised Visual Grounding Through Semantic Self-Supervision
    Javed, Syed Ashar
    Saxena, Shreyas
    Gandhi, Vineet
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 796 - 802
  • [50] A GAN-Based Video Intra Coding
    Zhong, Guangyu
    Wang, Jun
    Hu, Jiyuan
    Liang, Fan
    ELECTRONICS, 2021, 10 (02) : 1 - 13