End-to-End Time-Lapse Video Synthesis from a Single Outdoor Image

被引:24
|
作者
Nam, Seonghyeon [1 ]
Ma, Chongyang [2 ]
Chai, Menglei [2 ]
Brendel, William [2 ]
Xu, Ning [3 ]
Kim, Seon Joo [1 ]
机构
[1] Yonsei Univ, Seoul, South Korea
[2] Snap Inc, Santa Monica, CA USA
[3] Amazon Go, Seattle, WA USA
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/CVPR.2019.00150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time-lapse videos usually contain visually appealing content but are often difficult and costly to create. In this paper, we present an end-to-end solution to synthesize a time-lapse video from a single outdoor image using deep neural networks. Our key idea is to train a conditional generative adversarial network based on existing datasets of time-lapse videos and image sequences. We propose a multi-frame joint conditional generation framework to effectively learn the correlation between the illumination change of an outdoor scene and the time of the day. We further present a multi-domain training scheme for robust training of our generative models from two datasets with different distributions and missing timestamp labels. Compared to alternative time-lapse video synthesis algorithms, our method uses the timestamp as the control variable and does not require a reference video to guide the synthesis of the final output. We conduct ablation studies to validate our algorithm and compare with state-of-the-art techniques both qualitatively and quantitatively.
引用
收藏
页码:1409 / 1418
页数:10
相关论文
共 50 条
  • [41] End-to-end Video Matting with Trimap Propagation
    Huang, Wei-Lun
    Lee, Ming-Sui
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 14337 - 14347
  • [42] End-to-End United Video Dehazing and Detection
    Li, Boyi
    Peng, Xiulian
    Wang, Zhangyang
    Xu, Jizheng
    Feng, Dan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 7016 - 7023
  • [43] FrameProv: Towards End-To-End Video Provenance
    Ahmed-Rengers, Mansoor
    NSPW'19: PROCEEDINGS OF THE NEW SECURITY PARADIGMS WORKSHOP, 2019, : 68 - 77
  • [44] REAL-TIME 3D FACE RECONSTRUCTION FROM SINGLE IMAGE USING END-TO-END CNN REGRESSION
    Wang, Shan
    Shen, Xukun
    Yu, Kun
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 3293 - 3297
  • [45] A framework for end-to-end video quality prediction of MPEG video
    Koumaras, Harilaos
    Lin, C. -H.
    Shieh, C-K.
    Kourtis, Anastasios
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2010, 21 (02) : 139 - 154
  • [46] End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks
    Mira, Rodrigo
    Vougioukas, Konstantinos
    Ma, Pingchuan
    Petridis, Stavros
    Schuller, Bjoern W.
    Pantic, Maja
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (06) : 3454 - 3466
  • [47] Towards the Design of an End-to-End Automated System for Image an Video-based Recognition
    Chellappa, Rama
    Chen, Jun-Cheng
    Ranjan, Rajeev
    Sankaranarayanan, Swami
    Kumar, Amit
    Patel, Vishal M.
    Castillo, Carlos D.
    2016 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2016,
  • [48] TIME-LAPSE VIDEO IMAGING OF THE HEMATOPOIETIC MICROENVIRONMENT
    ALLEN, TD
    EXPERIMENTAL HEMATOLOGY, 1992, 20 (01) : 122 - 125
  • [49] A DISK TAPE SYSTEM FOR TIME-LAPSE VIDEO
    YONEKURA, A
    SMPTE JOURNAL, 1982, 91 (10): : 902 - 905
  • [50] Single Image Dehazing Using End-to-End Deep-Dehaze Network
    Fahim, Masud An-Nur Islam
    Jung, Ho Yub
    ELECTRONICS, 2021, 10 (07)