DVC: An End-to-end Deep Video Compression Framework

被引:332
|
作者
Lu, Guo [1 ]
Ouyang, Wanli [2 ]
Xu, Dong [3 ]
Zhang, Xiaoyun [1 ]
Cai, Chunlei [1 ]
Gao, Zhiyong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Univ Sydney, SenseTime Comp Vis Res Grp, Sydney, NSW, Australia
[3] Univ Sydney, Sydney, NSW, Australia
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
D O I
10.1109/CVPR.2019.01126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful nonlinear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits and improving quality of the decoded video. Experimental results show that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard H.265 in terms of MS-SSIM.
引用
收藏
页码:10998 / 11007
页数:10
相关论文
共 50 条
  • [1] An End-to-End Learning Framework for Video Compression
    Lu, Guo
    Zhang, Xiaoyun
    Ouyang, Wanli
    Chen, Li
    Gao, Zhiyong
    Xu, Dong
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3292 - 3308
  • [2] FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space
    Hu, Zhihao
    Xu, Dong
    Lu, Guo
    Jiang, Wei
    Wang, Wei
    Liu, Shan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) : 4569 - 4585
  • [3] Review and Evaluation of End-to-End Video Compression with Deep-Learning
    Yasin, Hajar Maseeh
    Ameen, Siddeeq Yosef
    2021 INTERNATIONAL CONFERENCE OF MODERN TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY INDUSTRY (MTICTI 2021), 2021, : 81 - 88
  • [4] New Results in End-to-end Image and Video Compression by Deep Learning
    Ozsoy, Gokberk
    Yilmaz, Melih
    Kirmemis, Ogun
    Tekalp, A. Murat
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [5] Retargeting Video With an End-to-End Framework
    Le, Thi-Ngoc-Hanh
    Huang, HuiGuang
    Chen, Yi-Ru
    Lee, Tong-Yee
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6164 - 6176
  • [6] Comprehensive Review of End-to-End Video Compression
    Shi, Liangfan
    Lu, Huimin
    20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 43 - 48
  • [7] An End-to-End Deep Learning Image Compression Framework Based on Semantic Analysis
    Wang, Cheng
    Han, Yifei
    Wang, Weidong
    APPLIED SCIENCES-BASEL, 2019, 9 (17):
  • [8] End-to-End Deep ROI Image Compression
    Akutsu, Hiroaki
    Naruko, Takahiro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (05): : 1031 - 1038
  • [9] Deep-PCAC: An End-to-End Deep Lossy Compression Framework for Point Cloud Attributes
    Sheng, Xihua
    Li, Li
    Liu, Dong
    Xiong, Zhiwei
    Li, Zhu
    Wu, Feng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2617 - 2632
  • [10] End-to-end video compression for surveillance and conference videos
    Wang, Shenhao
    Zhao, Yu
    Gao, Han
    Ye, Mao
    Li, Shuai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42713 - 42730