DVC: An End-to-end Deep Video Compression Framework

被引:332
|
作者
Lu, Guo [1 ]
Ouyang, Wanli [2 ]
Xu, Dong [3 ]
Zhang, Xiaoyun [1 ]
Cai, Chunlei [1 ]
Gao, Zhiyong [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Univ Sydney, SenseTime Comp Vis Res Grp, Sydney, NSW, Australia
[3] Univ Sydney, Sydney, NSW, Australia
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
D O I
10.1109/CVPR.2019.01126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Conventional video compression approaches use the predictive coding architecture and encode the corresponding motion information and residual information. In this paper, taking advantage of both classical architecture in the conventional video compression method and the powerful nonlinear representation ability of neural networks, we propose the first end-to-end video compression deep model that jointly optimizes all the components for video compression. Specifically, learning based optical flow estimation is utilized to obtain the motion information and reconstruct the current frames. Then we employ two auto-encoder style neural networks to compress the corresponding motion and residual information. All the modules are jointly learned through a single loss function, in which they collaborate with each other by considering the trade-off between reducing the number of compression bits and improving quality of the decoded video. Experimental results show that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard H.265 in terms of MS-SSIM.
引用
收藏
页码:10998 / 11007
页数:10
相关论文
共 50 条
  • [21] End-to-end Optimized Video Compression with MV-Residual Prediction
    Wu, XiangJi
    Zhang, Ziwen
    Feng, Jie
    Zhou, Lei
    Wu, Junmin
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 611 - 614
  • [22] End-to-End Learning for Video Frame Compression with Self-Attention
    Zou, Nannan
    Zhang, Honglei
    Cricri, Francesco
    Tavakoli, Hamed R.
    Lainema, Jani
    Aksu, Emre
    Hannuksela, Miska
    Rahtu, Esa
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 580 - 584
  • [23] Compression of End-to-End Models
    Pang, Ruoming
    Sainath, Tara N.
    Prabhavalkar, Rohit
    Gupta, Suyog
    Wu, Yonghui
    Zhang, Shuyuan
    Chiu, Chung-cheng
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 27 - 31
  • [24] End-to-end deep learning framework for digital holographic reconstruction
    Zhenbo Ren
    Zhimin Xu
    Edmund Y.Lam
    Advanced Photonics, 2019, (01) : 76 - 87
  • [25] End-to-End Video Captioning
    Olivastri, Silvio
    Singh, Gurkirt
    Cuzzolin, Fabio
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1474 - 1482
  • [26] DeepAttest: An End-to-End Attestation Framework for Deep Neural Networks
    Chen, Huili
    Fu, Cheng
    Rouhani, Bita Darvish
    Zhao, Jishen
    Koushanfar, Farinaz
    PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 487 - 498
  • [27] End-to-End Deep Learning Proactive Content Caching Framework
    Bakr, Eslam Mohamed
    Ben-Ammar, Hamza
    Eraqi, Hesham M.
    Aly, Sherif G.
    Elbatt, Tamer
    Ghamri-Doudane, Yacine
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 1043 - 1048
  • [28] An End-to-End Deep Learning Framework for Wideband Signal Recognition
    Vagollari, Adela
    Hirschbeck, Martin
    Gerstacker, Wolfgang
    IEEE ACCESS, 2023, 11 : 52899 - 52922
  • [29] A Deep Learning Framework for End-to-End Control of Powered Prostheses
    Nuesslein, Christoph P. O.
    Young, Aaron J.
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 3988 - 3994
  • [30] End-to-End Learning-Based Image Compression With a Decoupled Framework
    Zhang, Zhaobin
    Esenlik, Semih
    Wu, Yaojun
    Wang, Meng
    Zhang, Kai
    Zhang, Li
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3067 - 3081