Towards Real-Time Neural Video Codec for Cross-Platform Application Using Calibration Information

被引:1
|
作者
Tian, Kuan [1 ]
Guan, Yonghang [1 ]
Xiang, Jinxi [1 ]
Zhang, Jun [1 ]
Han, Xiao [1 ]
Yang, Wei [1 ]
机构
[1] Tencent AI Lab, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
关键词
Neural video codec; cross-platform; real-time codec;
D O I
10.1145/3581783.3611955
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The state-of-the-art neural video codecs have outperformed the most sophisticated traditional codecs in terms of rate-distortion (RD) performance in certain cases. However, utilizing them for practical applications is still challenging for two major reasons. 1) Cross-platform computational errors resulting from floating point operations can lead to inaccurate decoding of the bitstream. 2) The high computational complexity of the encoding and decoding process poses a challenge in achieving real-time performance. In this paper, we propose a real-time cross-platform neural video codec, which is capable of efficiently decoding (approximate to 25FPS) of 720P video bitstream from other encoding platforms on a consumer-grade GPU (e.g., NVIDIA RTX 2080). First, to solve the problem of inconsistency of codec caused by the uncertainty of floating point calculations across platforms, we design a calibration transmitting system to guarantee the consistent quantization of entropy parameters between the encoding and decoding stages. The parameters that may have transboundary quantization between encoding and decoding are identified in the encoding stage, and their coordinates will be delivered by auxiliary transmitted bitstream. By doing so, these inconsistent parameters can be processed properly in the decoding stage. Furthermore, to reduce the bitrate of the auxiliary bitstream, we rectify the distribution of entropy parameters using a piecewise Gaussian constraint. Second, to match the computational limitations on the decoding side for real-time video codec, we design a lightweight model. A series of efficiency techniques, such as model pruning, motion downsampling, and arithmetic coding skipping, enable our model to achieve 25 FPS decoding speed on NVIDIA RTX 2080 GPU. Experimental results demonstrate that our model can achieve real-time decoding of 720P videos while encoding on another platform. Furthermore, the real-time model brings up to a maximum of 24.2% BD-rate improvement from the perspective of PSNR with the anchor H.265 (medium).
引用
收藏
页码:7961 / 7970
页数:10
相关论文
共 50 条
  • [21] MEWAR: Development of a Cross-Platform Mobile Application and Web Dashboard System for Real-Time Mosquito Surveillance in Northeast Brazil
    Aldosery, Aisha
    Musah, Anwar
    Birjovanu, Georgiana
    Moreno, Giselle
    Boscor, Andrei
    Dutra, Livia
    Santos, George
    Nunes, Vania
    Oliveira, Rossandra
    Ambrizzi, Tercio
    Massoni, Tiago
    dos Santos, Wellington Pinheiro
    Kostkova, Patty
    FRONTIERS IN PUBLIC HEALTH, 2021, 9
  • [22] Cross-platform virtual reality for real-time construction safety training using immersive web and industry foundation classes
    Bao, Lan
    Tran, Si Van -Tien
    Nguyen, Truong Linh
    Pham, Hai Chien
    Lee, Dongmin
    Park, Chansik
    AUTOMATION IN CONSTRUCTION, 2022, 143
  • [23] Real-time segmentation of video on a multiprocessor platform
    Arapis, C
    Gibbs, S
    Breiteneder, C
    PARALLEL COMPUTING, 1997, 23 (12) : 1777 - 1792
  • [24] Real-time Video Quality Assessment Platform
    Papp, Istvan
    Lukic, Nemanja
    Marceta, Zoran
    Teslic, Nikola
    2009 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2009, : 215 - +
  • [25] Real-Time Video Streaming using GStreamer In GNU Radio Platform
    Nimmi, S.
    Saranya, V
    Theerthadas
    Gandhiraj, R.
    2014 INTERNATIONAL CONFERENCE ON GREEN COMPUTING COMMUNICATION AND ELECTRICAL ENGINEERING (ICGCCEE), 2014,
  • [26] Flutter-Based Cross-Platform Data Visualization of Real-Time Road Incident Analysis & Prediction
    Walee, Nafeeul Alam
    Shalan, Atef
    2024 5TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, ROBOTICS AND CONTROL, AIRC 2024, 2024, : 133 - 137
  • [27] The Development and Application of Cross-Platform Coal Mine Mobile Information System
    Yu, Nan
    Liu, Chuanchang
    Chen, Junliang
    2011 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), VOLS 1-4, 2012, : 1492 - 1496
  • [28] Cross-Platform Real-Time Collaborative Modeling: An Architecture and a Prototype Implementation via EMF.Cloud
    Aslam, Kousar
    Chen, Yu
    Butt, Muhammad
    Malavolta, Ivano
    IEEE ACCESS, 2023, 11 : 49241 - 49260
  • [29] Cross Platform Real-time Voice Transfer
    Zidek, K.
    Seminsky, J.
    2010 IEEE 8TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, 2010, : 231 - 232
  • [30] Real-Time Software Video Codec with a Fast Adaptive Motion Vector Search
    Tatsuji Moriyoshi
    Hiroshi Shinohara
    Takashi Miyazaki
    Ichiro Kuroda
    Journal of VLSI signal processing systems for signal, image and video technology, 2001, 29 : 239 - 245