Enhanced Machine Learning-based Inter Coding for VVC

被引:4
|
作者
Benjak, Martin [1 ]
Meuel, Holger [1 ]
Laude, Thorsten [1 ]
Ostermann, Jorn [1 ]
机构
[1] Leibniz Univ Hannover, Inst Informat Verarbeitung, Hannover, Germany
关键词
VVC; inter coding; video coding; machine learning; recurrent neural networks;
D O I
10.1109/ICAIIC51459.2021.9415184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture buffer are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.
引用
收藏
页码:21 / 25
页数:5
相关论文
共 50 条
  • [31] LEARNING-BASED RATE CONTROL FOR LEARNING-BASED POINT CLOUD GEOMETRY CODING
    Ruivo, Manuel
    Guarda, Andre F. R.
    Pereira, Fernando
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 251 - 255
  • [32] An Enhanced Machine Learning-Based Analysis of Teaching and Learning Process for Higher Education System
    Alsafyani, Majed
    ADVANCES IN INFORMATION SYSTEMS, ARTIFICIAL INTELLIGENCE AND KNOWLEDGE MANAGEMENT, ICIKS 2023, 2024, 486 : 321 - 332
  • [33] Learning-Based Fast Depth Inter Coding for 3D-HEVC via XGBoost
    Zhang, Zixiang
    Yu, Li
    Qian, Jian
    Wang, Hongkui
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 43 - 52
  • [34] Machine Learning-Based Fast Angular Prediction Mode Decision Technique in Video Coding
    Ryu, Sookyung
    Kang, Je-Won
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) : 5525 - 5538
  • [35] The JPEG Pleno Learning-Based Point Cloud Coding Standard: Serving Man and Machine
    Guarda, Andre F. R.
    Rodrigues, Nuno M. M.
    Pereira, Fernando
    IEEE ACCESS, 2025, 13 : 43289 - 43315
  • [36] Machine Learning-Based Coding Unit Depth Decisions for Flexible Complexity Allocation in High Efficiency Video Coding
    Zhang, Yun
    Kwong, Sam
    Wang, Xu
    Yuan, Hui
    Pan, Zhaoqing
    Xu, Long
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (07) : 2225 - 2238
  • [37] Learning-Based Complexity Reduction Scheme for VVC Intra-Frame Prediction
    Saldanha, Mario
    Sanchez, Gustavo
    Marcon, Cesar
    Agostini, Luciano
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [38] Learning-Based Fast Splitting and Directional Mode Decision for VVC Intra Prediction
    Huang, Yuanyuan
    Yu, Junyi
    Wang, Dayong
    Lu, Xin
    Dufaux, Frederic
    Guo, Hui
    Zhu, Ce
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (02) : 681 - 692
  • [39] Fast CTU Partition Decision Algorithm for VVC Intra and Inter Coding
    Tang, Na
    Cao, Jian
    Liang, Fan
    Wang, Jun
    Liu, Hongmei
    Wang, Xiaoyang
    Du, Xiaorong
    2019 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2019), 2019, : 361 - 364
  • [40] Machine Learning-Based Algorithmic Approach for Enhanced Anomaly Detection in Financial Transactions
    Sivakumar
    Mariyappan
    Prakash, P. G. Om
    SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2021, 2022, 93 : 779 - 790