Machine Learning Accelerated Transform Search For AV1

被引:4
|
作者
Su, Hui [1 ]
Chen, Mingliang [1 ]
Bokov, Alexander [1 ]
Mukherjee, Debargha [1 ]
Wang, Yunqing [1 ]
Chen, Yue [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
关键词
Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup;
D O I
10.1109/pcs48520.2019.8954514
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
AV1 is the state-of-the-art open and royalty-free video compression format that achieves significant bitrate savings over previous generation of video codecs. One of AV1's major improvement over its predecessor VP9 is the support of more diverse and flexible transform size and kernel selection. However, it also drastically increases the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features that are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models use input features extracted from the prediction residue block such as standard deviation, correlation and energy distribution. The output of the models indicates the estimated likelihood of which transform size and kernel would be selected as the optimal choice. Based on the ML models, the encoder can prune out the transform size and kernel candidates that are unlikely to be selected and save unnecessary computation to compute their rate-distortion cost. The proposed approach is implemented and tested on the AV1 reference library libaom. The experimental results show that satisfactory encoding speed improvement can be achieved with extremely low compression performance loss. The framework and methodology can also be easily migrated to other video codecs and implementations.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Adaptive complexity control for AV1 video encoder using machine learning
    Bender, Isis
    Rehbein, Gustavo
    Correa, Guilherme
    Agostini, Luciano
    Porto, Marcelo
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)
  • [2] UNIFIED SECONDARY TRANSFORM FOR INTRA CODING BEYOND AV1
    Zhao, Xin
    Liu, Shan
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3393 - 3397
  • [3] MACHINE LEARNING BASED SYMBOL PROBABILITY DISTRIBUTION PREDICTION FOR ENTROPY CODING IN AV1
    Chen, Mingliang
    Su, Hui
    Deng, Sai
    Xu, Yaowu
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3374 - 3378
  • [4] A Technical Overview of AV1
    Han, Jingning
    Li, Bohan
    Mukherjee, Debargha
    Chiang, Ching-Han
    Grange, Adrian
    Chen, Cheng
    Su, Hui
    Parker, Sarah
    Deng, Sai
    Joshi, Urvang
    Chen, Yue
    Wang, Yunqing
    Wilkins, Paul
    Xu, Yaowu
    Bankoski, James
    PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1435 - 1462
  • [5] FastGW: A Machine Learning-Based Early Skip for the AV1 Global Warped Motion Compensation
    Kolodziejski, William
    Domanski, Robson
    Agostini, Luciano
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024,
  • [6] Analysis of AV1 Coding Tools
    Chuang, Hsiao-Chiang
    Lei, Zhijun
    Opalach, Agata
    Norkin, Andrey
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226
  • [7] Fast Transform Kernel Selection Based on Frequency Matching and Probability Model for AV1
    Hao, Zhijian
    Sun, Heming
    Xu, Guohao
    Liu, Jiaming
    Xiong, Xiankui
    Zhu, Xuanpeng
    Zeng, Xiaoyang
    Fan, Yibo
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (02) : 693 - 707
  • [8] A Multi-Pass Coding Mode Search Framework For AV1 Encoder Optimization
    Chiang, Ching-Han
    Han, Jingning
    Xu, Yaowu
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 458 - 467
  • [9] AV1/AVM development at Google
    Chong, In Suk
    Young, Joe
    Li, Shan
    McCullough, Conor
    Vitvitskyy, Stan
    Rautio, Ville-Mikko
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLVII, 2024, 13137
  • [10] TRIGGER FOR PHILIPS AV1 VENTILATOR
    ROBERTSON, DH
    LAING, A
    ANAESTHESIA, 1977, 32 (04) : 353 - 354