Machine Learning Accelerated Transform Search For AV1

被引:4
|
作者
Su, Hui [1 ]
Chen, Mingliang [1 ]
Bokov, Alexander [1 ]
Mukherjee, Debargha [1 ]
Wang, Yunqing [1 ]
Chen, Yue [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
关键词
Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup;
D O I
10.1109/pcs48520.2019.8954514
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
AV1 is the state-of-the-art open and royalty-free video compression format that achieves significant bitrate savings over previous generation of video codecs. One of AV1's major improvement over its predecessor VP9 is the support of more diverse and flexible transform size and kernel selection. However, it also drastically increases the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features that are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models use input features extracted from the prediction residue block such as standard deviation, correlation and energy distribution. The output of the models indicates the estimated likelihood of which transform size and kernel would be selected as the optimal choice. Based on the ML models, the encoder can prune out the transform size and kernel candidates that are unlikely to be selected and save unnecessary computation to compute their rate-distortion cost. The proposed approach is implemented and tested on the AV1 reference library libaom. The experimental results show that satisfactory encoding speed improvement can be achieved with extremely low compression performance loss. The framework and methodology can also be easily migrated to other video codecs and implementations.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] IMPROVED INTRA MODE CODING BEYOND AV1
    Jin, Yize
    Zhao, Liang
    Zhao, Xin
    Liu, Shan
    Bovik, Alan C.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1580 - 1584
  • [22] AV1 Benchmarking Test for 3GPP
    Lei, Zhijun
    Song, Jun Sik
    Grange, Adrian
    Han, Jingning
    Simmons, John
    Norkin, Andrey
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226
  • [23] Performance Evaluation of AV1 Intra Coding Tools
    Thuong Nguyen Canh
    Xu, Motong
    Jeon, Byeungwoo
    2019 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2019,
  • [24] THE AV1 CONSTRAINED DIRECTIONAL ENHANCEMENT FILTER (CDEF)
    Midtskogen, Steinar
    Valin, Jean-Marc
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 1193 - 1197
  • [25] Film Grain Synthesis for AV1 Video Codec
    Norkin, Andrey
    Birkbeck, Neil
    2018 DATA COMPRESSION CONFERENCE (DCC 2018), 2018, : 3 - 12
  • [26] Bit-Width Optimized Transposition Buffer Design for the AV1 2D-DCT Transform
    Rodrigues, Jelson
    Goebel, Jones
    Agostini, Luciano
    Zatt, Bruno
    Porto, Marcelo
    2024 37TH SBC/SBMICRO/IEEE SYMPOSIUM ON INTEGRATED CIRCUITS AND SYSTEMS DESIGN, SBCCI 2024, 2024, : 210 - 214
  • [27] Joint Asymptotic Closed-Loop Design of Secondary Transform and Scan Order for Inter Coding in AV1
    Sivakumar, Kruthika Koratti
    Vishwanath, Bharath
    Rose, Kenneth
    2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
  • [28] On the Evaluation of Coarse Grained Parallelism in AV1 Video Coding
    Papadopoulos, Panos K.
    Koziri, Maria G.
    Tziritas, Nikos
    Loukopoulos, Thanasis
    Anagnostopoulos, Ioannis
    Saloun, Petr
    Andresic, David
    2018 13TH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2018), 2018, : 55 - 59
  • [29] Benchmarking and Analysis of AV1 Software Decoding on Android Devices
    Grunau, Janne
    Kempf, Jean-Baptiste
    Storsjo, Martin
    Jeeva Raj, A.
    Patankar, Kaustubh
    Srinivasan, Mukund
    Bultje, Ronald S.
    Gramner, Henrik
    Trudeau, Luc
    Le Couviour Tuffet, Victorien
    Lei, Zhijun
    Katsavounidis, Ioannis
    Ronca, David
    Proceedings of SPIE - The International Society for Optical Engineering, 2022, 12226
  • [30] Rust AV1 Encoder (rav1e) project
    Barbato, Luca
    Barr, David M.
    Molodetskikh, Ivan
    Montgomery, Christopher 'Monty'
    Shreevari, S. P.
    Zumer, Raphael A.
    Egge, Nathan E.
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137