Machine Learning Accelerated Transform Search For AV1

Cited by: 4
Authors
Su, Hui [1]
Chen, Mingliang [1]
Bokov, Alexander [1]
Mukherjee, Debargha [1]
Wang, Yunqing [1]
Chen, Yue [1]
Affiliation
[1] Google, Mountain View, CA 94043 USA
Keywords
Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup;
DOI
10.1109/pcs48520.2019.8954514
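Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification codes
0808 ; 0809 ;
Abstract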
AV1 is the state-of-the-art open and royalty-free video compression format that achieves significant bitrate savings over the previous generation of video codecs. One of AV1's major improvements over its predecessor VP9 is its support for more diverse and flexible transform size and kernel selection. However, this flexibility also drastically increases the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features that are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models use input features extracted from the prediction residue block, such as standard deviation, correlation and energy distribution. The output of the models is the estimated likelihood that each transform size and kernel would be selected as the optimal choice. Based on the ML models, the encoder can prune the transform size and kernel candidates that are unlikely to be selected and avoid the unnecessary computation of their rate-distortion costs. The proposed approach is implemented and tested on the AV1 reference library libaom. The experimental results show that satisfactory encoding speed improvement can be achieved with extremely low compression performance loss. The framework and methodology can also be easily migrated to other video codecs and implementations.
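The pruning idea described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the four-entry candidate list, the feature definitions, the single linear layer with softmax, the threshold value, and all function names are hypothetical and are not taken from the paper or from libaom. The abstract does not state the model architecture, so the linear model here is only a stand-in.

```python
import math

# Hypothetical candidate list; AV1 defines many more size/kernel combinations.
CANDIDATES = ["DCT_DCT", "ADST_DCT", "DCT_ADST", "ADST_ADST"]


def residual_features(block):
    """Return [std dev, horizontal corr, vertical corr, top-half energy share]
    for a 2-D residual block given as a list of equal-length rows."""
    rows, cols = len(block), len(block[0])
    n = rows * cols
    mean = sum(sum(r) for r in block) / n
    var = sum((v - mean) ** 2 for r in block for v in r) / n

    def corr(pairs):
        # Rough correlation estimate between neighbouring samples.
        if var == 0 or not pairs:
            return 0.0
        return sum((a - mean) * (b - mean) for a, b in pairs) / (len(pairs) * var)

    horiz = [(block[y][x], block[y][x + 1])
             for y in range(rows) for x in range(cols - 1)]
    vert = [(block[y][x], block[y + 1][x])
            for y in range(rows - 1) for x in range(cols)]
    energy = sum(v * v for r in block for v in r)
    top = sum(v * v for r in block[: rows // 2] for v in r)
    top_share = top / energy if energy > 0 else 0.5
    return [math.sqrt(var), corr(horiz), corr(vert), top_share]


def softmax(scores):
    """Numerically stable softmax."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def predict_likelihoods(features, weights, biases):
    """Stand-in model: one linear layer followed by softmax (an assumption)."""
    scores = [sum(w * f for w, f in zip(row, features)) + b
              for row, b in zip(weights, biases)]
    return softmax(scores)


def prune_candidates(likelihoods, threshold):
    """Keep indices whose likelihood reaches the threshold.
    The most likely candidate is always kept so the search is never empty."""
    best = max(range(len(likelihoods)), key=likelihoods.__getitem__)
    return [i for i, p in enumerate(likelihoods) if p >= threshold or i == best]


def search_transform(block, weights, biases, rd_cost, threshold=0.1):
    """Evaluate the rate-distortion cost only for the surviving candidates."""
    probs = predict_likelihoods(residual_features(block), weights, biases)
    kept = prune_candidates(probs, threshold)
    return min((CANDIDATES[i] for i in kept), key=lambda name: rd_cost(block, name))
```

Here `rd_cost` is a caller-supplied function standing in for the encoder's expensive rate-distortion evaluation. The threshold trades speed against compression loss: a higher value prunes more candidates and skips more cost evaluations, but raises the chance of discarding the true optimum.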
Pages: 5