Machine Learning Accelerated Transform Search For AV1

被引：4

作者：

Su, Hui ^{[1
]}

Chen, Mingliang ^{[1
]}

Bokov, Alexander ^{[1
]}

Mukherjee, Debargha ^{[1
]}

Wang, Yunqing ^{[1
]}

Chen, Yue ^{[1
]}

机构：

[1] Google, Mountain View, CA 94043 USA

来源：

2019 PICTURE CODING SYMPOSIUM (PCS) | 2019年

关键词：

Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup;

D O I：

10.1109/pcs48520.2019.8954514

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

AV1 is the state-of-the-art open and royalty-free video compression format that achieves significant bitrate savings over previous generation of video codecs. One of AV1's major improvement over its predecessor VP9 is the support of more diverse and flexible transform size and kernel selection. However, it also drastically increases the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features that are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models use input features extracted from the prediction residue block such as standard deviation, correlation and energy distribution. The output of the models indicates the estimated likelihood of which transform size and kernel would be selected as the optimal choice. Based on the ML models, the encoder can prune out the transform size and kernel candidates that are unlikely to be selected and save unnecessary computation to compute their rate-distortion cost. The proposed approach is implemented and tested on the AV1 reference library libaom. The experimental results show that satisfactory encoding speed improvement can be achieved with extremely low compression performance loss. The framework and methodology can also be easily migrated to other video codecs and implementations.

引用

页数：5

共 50 条

[41] Benchmarking and Analysis of AV1 Software Decoding on Android Devices
Grunau, Janne
Kempf, Jean-Baptiste
Storsjo, Martin
Raj, Jeeva A.
Patankar, Kaustubh
Srinivasan, Mukund
Bultje, Ronald S.
Gramner, Henrik
Trudeau, Luc
Tuffet, Victorien Le Couviour
Lei, Zhijun
Katsavounidis, Ioannis
Ronca, David
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226
[42] Direct Optimisation of λ for HDR Content Adaptive Transcoding in AV1
Vibhoothi
Pitie, Francois
Katsenou, Angeliki
Ringis, Daniel Joseph
Su, Yeping
Birkbeck, Neil
Lin, Jessie
Adsumilli, Balu
Kokaram, Anil
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226
[43] AV1 In-loop Super-resolution Framework
Joshi, Urvang
Mukherjee, Debargha
Chen, Yue
Parker, Sarah
Grange, Adrian
Su, Hui
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLII, 2019, 11137
[44] In-loop Frame Super-resolution in AV1
Joshi, Urvang
Mukherjee, Debargha
Chen, Yue
Parker, Sarah
Grange, Adrian
2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
[45] OPTIMIZING AV1 ENCODER FOR REAL-TIME COMMUNICATION
Kyslov, Fyodor
Paniconi, Marco
Jiang, Jerome
Wang, Yunqing
Tsai, Chi Yo
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 941 - 945
[46] Selection of Intra Prediction Tools for Fast AV1 Encoding
Xu, Motong
Jeon, Byeungwoo
2020 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2020,
[47] Performance Comparison of VVC, AV1, HEVC, and AVC for High Resolutions
Uhrina, Miroslav
Sevcik, Lukas
Bienik, Juraj
Smatanova, Lenka
ELECTRONICS, 2024, 13 (05)
[48] Accelerated Search and Design of Stretchable Graphene Kirigami Using Machine Learning
Hanakata, Paul Z.
Cubuk, Ekin D.
Campbell, David K.
Park, Harold S.
PHYSICAL REVIEW LETTERS, 2018, 121 (25)
[49] An Accelerated Convex Optimization Algorithm with Line Search and Applications in Machine Learning
Chumpungam, Dawan
Sarnmeta, Panitarn
Suantai, Suthep
MATHEMATICS, 2022, 10 (09)
[50] Machine learning accelerated search for new double perovskite oxide photocatalysis
Wan Xin-Yang
Zhang Ye-Hui
Lu Shuai-Hua
Wu Yi-Lei
Zhou Qiong-Hua
Wang Jin-Lan
ACTA PHYSICA SINICA, 2022, 71 (17)

← 1 2 3 4 5 →