Machine Learning Accelerated Transform Search For AV1

被引：4

作者：

Su, Hui ^{[1
]}

Chen, Mingliang ^{[1
]}

Bokov, Alexander ^{[1
]}

Mukherjee, Debargha ^{[1
]}

Wang, Yunqing ^{[1
]}

Chen, Yue ^{[1
]}

机构：

[1] Google, Mountain View, CA 94043 USA

来源：

2019 PICTURE CODING SYMPOSIUM (PCS) | 2019年

关键词：

Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup;

D O I：

10.1109/pcs48520.2019.8954514

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

AV1 is the state-of-the-art open and royalty-free video compression format that achieves significant bitrate savings over previous generation of video codecs. One of AV1's major improvement over its predecessor VP9 is the support of more diverse and flexible transform size and kernel selection. However, it also drastically increases the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features that are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models use input features extracted from the prediction residue block such as standard deviation, correlation and energy distribution. The output of the models indicates the estimated likelihood of which transform size and kernel would be selected as the optimal choice. Based on the ML models, the encoder can prune out the transform size and kernel candidates that are unlikely to be selected and save unnecessary computation to compute their rate-distortion cost. The proposed approach is implemented and tested on the AV1 reference library libaom. The experimental results show that satisfactory encoding speed improvement can be achieved with extremely low compression performance loss. The framework and methodology can also be easily migrated to other video codecs and implementations.

引用

页数：5

共 50 条

[1] Adaptive complexity control for AV1 video encoder using machine learning
Bender, Isis
Rehbein, Gustavo
Correa, Guilherme
Agostini, Luciano
Porto, Marcelo
JOURNAL OF REAL-TIME IMAGE PROCESSING, 2024, 21 (03)
[2] UNIFIED SECONDARY TRANSFORM FOR INTRA CODING BEYOND AV1
Zhao, Xin
Liu, Shan
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3393 - 3397
[3] MACHINE LEARNING BASED SYMBOL PROBABILITY DISTRIBUTION PREDICTION FOR ENTROPY CODING IN AV1
Chen, Mingliang
Su, Hui
Deng, Sai
Xu, Yaowu
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3374 - 3378
[4] A Technical Overview of AV1
Han, Jingning
Li, Bohan
Mukherjee, Debargha
Chiang, Ching-Han
Grange, Adrian
Chen, Cheng
Su, Hui
Parker, Sarah
Deng, Sai
Joshi, Urvang
Chen, Yue
Wang, Yunqing
Wilkins, Paul
Xu, Yaowu
Bankoski, James
PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1435 - 1462
[5] FastGW: A Machine Learning-Based Early Skip for the AV1 Global Warped Motion Compensation
Kolodziejski, William
Domanski, Robson
Agostini, Luciano
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2024,
[6] Analysis of AV1 Coding Tools
Chuang, Hsiao-Chiang
Lei, Zhijun
Opalach, Agata
Norkin, Andrey
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLV, 2022, 12226
[7] Fast Transform Kernel Selection Based on Frequency Matching and Probability Model for AV1
Hao, Zhijian
Sun, Heming
Xu, Guohao
Liu, Jiaming
Xiong, Xiankui
Zhu, Xuanpeng
Zeng, Xiaoyang
Fan, Yibo
IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (02) : 693 - 707
[8] A Multi-Pass Coding Mode Search Framework For AV1 Encoder Optimization
Chiang, Ching-Han
Han, Jingning
Xu, Yaowu
2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 458 - 467
[9] AV1/AVM development at Google
Chong, In Suk
Young, Joe
Li, Shan
McCullough, Conor
Vitvitskyy, Stan
Rautio, Ville-Mikko
APPLICATIONS OF DIGITAL IMAGE PROCESSING XLVII, 2024, 13137
[10] TRIGGER FOR PHILIPS AV1 VENTILATOR
ROBERTSON, DH
LAING, A
ANAESTHESIA, 1977, 32 (04) : 353 - 354

← 1 2 3 4 5 →