Machine Learning Accelerated Transform Search For AV1

Cited by: 4
Authors
Su, Hui [1]
Chen, Mingliang [1]
Bokov, Alexander [1]
Mukherjee, Debargha [1]
Wang, Yunqing [1]
Chen, Yue [1]
Affiliations
[1] Google, Mountain View, CA 94043 USA
Keywords
Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup
DOI
10.1109/pcs48520.2019.8954514
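Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809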
Abstract
AV1 is a state-of-the-art, open, royalty-free video compression format that achieves significant bitrate savings over the previous generation of video codecs. One of AV1's major improvements over its predecessor VP9 is its support for more diverse and flexible transform sizes and kernels. However, this flexibility drastically enlarges the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features, which are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models take as input features extracted from the prediction residue block, such as standard deviation, correlation, and energy distribution. The models output the estimated likelihood that each transform size and kernel will be selected as the optimal choice. Based on these outputs, the encoder can prune the transform size and kernel candidates that are unlikely to be selected and skip the computation of their rate-distortion costs. The proposed approach is implemented and tested in libaom, the AV1 reference library. The experimental results show that a satisfactory encoding speedup can be achieved with extremely low loss in compression performance. The framework and methodology can also be easily migrated to other video codecs and implementations.
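The pruning idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation or libaom code. The feature definitions (population standard deviation, neighbor-sample Pearson correlation, per-quadrant energy share), the function names `residual_features` and `prune_candidates`, the relative threshold rule, and the example scores are all assumptions made for illustration. The model that turns features into scores is left out and treated as a black box.

```python
import math
from statistics import pstdev


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences; 0.0 if either is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / math.sqrt(vx * vy)


def residual_features(block):
    """Hypothetical feature vector for a residual block (list of rows).

    Returns [std, horizontal corr, vertical corr, 4 quadrant energy shares].
    Assumes even height and width, both at least 2.
    """
    h, w = len(block), len(block[0])
    flat = [v for row in block for v in row]
    std = pstdev(flat)
    # Pair each sample with its right neighbor and its lower neighbor.
    left = [block[r][c] for r in range(h) for c in range(w - 1)]
    right = [block[r][c + 1] for r in range(h) for c in range(w - 1)]
    top = [block[r][c] for r in range(h - 1) for c in range(w)]
    bottom = [block[r + 1][c] for r in range(h - 1) for c in range(w)]
    # Share of the total squared-sample energy in each quadrant.
    total = sum(v * v for v in flat)
    shares = []
    for r0 in (0, h // 2):
        for c0 in (0, w // 2):
            energy = sum(block[r][c] ** 2
                         for r in range(r0, r0 + h // 2)
                         for c in range(c0, c0 + w // 2))
            shares.append(energy / total if total else 0.25)
    return [std, pearson(left, right), pearson(top, bottom)] + shares


def prune_candidates(candidates, scores, rel_threshold=0.2):
    """Keep candidates whose softmax probability is at least
    rel_threshold times that of the most likely candidate (assumed rule)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]  # subtract max for stability
    z = sum(exps)
    probs = [e / z for e in exps]
    best = max(probs)
    return [c for c, p in zip(candidates, probs) if p >= rel_threshold * best]


# Usage: the scores stand in for the output of a trained model (made-up values).
block = [[4, 2, 0, 0], [2, 1, 0, 0], [0, 0, 0, 0], [0, 0, 0, 0]]
features = residual_features(block)
kernels = ["DCT_DCT", "ADST_DCT", "DCT_ADST", "IDTX"]
kept = prune_candidates(kernels, [2.0, 1.5, -1.0, -3.0])
```

Only the candidates in `kept` would then go through the full rate-distortion cost evaluation. Because the rule is relative to the best score, the most likely candidate is always retained, so the search never ends up empty.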
Pages: 5