Machine Learning Accelerated Transform Search For AV1

被引:4
|
作者
Su, Hui [1 ]
Chen, Mingliang [1 ]
Bokov, Alexander [1 ]
Mukherjee, Debargha [1 ]
Wang, Yunqing [1 ]
Chen, Yue [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
关键词
Video Coding; AV1; Machine Learning; Transform Search; Encoding Speedup;
D O I
10.1109/pcs48520.2019.8954514
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
AV1 is the state-of-the-art open and royalty-free video compression format that achieves significant bitrate savings over previous generation of video codecs. One of AV1's major improvement over its predecessor VP9 is the support of more diverse and flexible transform size and kernel selection. However, it also drastically increases the search space for transform unit rate-distortion optimization in AV1 encoders. Unlike conventional encoder speed features that are based on heuristics, we propose a machine learning (ML) based approach to accelerate the transform size and kernel search for AV1. The ML models use input features extracted from the prediction residue block such as standard deviation, correlation and energy distribution. The output of the models indicates the estimated likelihood of which transform size and kernel would be selected as the optimal choice. Based on the ML models, the encoder can prune out the transform size and kernel candidates that are unlikely to be selected and save unnecessary computation to compute their rate-distortion cost. The proposed approach is implemented and tested on the AV1 reference library libaom. The experimental results show that satisfactory encoding speed improvement can be achieved with extremely low compression performance loss. The framework and methodology can also be easily migrated to other video codecs and implementations.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Short Video Performance Evaluation of AV1 Coding Tools
    Xing, Peiyin
    Cai, Yangang
    Li, Xufeng
    Tian, Yonghong
    2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 378 - 378
  • [32] Quality Assessment of Gaming Videos Compressed via AV1
    Ashimov, Darkhan
    Martini, Maria G.
    Barman, Nabajeet
    2020 TWELFTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2020,
  • [33] Frame-parallel multithreading in libaom AV1 encoder
    Shyla, Nithya Viswanathan
    Prakasan, Remya
    Chakera, Mufaddal
    Singh, Tarundeep
    Chandran, Aasaipriya
    Wang, Yunqing
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)
  • [34] An Overview of Core Coding Tools in the AV1 Video Codec
    Chen, Yue
    Murherjee, Debargha
    Han, Jingning
    Grange, Adrian
    Xu, Yaowu
    Liu, Zoe
    Parker, Sarah
    Chen, Cheng
    Su, Hui
    Joshi, Urvang
    Chiang, Ching-Han
    Wang, Yunqing
    Wilkins, Paul
    Bankoski, Jim
    Trudeau, Luc
    Egge, Nathan
    Valin, Jean-Marc
    Davies, Thomas
    Midtskogent, Steinar
    Norkin, Andrey
    de Rivaz, Peter
    2018 PICTURE CODING SYMPOSIUM (PCS 2018), 2018, : 41 - 45
  • [35] Introducing AV1 Codec-Level Video Steganography
    Catania, Lorenzo
    Allegra, Dario
    Giudice, Oliver
    Stanco, Filippo
    Battiato, Sebastiano
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 284 - 294
  • [36] Speeding Up the AV1 Global Warped Motion Compensation
    Kolodziejski, William
    Domanski, Robson
    Agostini, Luciano
    15TH IEEE LATIN AMERICAN SYMPOSIUM ON CIRCUITS AND SYSTEMS, LASCAS 2024, 2024, : 153 - 157
  • [37] HDR Video Coding For Aerial Video With VVC and AV1
    Topiwala, P.
    Dai, W.
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [38] ADVANCED MOTION VECTOR DIFFERENCE CODING BEYOND AV1
    Zhao, Liang
    Zhao, Xin
    Liu, Shan
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3631 - 3635
  • [39] Microarchitectural Performance Evaluation of AV1 Video Encoding Workloads
    Jensen, Steffen
    Lee, Jaekyu
    Sunwoo, Dam
    Horsnell, Matthew J.
    John, Lizy K.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON PERFORMANCE ANALYSIS OF SYSTEMS AND SOFTWARE (ISPASS 2022), 2022, : 251 - 253
  • [40] A SUBJECTIVE COMPARISON OF AV1 AND HEVC FOR ADAPTIVE VIDEO STREAMING
    Katsenou, Angeliki V.
    Zhang, Fan
    Afonso, Mariana
    Bull, David R.
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4145 - 4149