OPEN-SOURCE: ATTENTION-BASED NEURAL NETWORKS FOR CHROMA INTRA PREDICTION IN VIDEO CODING

被引:0
|
作者
Blanch, Marc Gorriz [1 ,2 ]
Blasi, Saverio [1 ]
Smeaton, Alan [2 ]
O'Connor, Noel E. [2 ]
Mrak, Marta [1 ]
机构
[1] British Broadcasting Corp, London, England
[2] Dublin City Univ, Dublin, Ireland
来源
2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) | 2021年
关键词
Chroma intra prediction; convolutional neural networks; attention algorithms; complexity reduction;
D O I
10.1109/ICMEW53276.2021.9455958
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to be complex and computationally intense, and may be difficult to deploy in a practical video coding pipeline. The software presented in this paper introduces a collection of simplifications to reduce the complexity overhead of the attention-based architectures. The simplified models are integrated into the Versatile Video Coding (VVC) prediction pipeline, retaining compression efficiency of previous chroma intra-prediction methods based on neural networks, while offering different directions for significantly reducing coding complexity.
引用
收藏
页数:2
相关论文
共 50 条
  • [21] ASENN: attention-based selective embedding neural networks for road distress prediction
    Philip, Babitha
    Xu, Zhenyu
    Aljassmi, Hamad
    Zhang, Qieshi
    Ali, Luqman
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [22] Attention-based dynamic multilayer graph neural networks for loan default prediction
    Zandi, Sahab
    Korangi, Kamesh
    Oskarsdottir, Maria
    Mues, Christophe
    Bravo, Cristian
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2025, 321 (02) : 586 - 599
  • [23] Attention-based graph neural networks: a survey
    Sun, Chengcheng
    Li, Chenhao
    Lin, Xiang
    Zheng, Tianji
    Meng, Fanrong
    Rui, Xiaobin
    Wang, Zhixiao
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 2) : 2263 - 2310
  • [24] Attention-based graph neural networks: a survey
    Chengcheng Sun
    Chenhao Li
    Xiang Lin
    Tianji Zheng
    Fanrong Meng
    Xiaobin Rui
    Zhixiao Wang
    Artificial Intelligence Review, 2023, 56 : 2263 - 2310
  • [25] Enhanced Cross-Component Linear Model for Chroma Intra-Prediction in Video Coding
    Zhang, Kai
    Chen, Jianle
    Zhang, Li
    Li, Xiang
    Karczewicz, Marta
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) : 3983 - 3997
  • [26] Multi-model Based Cross-component Linear Model Chroma Intra-prediction for Video Coding
    Zhang, Kai
    Chen, Jianle
    Zhang, Li
    Li, Xiang
    Karczewicz, Marta
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [27] CowLog: Open-source software for coding behaviors from digital video
    Hanninen, Laura
    Pastell, Matti
    BEHAVIOR RESEARCH METHODS, 2009, 41 (02) : 472 - 476
  • [28] CowLog: Open-source software for coding behaviors from digital video
    Laura Hänninen
    Matti Pastell
    Behavior Research Methods, 2009, 41 : 472 - 476
  • [29] Enhanced Intra Prediction with Recurrent Neural Network in Video Coding
    Hu, Yueyu
    Yang, Wenhan
    Xia, Sifeng
    Cheng, Wen-Huang
    Liu, Jiaying
    2018 DATA COMPRESSION CONFERENCE (DCC 2018), 2018, : 413 - 413
  • [30] Dipole: Diagnosis Prediction in Healthcare via Attention-based Bidirectional Recurrent Neural Networks
    Ma, Fenglong
    Chitta, Radha
    Zhou, Jing
    You, Quanzeng
    Sun, Tong
    Gao, Jing
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1903 - 1911