OPEN-SOURCE: ATTENTION-BASED NEURAL NETWORKS FOR CHROMA INTRA PREDICTION IN VIDEO CODING

被引:0
|
作者
Blanch, Marc Gorriz [1 ,2 ]
Blasi, Saverio [1 ]
Smeaton, Alan [2 ]
O'Connor, Noel E. [2 ]
Mrak, Marta [1 ]
机构
[1] British Broadcasting Corp, London, England
[2] Dublin City Univ, Dublin, Ireland
来源
2021 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW) | 2021年
关键词
Chroma intra prediction; convolutional neural networks; attention algorithms; complexity reduction;
D O I
10.1109/ICMEW53276.2021.9455958
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to be complex and computationally intense, and may be difficult to deploy in a practical video coding pipeline. The software presented in this paper introduces a collection of simplifications to reduce the complexity overhead of the attention-based architectures. The simplified models are integrated into the Versatile Video Coding (VVC) prediction pipeline, retaining compression efficiency of previous chroma intra-prediction methods based on neural networks, while offering different directions for significantly reducing coding complexity.
引用
收藏
页数:2
相关论文
共 50 条
  • [41] Demystifying Oversmoothing in Attention-Based Graph Neural Networks
    Wu, Xinyi
    Ajorlou, Amir
    Wu, Zihui
    Jadbabaie, Ali
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [42] Attention-based neural joint source-channel coding of text for point to point and broadcast channel
    Liu, Ting
    Chen, Xuechen
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) : 2379 - 2407
  • [43] Attention-based neural joint source-channel coding of text for point to point and broadcast channel
    Ting Liu
    Xuechen Chen
    Artificial Intelligence Review, 2022, 55 : 2379 - 2407
  • [44] Video Summarization With Attention-Based Encoder-Decoder Networks
    Ji, Zhong
    Xiong, Kailin
    Pang, Yanwei
    Li, Xuelong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) : 1709 - 1717
  • [45] OpenHGNN: An Open-Source Toolkit for Heterogeneous Graph Neural Networks
    Han, Hui
    Zhao, Tianyu
    Yang, Cheng
    Zhang, Hongyi
    Liu, Yaoqi
    Wang, Xiao
    Shi, Chuan
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3993 - 3997
  • [46] Attention-based neural networks for trust evaluation in online social networks
    Xu, Yanwei
    Feng, Zhiyong
    Zhou, Xian
    Xing, Meng
    Wu, Hongyue
    Xue, Xiao
    Chen, Shizhan
    Wang, Chao
    Qi, Lianyong
    INFORMATION SCIENCES, 2023, 630 : 507 - 522
  • [47] PipeCNN: An OpenCL-Based Open-Source FPGA Accelerator for Convolution Neural Networks
    Wang, Dong
    Xu, Ke
    Jiang, Diankun
    2017 INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE TECHNOLOGY (ICFPT), 2017, : 279 - 282
  • [48] PredictPTB: an interpretable preterm birth prediction model using attention-based recurrent neural networks
    AlSaad, Rawan
    Malluhi, Qutaibah
    Boughorbel, Sabri
    BIODATA MINING, 2022, 15 (01)
  • [49] PredictPTB: an interpretable preterm birth prediction model using attention-based recurrent neural networks
    Rawan AlSaad
    Qutaibah Malluhi
    Sabri Boughorbel
    BioData Mining, 15
  • [50] High-Risk Prediction of Cardiovascular Diseases via Attention-Based Deep Neural Networks
    An, Ying
    Huang, Nengjun
    Chen, Xianlai
    Wu, FangXiang
    Wang, Jianxin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (03) : 1093 - 1105