Low Complexity In-Loop Filter for VVC Based on Convolution and Transformer

被引:0
|
作者
Feng, Zhen [1 ]
Jung, Cheolkon [1 ]
Zhang, Hao [1 ]
Liu, Yang [2 ]
Li, Ming [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Guangdong OPPO Mobile Telecommun Corp Ltd, Dongguan 523860, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Transformers; Convolutional neural networks; Artificial neural networks; Feature extraction; Training; Image coding; Video coding; Versatile video coding; compression artifacts; in-loop filter; convolutional neural network; transformer; VIDEO; CNN;
D O I
10.1109/ACCESS.2024.3438988
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Joint Video Experts Team (JVET) has explored neural network-based video coding (NNVC) and is trying to introduce NNVC into the versatile video coding (VVC). In NNVC, the NN-based in-loop filter is the most active area, which is very close to deployment of software. Recent NN-based in-loop filters start adopting Transformer to capture context information, but it causes a remarkable increase of complexity to about 1000 kMAC/Pixel. In this paper, we propose a low complexity NN-based in-loop filter for VVC based on convolution and Transformer, named ConvTransNet. ConvTransNet adopts a pyramid structure in feature extraction to capture both global contextual information and local details at multiple scales. Moreover, ConvTransNet combines convolutional neural network (CNN) and Transformer into the in-loop filter. CNN captures local features and reduces compression artifacts in an image, while Transformer captures long-range spatial dependency and enhances global structures in an image. Thus, ConvTransNet enables the NN-based in-loop filter to reduce compression artifacts and enhance visual quality in an image. In ConvTransNet, we use grouped convolutions in CNN and depthwise convolutions in Transformer to reduce the network complexity. Therefore, ConvTransNet successfully captures both local spatial structure and global contextual information in an image and achieves outstanding performance in terms of BD-rate and complexity. Experimental results show that the proposed NN-based in-loop filter based on ConvTransNet achieves average {6.58%, 23.02%, 23.04%} and {8.18%, 22.67%, 22.00%} BD-rate reductions for {Y, U, V} channels over VTM_11.0-NNVC_2.0 anchor under AI and RA configurations, respectively.
引用
收藏
页码:120316 / 120325
页数:10
相关论文
共 50 条
  • [31] Context-based adaptive in-loop filter for video compression
    Nam, Jung-Hak
    Jung, Kwang-Soo
    Jo, Hyun-Ho
    Sim, Donggyu
    Choi, Byeong-Doo
    OPTICAL ENGINEERING, 2012, 51 (05)
  • [32] In-loop filter algorithm based on content pre-analysis
    School of Electronic Engineering, Xidian Univ., Xi'an 710071, China
    Tongxin Xuebao, 2009, 6 (95-102):
  • [33] Low Complexity In-Loop Skin Tone Detection for ROI Coding in the HEVC Encoder
    Goswami, Piyali
    Srikanth, Palanki Venkata
    Rahiman, Jasmin
    2016 TWENTY SECOND NATIONAL CONFERENCE ON COMMUNICATION (NCC), 2016,
  • [34] A DenseNet Based Approach for Multi-Frame In-Loop Filter in HEVC
    Li, Tianyi
    Xu, Mai
    Yang, Ren
    Tao, Xiaoming
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 270 - 279
  • [35] Simplified CNN In-Loop Filter with fixed Classifications
    Lim, Wang-Q
    Stallenberger, Bjoern
    Pfaff, Jonathan
    Schwarz, Heiko
    Marpe, Detlev
    Wiegand, Thomas
    2024 PICTURE CODING SYMPOSIUM, PCS 2024, 2024,
  • [36] Residual Block Fusion in Low Complexity Neural Network-Based In-loop Filtering for Video Compression
    Shao, Tong
    Shingala, Jay N.
    Shyam, Ajay
    Yin, Peng
    Suneja, Ajat
    Badya, Siddarth P.
    Arora, Arjun
    McCarthy, Sean
    2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 392 - 401
  • [37] Perceptual in-Loop Filter for Image and Video Compression
    Wang, Huairui
    Ren, Guangjie
    Ouyang, Tong
    Zhang, Junxi
    Han, Wenwei
    Liu, Zizheng
    Chen, Zhenzhong
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 1769 - 1772
  • [38] In-Loop Filter for H.264/AVC
    Poornima, G. R.
    Kumar, S. C. Prasanna
    Ramachandran, S.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, : 2208 - 2211
  • [39] A NOVEL IN-LOOP FILTER BASED ON CLDT MASKING EFFECT MODEL FOR HEVC
    Luo, Binji
    Liu, Chonghua
    GongZhang
    Ye, Feng
    Yang, Bo
    2012 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2012,
  • [40] Dense Residual Convolutional Neural Network based In-Loop Filter for HEVC
    Wang, Yingbin
    Zhu, Han
    Li, Yiming
    Chen, Zhenzhong
    Liu, Shan
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,