Low Complexity In-Loop Filter for VVC Based on Convolution and Transformer

被引:0
|
作者
Feng, Zhen [1 ]
Jung, Cheolkon [1 ]
Zhang, Hao [1 ]
Liu, Yang [2 ]
Li, Ming [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Guangdong OPPO Mobile Telecommun Corp Ltd, Dongguan 523860, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Transformers; Convolutional neural networks; Artificial neural networks; Feature extraction; Training; Image coding; Video coding; Versatile video coding; compression artifacts; in-loop filter; convolutional neural network; transformer; VIDEO; CNN;
D O I
10.1109/ACCESS.2024.3438988
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Joint Video Experts Team (JVET) has explored neural network-based video coding (NNVC) and is trying to introduce NNVC into the versatile video coding (VVC). In NNVC, the NN-based in-loop filter is the most active area, which is very close to deployment of software. Recent NN-based in-loop filters start adopting Transformer to capture context information, but it causes a remarkable increase of complexity to about 1000 kMAC/Pixel. In this paper, we propose a low complexity NN-based in-loop filter for VVC based on convolution and Transformer, named ConvTransNet. ConvTransNet adopts a pyramid structure in feature extraction to capture both global contextual information and local details at multiple scales. Moreover, ConvTransNet combines convolutional neural network (CNN) and Transformer into the in-loop filter. CNN captures local features and reduces compression artifacts in an image, while Transformer captures long-range spatial dependency and enhances global structures in an image. Thus, ConvTransNet enables the NN-based in-loop filter to reduce compression artifacts and enhance visual quality in an image. In ConvTransNet, we use grouped convolutions in CNN and depthwise convolutions in Transformer to reduce the network complexity. Therefore, ConvTransNet successfully captures both local spatial structure and global contextual information in an image and achieves outstanding performance in terms of BD-rate and complexity. Experimental results show that the proposed NN-based in-loop filter based on ConvTransNet achieves average {6.58%, 23.02%, 23.04%} and {8.18%, 22.67%, 22.00%} BD-rate reductions for {Y, U, V} channels over VTM_11.0-NNVC_2.0 anchor under AI and RA configurations, respectively.
引用
收藏
页码:120316 / 120325
页数:10
相关论文
共 50 条
  • [41] Combined spatial temporal based In-loop filter for scalable extension of HEVC
    Dhanalakshmi, A.
    Nagarajan, G.
    ICT EXPRESS, 2020, 6 (04): : 306 - 311
  • [42] REDUCED COMPLEXITY MULTISCALE CNN FOR IN-LOOP VIDEO RESTORATION
    Misra, Kiran
    Segall, Andrew
    Choi, Byeongdoo
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 930 - 934
  • [43] Complexity Reduction of Learned In-Loop Filtering in Video Coding
    Bayliss, Woody
    Murn, Luka
    Izquierdo, Ebroul
    Zhang, Qianni
    Mrak, Marta
    2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 506 - 510
  • [44] Quality-aware CNN-based in-loop filter for Video Coding
    Chen, Wei
    Xiu, Xiaoyu
    Wang, Xianglin
    Chen, Yi-Wen
    Jhu, Hong-Jheng
    Kuo, Che-Wei
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [45] Suboptimal video coding for machines method based on selective activation of in-loop filter
    Kim, Ayoung
    An, Eun-Vin
    Jung, Soon-heung
    Choo, Hyon-Gon
    Seo, Jeongil
    Seo, Kwang-deok
    ETRI JOURNAL, 2024, 46 (03) : 538 - 549
  • [46] Spatial-Temporal Residue Network Based In-Loop Filter for Video Coding
    Jia, Chuanmin
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [47] Programmable In-loop Deblock Filter Processor for Video Decoders
    Janhunen, Janne
    Jaaskelainen, Pekka
    Hannuksela, Jari
    Rintaluoma, Tero
    Kuusela, Aki
    PROCEEDINGS OF THE 2014 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2014), 2014, : 109 - 114
  • [48] Hardware Efficient Integrated In-loop Filter for HEVC Encoder
    Poola, Lakshmi
    Aparna, P.
    IETE JOURNAL OF RESEARCH, 2024, 70 (10) : 7751 - 7762
  • [49] DEPTH IMAGE IN-LOOP FILTER VIA GRAPH CUT
    Zhou, Liguo
    Wang, Zhongyuan
    Fu, Youming
    Chen, Jun
    Xiang, Rui
    Zhong, Rui
    Wang, Shizheng
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 4027 - 4031
  • [50] Dense Inception Attention Neural Network for In-Loop Filter
    Xu, Xiaoyu
    Qian, Jian
    Yu, Li
    Wang, Hongkui
    Zeng, Xing
    Li, Zhengang
    Wang, Ning
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,