CONVOLUTIONAL NEURAL NETWORK-BASED INVERTIBLE HALF-PIXEL INTERPOLATION FILTER FOR VIDEO CODING

被引:0
|
作者
Yan, Ning [1 ]
Liu, Dong [1 ]
Li, Bin [2 ]
Li, Houqiang [1 ]
Xu, Tong [1 ]
Wu, Feng [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Anhui, Peoples R China
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
关键词
Convolutional neural network; High Efficiency Video Coding; interpolation filter; invertibility;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Fractional-pixel interpolation has been widely used in the modern video coding standards to improve the accuracy of motion compensated prediction. Traditional interpolation filters are designed based on the signal processing theory. However, video signal is non-stationary, making the traditional methods less effective. In this paper, we reveal that the interpolation filter can not only generate the fractional pixels from the integer pixels, but also reconstruct the integer pixels from the fractional ones. This property is called invertibility. Inspired by the invertibility of fractional-pixel interpolation, we propose an end-to-end scheme based on convolutional neural network (CNN) to derive the invertible interpolation filter, termed CNNInvIF. CNNInvIF does not need the "ground-truth" of fractional pixels for training. Experimental results show that the proposed CNNInvIF can achieve up to 4.6% and on average 2.2% BD-rate reduction than HEVC under the low-delay P configuration.
引用
收藏
页码:201 / 205
页数:5
相关论文
共 50 条
  • [41] Neural network-based cross-channel chroma prediction for versatile video coding
    Liang, Fang
    Zhang, Jingde
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (09): : 12166 - 12185
  • [42] A CONVOLUTIONAL NEURAL NETWORK-BASED MODEL OF NEURAL PATHWAYS IN THE RETINA
    Zamani, Yasin
    Nategh, Neda
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 6906 - 6909
  • [43] A Group Variational Transformation Neural Network for Fractional Interpolation of Video Coding
    Xia, Sifeng
    Yang, Wenhan
    Hu, Yueyu
    Ma, Siwei
    Liu, Jiaying
    2018 DATA COMPRESSION CONFERENCE (DCC 2018), 2018, : 127 - 136
  • [44] CONVOLUTIONAL NEURAL NETWORK-BASED FRACTAL CODING METHOD FOR IMAGE TRANSLATION IN MULTIMODAL CHANGE DETECTION
    Radoi, Anamaria
    Unsalan, Melisa
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 1063 - 1066
  • [45] Recurrent Neural Network-Based Video Compression
    Montajabi, Zahra
    Ghassab, Vahid Khorasani
    Bouguila, Nizar
    2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 925 - 930
  • [46] Invertible three-dimensional analysis/synthesis system for video coding with half-pixel-accurate motion compensation
    Hsiang, ST
    Woods, JW
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 537 - 546
  • [47] Image interpolation with spiking neural network based pixel similarity
    Kilicaslan, Mahmut
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (10) : 6925 - 6936
  • [48] Convolutional Neural Network-based harmonic mitigation technique for an adaptive shunt active power filter
    Sugavanam, K. R.
    Mohana Sundaram, K.
    Jeyabharath, R.
    Veena, P.
    AUTOMATIKA, 2021, 62 (3-4) : 471 - 485
  • [49] Residual Reconstruction Algorithm Based on Half-Pixel Multi-Hypothesis Prediction for Distributed Compressive Video Sensing
    Tong, Ying
    Chen, Rui
    Yang, Jie
    Wu, Minghu
    INTERNATIONAL JOURNAL OF MOBILE COMPUTING AND MULTIMEDIA COMMUNICATIONS, 2018, 9 (04) : 16 - 33
  • [50] Convolutional Neural Network-Based Automated System for Dog Tracking and Emotion Recognition in Video Surveillance
    Chen, Huan-Yu
    Lin, Chuen-Horng
    Lai, Jyun-Wei
    Chan, Yung-Kuan
    APPLIED SCIENCES-BASEL, 2023, 13 (07):