Spatiotemporal visual saliency guided perceptual high efficiency video coding with neural network

被引:37
|
作者
Zhu, Shiping [1 ]
Xu, Ziyao [1 ]
机构
[1] Beihang Univ, Sch Instrumentat Sci & Optoelect Engn, Dept Measurement Control & Informat Technol, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Perception; HD video; Saliency; Video compression; HEVC; RATE-DISTORTION OPTIMIZATION; MODEL;
D O I
10.1016/j.neucom.2017.08.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The perceptual video coding systems for optimization have been developed on the basis of different attributes of the human visual system. The attention-based coding system is considered as an important part of it. The saliency map method representing the region-of-interest (ROI) from the video signal has become a reliable method due to advances in the computer performance and the visual algorithms. In the present study, we propose a hybrid compression algorithm that uses the deep convolutional neural network to compute the spatial saliency followed by extraction of the temporal saliency from the compressed-domain motion information. The level of uncertainty is calculated to combine to form the video's saliency map. Afterwards, the QP search range is dynamically adjusted in HEVC, and a rate distortion calculation method is proposed to choose the pattern and guide the allocation of bits during the video compression process. Empirical reporting results proved the superiority of the proposed method over the state-of-the-art perceptual coding algorithms in terms of saliency detection and perceptual compression quality. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:511 / 522
页数:12
相关论文
共 50 条
  • [31] Perceptual sensitivity-based rate control method for high efficiency video coding
    Huanqiang Zeng
    Aisheng Yang
    King Ngi Ngan
    Miaohui Wang
    Multimedia Tools and Applications, 2016, 75 : 10383 - 10396
  • [32] Perceptual sensitivity-based rate control method for high efficiency video coding
    Zeng, Huanqiang
    Yang, Aisheng
    Ngan, King Ngi
    Wang, Miaohui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (17) : 10383 - 10396
  • [33] Object-Based Video Coding by Visual Saliency and Temporal Correlation
    Ogasawara, Kazuya
    Miyazaki, Tomo
    Sugaya, Yoshihiro
    Omachi, Shinichiro
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2020, 8 (01) : 168 - 178
  • [34] STI-Net: Spatiotemporal integration network for video saliency detection
    Zhou, Xiaofei
    Cao, Weipeng
    Gao, Hanxiao
    Ming, Zhong
    Zhang, Jiyong
    INFORMATION SCIENCES, 2023, 628 : 134 - 147
  • [35] Saliency-based coding tree unit-level rate control for high-efficiency video coding
    Wei, Henglu
    Zhou, Wei
    Zhou, Xin
    Bai, Rui
    Duan, Zhemin
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (04)
  • [36] Video Processing for Human Perceptual Visual Quality-Oriented Video Coding
    Oh, Hyungsuk
    Kim, Wonha
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (04) : 1524 - 1533
  • [37] Improving the Efficiency of Video Coding by using Perceptual Preprocessing Filter
    Vanam, Rahul
    Reznik, Yuriy A.
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 524 - 524
  • [38] SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION
    Fischer, Kristian
    Fleckenstein, Felix
    Herglotz, Christian
    Kaup, Andre
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1505 - 1509
  • [39] Visual sensitivity guided bit allocation for video coding
    Tang, CW
    Chen, CH
    Yu, YH
    Tsai, CJ
    IEEE TRANSACTIONS ON MULTIMEDIA, 2006, 8 (01) : 11 - 18
  • [40] Saliency guided Wavelet compression for low-bitrate Image and Video coding
    Barua, Souptik
    Mitra, Kaushik
    Veeraraghavan, Ashok
    2015 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2015, : 1185 - 1189