Spatiotemporal visual saliency guided perceptual high efficiency video coding with neural network

Cited by: 37
Authors
Zhu, Shiping [1]
Xu, Ziyao [1]
Affiliations
[1] Beihang Univ, Sch Instrumentat Sci & Optoelect Engn, Dept Measurement Control & Informat Technol, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Perception; HD video; Saliency; Video compression; HEVC; Rate-distortion optimization; Model
DOI
10.1016/j.neucom.2017.08.054
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Perceptual video coding systems optimize compression by exploiting different attributes of the human visual system, and attention-based coding is an important class of such systems. Saliency maps, which represent the region of interest (ROI) in the video signal, have become a reliable tool thanks to advances in computing performance and vision algorithms. In this study, we propose a hybrid compression algorithm that uses a deep convolutional neural network to compute spatial saliency and extracts temporal saliency from compressed-domain motion information. The level of uncertainty is then calculated and used to combine the two into the video's saliency map. Afterwards, the quantization parameter (QP) search range in HEVC is dynamically adjusted, and a rate-distortion calculation method is proposed to select the coding mode and guide bit allocation during compression. Experimental results show that the proposed method outperforms state-of-the-art perceptual coding algorithms in both saliency detection and perceptual compression quality. © 2017 Elsevier B.V. All rights reserved.
Pages: 511-522
Number of pages: 12
Related papers
50 in total
  • [1] High Efficiency Video Coding Compliant Perceptual Video Coding Using Entropy Based Visual Saliency Model
    Zeeshan, Muhammad
    Majid, Muhammad
    ENTROPY, 2019, 21 (10)
  • [2] Visual Saliency Based Perceptual Video Coding in HEVC
    Wei, Henglu
    Zhou, Xin
    Zhou, Wei
    Yan, Chang
    Duan, Zhemin
    Shan, Nana
    2016 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2016, : 2547 - 2550
  • [3] Perceptual feature guided rate distortion optimization for high efficiency video coding
    Yang, Aisheng
    Zeng, Huanqiang
    Chen, Jing
    Zhu, Jianqing
    Cai, Canhui
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2017, 28 (04) : 1249 - 1266
  • [4] Perceptual feature guided rate distortion optimization for high efficiency video coding
    Aisheng Yang
    Huanqiang Zeng
    Jing Chen
    Jianqing Zhu
    Canhui Cai
    Multidimensional Systems and Signal Processing, 2017, 28 : 1249 - 1266
  • [5] Visual saliency-based fast intracoding algorithm for high efficiency video coding
    Zhou, Xin
    Shi, Guangming
    Zhou, Wei
    Duan, Zhemin
    JOURNAL OF ELECTRONIC IMAGING, 2017, 26 (01)
  • [6] Perceptual Video Coding based on Visual Saliency Modulated Just Noticeable Distortion
    Cui, Jing
    Xiong, Ruiqin
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 565 - 565
  • [7] Saliency Based Rate Control Scheme for High Efficiency Video Coding
    Bai, Lixun
    Song, Li
    Xie, Rong
    Xie, Jianfeng
    Chen, M.
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016
  • [8] Perceptual Adaptive Lagrangian Multiplier for High Efficiency Video Coding
    Zeng, Huanqiang
    Ngan, King Ngi
    Wang, Miaohui
    2013 PICTURE CODING SYMPOSIUM (PCS), 2013, : 69 - 72
  • [9] Interlayer Restoration Deep Neural Network for Scalable High Efficiency Video Coding
    He, Gang
    Xu, Li
    Lei, Jie
    Xie, Weiying
    Li, Yunsong
    Fan, Yibo
    Zhou, Jinjia
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3217 - 3234
  • [10] Visual saliency guided video compression algorithm
    Gupta, Rupesh
    Khanna, Meera Thapar
    Chaudhury, Santanu
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (09) : 1006 - 1022