Spatiotemporal visual saliency guided perceptual high efficiency video coding with neural network

被引:37
|
作者
Zhu, Shiping [1 ]
Xu, Ziyao [1 ]
机构
[1] Beihang Univ, Sch Instrumentat Sci & Optoelect Engn, Dept Measurement Control & Informat Technol, Beijing 100191, Peoples R China
基金
中国国家自然科学基金;
关键词
Perception; HD video; Saliency; Video compression; HEVC; RATE-DISTORTION OPTIMIZATION; MODEL;
D O I
10.1016/j.neucom.2017.08.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The perceptual video coding systems for optimization have been developed on the basis of different attributes of the human visual system. The attention-based coding system is considered as an important part of it. The saliency map method representing the region-of-interest (ROI) from the video signal has become a reliable method due to advances in the computer performance and the visual algorithms. In the present study, we propose a hybrid compression algorithm that uses the deep convolutional neural network to compute the spatial saliency followed by extraction of the temporal saliency from the compressed-domain motion information. The level of uncertainty is calculated to combine to form the video's saliency map. Afterwards, the QP search range is dynamically adjusted in HEVC, and a rate distortion calculation method is proposed to choose the pattern and guide the allocation of bits during the video compression process. Empirical reporting results proved the superiority of the proposed method over the state-of-the-art perceptual coding algorithms in terms of saliency detection and perceptual compression quality. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:511 / 522
页数:12
相关论文
共 50 条
  • [21] Statistical and spatiotemporal correlation based low-complexity video coding for high-efficiency video coding
    Shanghai University, School of Communication and Information Engineering, No. 333, Nanchen Road, Baoshan District
    Shanghai
    200444, China
    J. Electron. Imaging, 2
  • [22] Perceptual loss guided Generative adversarial network for saliency detection
    Cai, Xiaoxu
    Wang, Gaige
    Lou, Jianwen
    Jian, Muwei
    Dong, Junyu
    Chen, Rung-Ching
    Stevens, Brett
    Yu, Hui
    INFORMATION SCIENCES, 2024, 654
  • [23] Enhanced Bi-Prediction With Convolutional Neural Network for High-Efficiency Video Coding
    Zhao, Zhenghui
    Wang, Shiqi
    Wang, Shanshe
    Zhang, Xinfeng
    Ma, Siwei
    Yang, Jiansheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) : 3291 - 3301
  • [24] Video saliency prediction using enhanced spatiotemporal alignment network
    Chen, Jin
    Song, Huihui
    Zhang, Kaihua
    Liu, Bo
    Liu, Qingshan
    PATTERN RECOGNITION, 2021, 109
  • [25] Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction
    Jin, Yingjie
    Zhou, Xiaofei
    Zhang, Zhenjie
    Fang, Hao
    Shi, Ran
    Xu, Xiaobin
    IMAGE AND VISION COMPUTING, 2025, 154
  • [26] Low-Complexity Saliency Detection Algorithm for Fast Perceptual Video Coding
    Liu, Pengyu
    Jia, Kebin
    SCIENTIFIC WORLD JOURNAL, 2013,
  • [27] A Practical High Efficiency Video Coding Solution for Visual Sensor Network using Raspberry Pi Platform
    Thao Nguyen Thi Huong
    Huy Phi Cong
    Tien Vu Huu
    Xiem HoangVan
    2018 IEEE 12TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2018), 2018, : 64 - 68
  • [28] Semantics-Guided and Saliency-Focused Learning of Perceptual Video Compression
    Li, Bingyao
    IEEE ACCESS, 2024, 12 : 68611 - 68623
  • [29] Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding
    Jia, Chuanmin
    Wang, Shiqi
    Zhang, Xinfeng
    Wang, Shanshe
    Liu, Jiaying
    Pu, Shiliang
    Ma, Siwei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3343 - 3356
  • [30] High efficiency video coding
    Rao, Kamisetty Ramamohan
    2016 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2016, : 11 - 11