Spatiotemporal visual saliency guided perceptual high efficiency video coding with neural network

被引：37

作者：

Zhu, Shiping ^{[1
]}

Xu, Ziyao ^{[1
]}

机构：

[1] Beihang Univ, Sch Instrumentat Sci & Optoelect Engn, Dept Measurement Control & Informat Technol, Beijing 100191, Peoples R China

来源：

NEUROCOMPUTING | 2018年 / 275卷

基金：

中国国家自然科学基金;

关键词：

Perception; HD video; Saliency; Video compression; HEVC; RATE-DISTORTION OPTIMIZATION; MODEL;

D O I：

10.1016/j.neucom.2017.08.054

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The perceptual video coding systems for optimization have been developed on the basis of different attributes of the human visual system. The attention-based coding system is considered as an important part of it. The saliency map method representing the region-of-interest (ROI) from the video signal has become a reliable method due to advances in the computer performance and the visual algorithms. In the present study, we propose a hybrid compression algorithm that uses the deep convolutional neural network to compute the spatial saliency followed by extraction of the temporal saliency from the compressed-domain motion information. The level of uncertainty is calculated to combine to form the video's saliency map. Afterwards, the QP search range is dynamically adjusted in HEVC, and a rate distortion calculation method is proposed to choose the pattern and guide the allocation of bits during the video compression process. Empirical reporting results proved the superiority of the proposed method over the state-of-the-art perceptual coding algorithms in terms of saliency detection and perceptual compression quality. (C) 2017 Elsevier B.V. All rights reserved.

引用

页码：511 / 522

页数：12

共 50 条

[21] Statistical and spatiotemporal correlation based low-complexity video coding for high-efficiency video coding
Shanghai University, School of Communication and Information Engineering, No. 333, Nanchen Road, Baoshan District
Shanghai
200444, China
J. Electron. Imaging, 2
[22] Perceptual loss guided Generative adversarial network for saliency detection
Cai, Xiaoxu
Wang, Gaige
Lou, Jianwen
Jian, Muwei
Dong, Junyu
Chen, Rung-Ching
Stevens, Brett
Yu, Hui
INFORMATION SCIENCES, 2024, 654
[23] Enhanced Bi-Prediction With Convolutional Neural Network for High-Efficiency Video Coding
Zhao, Zhenghui
Wang, Shiqi
Wang, Shanshe
Zhang, Xinfeng
Ma, Siwei
Yang, Jiansheng
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (11) : 3291 - 3301
[24] Video saliency prediction using enhanced spatiotemporal alignment network
Chen, Jin
Song, Huihui
Zhang, Kaihua
Liu, Bo
Liu, Qingshan
PATTERN RECOGNITION, 2021, 109
[25] Hierarchical spatiotemporal Feature Interaction Network for video saliency prediction
Jin, Yingjie
Zhou, Xiaofei
Zhang, Zhenjie
Fang, Hao
Shi, Ran
Xu, Xiaobin
IMAGE AND VISION COMPUTING, 2025, 154
[26] Low-Complexity Saliency Detection Algorithm for Fast Perceptual Video Coding
Liu, Pengyu
Jia, Kebin
SCIENTIFIC WORLD JOURNAL, 2013,
[27] A Practical High Efficiency Video Coding Solution for Visual Sensor Network using Raspberry Pi Platform
Thao Nguyen Thi Huong
Huy Phi Cong
Tien Vu Huu
Xiem HoangVan
2018 IEEE 12TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP (MCSOC 2018), 2018, : 64 - 68
[28] Semantics-Guided and Saliency-Focused Learning of Perceptual Video Compression
Li, Bingyao
IEEE ACCESS, 2024, 12 : 68611 - 68623
[29] Content-Aware Convolutional Neural Network for In-Loop Filtering in High Efficiency Video Coding
Jia, Chuanmin
Wang, Shiqi
Zhang, Xinfeng
Wang, Shanshe
Liu, Jiaying
Pu, Shiliang
Ma, Siwei
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (07) : 3343 - 3356
[30] High efficiency video coding
Rao, Kamisetty Ramamohan
2016 SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS (SPA), 2016, : 11 - 11

← 1 2 3 4 5 →