Spatiotemporal cue fusion-based saliency extraction and its application in video compression

Cited by: 0

Authors:

Li K. [1]

Luo Z. [2]

Zhang T. [1]

Ruan Y. [2]

Zhou D. [1]

Affiliations:

[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang

[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang

Source:

Cognitive Robotics | 2022 / Vol. 2

Keywords:

H.264; Saliency detection; Spatiotemporal information fusion; Video compression

DOI:

10.1016/j.cogr.2022.06.003

Abstract:

Extracting salient regions plays an important role in computer vision tasks such as object detection, recognition, and video compression. Previous saliency detection studies have mostly been conducted on individual frames and tend to extract saliency from spatial cues. The development of various motion features has extended the saliency concept to motion saliency in videos. Compared with image-based saliency extraction, video-based saliency extraction is more challenging because of complicated distractors such as background dynamics and shadows. In this paper, we propose a novel saliency extraction method that fuses temporal and spatial cues. Specifically, long-term and short-term variations are fused to extract the temporal cue, which is then used to establish the background guidance for generating the spatial cue. The long-term variations and spatial cues jointly highlight the contrast between objects and the background, which addresses the problem caused by shadows, while the short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparison with both unsupervised and supervised methods. Moreover, the saliency extraction model is applied to video compression, where it helps to accelerate compression and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors
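
The abstract reports PSNR for the region of interest but this record does not give the evaluation protocol. As a rough illustration only, PSNR restricted to an ROI can be computed by taking the mean squared error over the masked pixels alone. The sketch below assumes 8-bit grayscale frames (peak value 255) and a boolean saliency mask; the helper name `roi_psnr` and its parameters are hypothetical and are not taken from the paper.

```python
import numpy as np


def roi_psnr(reference, decoded, roi_mask, peak=255.0):
    """PSNR in dB over the pixels selected by roi_mask (hypothetical helper).

    Assumes reference and decoded are same-shaped grayscale frames and
    roi_mask is a boolean array of that shape marking the salient region.
    """
    ref = np.asarray(reference, dtype=np.float64)
    dec = np.asarray(decoded, dtype=np.float64)
    mask = np.asarray(roi_mask, dtype=bool)

    if ref.shape != dec.shape or mask.shape != ref.shape:
        raise ValueError("reference, decoded and roi_mask must share one shape")
    if not mask.any():
        raise ValueError("roi_mask selects no pixels")

    # Mean squared error over ROI pixels only; pixels outside the mask are ignored.
    mse = np.mean((ref[mask] - dec[mask]) ** 2)
    if mse == 0.0:
        return float("inf")  # identical inside the ROI
    return 10.0 * np.log10(peak ** 2 / mse)
```

Because errors outside the mask do not enter the MSE, an encoder that spends more bits on the salient region can score a higher ROI PSNR even if the full-frame PSNR is unchanged or lower.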

Pages: 177-185

Page count: 8
