Spatiotemporal cue fusion-based saliency extraction and its application in video compression

被引：0

作者：

Li K. ^{[1
]}

Luo Z. ^{[2
]}

Zhang T. ^{[1
]}

Ruan Y. ^{[2
]}

Zhou D. ^{[1
]}

机构：

[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang

[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang

来源：

Cognitive Robotics | 2022年 / 2卷

关键词：

H.264; Saliency detection; Spatiotemporal information fusion; Video compression;

D O I：

10.1016/j.cogr.2022.06.003

中图分类号：

学科分类号：

摘要：

Extracting salient regions plays an important role in computer vision tasks, e.g., object detection, recognition and video compression. Previous saliency detection study is mostly conducted on individual frames and tends to extract saliency with spatial cues. The development of various motion feature further extends the saliency concept to the motion saliency from videos. In contrast to image-based saliency extraction, video-based saliency extraction is more challenging due to the complicated distractors, e.g., the background dynamics and shadows. In this paper, we propose a novel saliency extraction method by fusing temporal and spatial cues. In specific, the long-term and short-term variations are comprehensively fused to extract the temporal cue, which is then utilized to establish the background guidance for generating the spatial cue. Herein, the long-term variations and spatial cues jointly highlight the contrast between objects and the background, which can solve the problem caused by shadows. The short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparing it to both unsupervised and supervised methods. Moreover, this novel saliency extraction model is applied in the video compression tasks, helping to accelerate the video compression task and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors

引用

页码：177 / 185

页数：8

共 50 条

[21] Video Saliency Detection Method Based on Spatiotemporal Features of Superpixels
Li Yandi
Xu Xiping
ACTA OPTICA SINICA, 2019, 39 (01)
[22] Spatio-Temporal Saliency Map and Its Application in Bitsteam Extraction for Scalable Video Coding
Liu, Wei
Chen, Xu
Liang, Yong-sheng
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 173 - 178
[23] A Saliency Based Approach for Foreground Extraction from a Video
Ahsan, Sk Md Masudul
Nafew, Abu Naser Md
Amit, Rifat Haque
2017 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT 2017), 2017,
[24] Explicit Performance Metric Optimization for Fusion-Based Video Retrieval
Kim, Ilseo
Oh, Sangmin
Byun, Byungki
Perera, A. G. Amitha
Lee, Chin-Hui
COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 395 - 405
[25] Image fusion-based video deraining using sparse representation
Mi, Zetian
Shang, Jinxia
Zhou, Huan
Wang, Minghui
ELECTRONICS LETTERS, 2016, 52 (18) : 1528 - 1529
[26] Robust video/ultrasonic fusion-based estimation for automotive applications
Pathirana, Pubudu N.
Lim, Allan E. K.
Savkin, Andrey V.
Hodgson, Peter D.
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2007, 56 (04) : 1631 - 1639
[27] Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction
Zhao, Minyi
Xu, Yi
Zhou, Shuigeng
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5646 - 5654
[28] Video saliency detection based on low-level saliency fusion and saliency-aware geodesic
Li, Weisheng
Feng, Siqin
Guan, Hua-Ping
Zhan, Ziwei
Gong, Cheng
JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (01)
[29] Spatiotemporal module for video saliency prediction based on self-attention
Wang, Yuhao
Liu, Zhuoran
Xia, Yibo
Zhu, Chunbo
Zhao, Danpei
IMAGE AND VISION COMPUTING, 2021, 112
[30] A spatiotemporal weighted dissimilarity-based method for video saliency detection
Duan, Lijuan
Xi, Tao
Cui, Song
Qi, Honggang
Bovik, Alan C.
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2015, 38 : 45 - 56

← 1 2 3 4 5 →