Spatiotemporal cue fusion-based saliency extraction and its application in video compression

被引：0

作者：

Li K. ^{[1
]}

Luo Z. ^{[2
]}

Zhang T. ^{[1
]}

Ruan Y. ^{[2
]}

Zhou D. ^{[1
]}

机构：

[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang

[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang

来源：

Cognitive Robotics | 2022年 / 2卷

关键词：

H.264; Saliency detection; Spatiotemporal information fusion; Video compression;

D O I：

10.1016/j.cogr.2022.06.003

中图分类号：

学科分类号：

摘要：

Extracting salient regions plays an important role in computer vision tasks, e.g., object detection, recognition and video compression. Previous saliency detection study is mostly conducted on individual frames and tends to extract saliency with spatial cues. The development of various motion feature further extends the saliency concept to the motion saliency from videos. In contrast to image-based saliency extraction, video-based saliency extraction is more challenging due to the complicated distractors, e.g., the background dynamics and shadows. In this paper, we propose a novel saliency extraction method by fusing temporal and spatial cues. In specific, the long-term and short-term variations are comprehensively fused to extract the temporal cue, which is then utilized to establish the background guidance for generating the spatial cue. Herein, the long-term variations and spatial cues jointly highlight the contrast between objects and the background, which can solve the problem caused by shadows. The short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparing it to both unsupervised and supervised methods. Moreover, this novel saliency extraction model is applied in the video compression tasks, helping to accelerate the video compression task and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors

引用

页码：177 / 185

页数：8

共 50 条

[31] Spatiotemporal Saliency Detection for Video Sequences Based on Random Walk With Restart
Kim, Hansang
Kim, Youngbae
Sim, Jae-Young
Kim, Chang-Su
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (08) : 2552 - 2564
[32] Fabric Defect Detection via Multi-scale Feature Fusion-Based Saliency
Liu, Zhoufeng
Huang, Ning
Li, Chunlei
Guo, Zijing
Gao, Chengli
PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 240 - 251
[33] Beyond Fusion: The Application of Fusion-Based Microwave Technology to Other Industries
Anderson, James P.
Doane, John L.
Grunloh, Howard J.
Brookman, Michael W.
2019 44TH INTERNATIONAL CONFERENCE ON INFRARED, MILLIMETER, AND TERAHERTZ WAVES (IRMMW-THZ), 2019,
[34] Video compression by computer and its application
Hu Yu
TRENDS IN BUILDING MATERIALS RESEARCH, PTS 1 AND 2, 2012, 450-451 : 1293 - 1296
[35] The effects of fusion-based feature extraction for fabric defect classification
Ciklacandir, Fatma Gunseli Yasar
Utku, Semih
Ozdemir, Hakan
TEXTILE RESEARCH JOURNAL, 2023, 93 (23-24) : 5448 - 5460
[36] Video object segmentation with shape cue based on spatiotemporal superpixel neighbourhood
Tian, Zhiqiang
Zheng, Nanning
Xue, Jianru
Lan, Xuguang
Li, Ce
Zhou, Gang
IET COMPUTER VISION, 2014, 8 (01) : 16 - 25
[37] Target Tracking Method in Aerial Video Based on Saliency Fusion
Han, Jie
Guo, Baolong
Sun, Wei
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2015, 8 : 723 - 727
[38] Fusion-based Spatiotemporal Convolutions with Constant Temporal Snapshots for Sign Language Recognition
Han, Yiming
Fan, Xiaocong
Bhosale, Radhika
Sundaram, Ramakrishnan
Liaw, Jonathan
2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
[39] Saliency Prediction using Scene Motion for JND based Video Compression
Wang, Ruei-Jiun
Chiu, Ching-Te
2011 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2011, : 73 - 77
[40] Video Copy Detection Based on Spatiotemporal Fusion Model
Jianmin Li
Tsinghua Science and Technology, 2012, 17 (01) : 51 - 59

← 1 2 3 4 5 →