Spatiotemporal cue fusion-based saliency extraction and its application in video compression

被引:0
|
作者
Li K. [1 ]
Luo Z. [2 ]
Zhang T. [1 ]
Ruan Y. [2 ]
Zhou D. [1 ]
机构
[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang
[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang
来源
Cognitive Robotics | 2022年 / 2卷
关键词
H.264; Saliency detection; Spatiotemporal information fusion; Video compression;
D O I
10.1016/j.cogr.2022.06.003
中图分类号
学科分类号
摘要
Extracting salient regions plays an important role in computer vision tasks, e.g., object detection, recognition and video compression. Previous saliency detection study is mostly conducted on individual frames and tends to extract saliency with spatial cues. The development of various motion feature further extends the saliency concept to the motion saliency from videos. In contrast to image-based saliency extraction, video-based saliency extraction is more challenging due to the complicated distractors, e.g., the background dynamics and shadows. In this paper, we propose a novel saliency extraction method by fusing temporal and spatial cues. In specific, the long-term and short-term variations are comprehensively fused to extract the temporal cue, which is then utilized to establish the background guidance for generating the spatial cue. Herein, the long-term variations and spatial cues jointly highlight the contrast between objects and the background, which can solve the problem caused by shadows. The short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparing it to both unsupervised and supervised methods. Moreover, this novel saliency extraction model is applied in the video compression tasks, helping to accelerate the video compression task and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors
引用
收藏
页码:177 / 185
页数:8
相关论文
共 50 条
  • [1] Defocus cue and saliency preserving video compression
    Khanna, Meera Thapar
    Chaudhury, Santanu
    Lall, Brejesh
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (06)
  • [2] Target Tracking Based on Spatiotemporal Saliency and Multiscale Appearance Cue Fusion
    Li, Xiangjuan
    Zhao, Chuanyuan
    Yang, Wenyang
    Proceedings - 2022 8th Annual International Conference on Network and Information Systems for Computers, ICNISC 2022, 2022, : 328 - 334
  • [3] A semiautomatic saliency model and its application to video compression
    Lyudvichenko, Vitaliy
    Erofeev, Mikhail
    Gitman, Yury
    Vatolin, Dmitriy
    2017 13TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP), 2017, : 403 - 410
  • [4] Video Object Extraction Based on Spatiotemporal Consistency Saliency Detection
    Guo, Yingchun
    Li, Zhuo
    Liu, Yi
    Yan, Gang
    Yu, Ming
    IEEE ACCESS, 2018, 6 : 35171 - 35181
  • [5] Superpixel-based video saliency detection via the fusion of spatiotemporal saliency and temporal coherency
    Li, Yandi
    Xu, Xiping
    Zhang, Ning
    Du, Enyu
    OPTICAL ENGINEERING, 2019, 58 (08)
  • [6] A Novel Multiresolution Spatiotemporal Saliency Detection Model and Its Applications in Image and Video Compression
    Guo, Chenlei
    Zhang, Liming
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (01) : 185 - 198
  • [7] Video Retargeting based on Adaptive Spatiotemporal Saliency mapping
    Du, Huan
    Xu, Zheng
    Yan, Zhiguo
    2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 248 - 251
  • [8] Spatiotemporal Saliency Detection based Video Quality Assessment
    Jia, Changcheng
    Lu, Wen
    He, Lihuo
    He, Ran
    8TH INTERNATIONAL CONFERENCE ON INTERNET MULTIMEDIA COMPUTING AND SERVICE (ICIMCS2016), 2016, : 340 - 343
  • [9] VIDEO SALIENCY DETECTION BASED ON SPATIOTEMPORAL FEATURE LEARNING
    Lee, Se-Ho
    Kim, Jin-Hwan
    Choi, Kwang Pyo
    Sim, Jae-Young
    Kim, Chang-Su
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 1120 - 1124
  • [10] Saliency-Based Spatiotemporal Attention for Video Captioning
    Chen, Yangyu
    Zhang, Weigang
    Wang, Shuhui
    Li, Liang
    Huang, Qingming
    2018 IEEE FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA BIG DATA (BIGMM), 2018,