Spatiotemporal cue fusion-based saliency extraction and its application in video compression

被引:0
|
作者
Li K. [1 ]
Luo Z. [2 ]
Zhang T. [1 ]
Ruan Y. [2 ]
Zhou D. [1 ]
机构
[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang
[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang
来源
Cognitive Robotics | 2022年 / 2卷
关键词
H.264; Saliency detection; Spatiotemporal information fusion; Video compression;
D O I
10.1016/j.cogr.2022.06.003
中图分类号
学科分类号
摘要
Extracting salient regions plays an important role in computer vision tasks, e.g., object detection, recognition and video compression. Previous saliency detection study is mostly conducted on individual frames and tends to extract saliency with spatial cues. The development of various motion feature further extends the saliency concept to the motion saliency from videos. In contrast to image-based saliency extraction, video-based saliency extraction is more challenging due to the complicated distractors, e.g., the background dynamics and shadows. In this paper, we propose a novel saliency extraction method by fusing temporal and spatial cues. In specific, the long-term and short-term variations are comprehensively fused to extract the temporal cue, which is then utilized to establish the background guidance for generating the spatial cue. Herein, the long-term variations and spatial cues jointly highlight the contrast between objects and the background, which can solve the problem caused by shadows. The short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparing it to both unsupervised and supervised methods. Moreover, this novel saliency extraction model is applied in the video compression tasks, helping to accelerate the video compression task and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors
引用
收藏
页码:177 / 185
页数:8
相关论文
共 50 条
  • [21] Video Saliency Detection Method Based on Spatiotemporal Features of Superpixels
    Li Yandi
    Xu Xiping
    ACTA OPTICA SINICA, 2019, 39 (01)
  • [22] Spatio-Temporal Saliency Map and Its Application in Bitsteam Extraction for Scalable Video Coding
    Liu, Wei
    Chen, Xu
    Liang, Yong-sheng
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 173 - 178
  • [23] A Saliency Based Approach for Foreground Extraction from a Video
    Ahsan, Sk Md Masudul
    Nafew, Abu Naser Md
    Amit, Rifat Haque
    2017 3RD INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT 2017), 2017,
  • [24] Explicit Performance Metric Optimization for Fusion-Based Video Retrieval
    Kim, Ilseo
    Oh, Sangmin
    Byun, Byungki
    Perera, A. G. Amitha
    Lee, Chin-Hui
    COMPUTER VISION - ECCV 2012, PT III, 2012, 7585 : 395 - 405
  • [25] Image fusion-based video deraining using sparse representation
    Mi, Zetian
    Shang, Jinxia
    Zhou, Huan
    Wang, Minghui
    ELECTRONICS LETTERS, 2016, 52 (18) : 1528 - 1529
  • [26] Robust video/ultrasonic fusion-based estimation for automotive applications
    Pathirana, Pubudu N.
    Lim, Allan E. K.
    Savkin, Andrey V.
    Hodgson, Peter D.
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2007, 56 (04) : 1631 - 1639
  • [27] Recursive Fusion and Deformable Spatiotemporal Attention for Video Compression Artifact Reduction
    Zhao, Minyi
    Xu, Yi
    Zhou, Shuigeng
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5646 - 5654
  • [28] Video saliency detection based on low-level saliency fusion and saliency-aware geodesic
    Li, Weisheng
    Feng, Siqin
    Guan, Hua-Ping
    Zhan, Ziwei
    Gong, Cheng
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (01)
  • [29] Spatiotemporal module for video saliency prediction based on self-attention
    Wang, Yuhao
    Liu, Zhuoran
    Xia, Yibo
    Zhu, Chunbo
    Zhao, Danpei
    IMAGE AND VISION COMPUTING, 2021, 112
  • [30] A spatiotemporal weighted dissimilarity-based method for video saliency detection
    Duan, Lijuan
    Xi, Tao
    Cui, Song
    Qi, Honggang
    Bovik, Alan C.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2015, 38 : 45 - 56