Spatiotemporal cue fusion-based saliency extraction and its application in video compression

被引:0
|
作者
Li K. [1 ]
Luo Z. [2 ]
Zhang T. [1 ]
Ruan Y. [2 ]
Zhou D. [1 ]
机构
[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang
[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang
来源
Cognitive Robotics | 2022年 / 2卷
关键词
H.264; Saliency detection; Spatiotemporal information fusion; Video compression;
D O I
10.1016/j.cogr.2022.06.003
中图分类号
学科分类号
摘要
Extracting salient regions plays an important role in computer vision tasks, e.g., object detection, recognition and video compression. Previous saliency detection study is mostly conducted on individual frames and tends to extract saliency with spatial cues. The development of various motion feature further extends the saliency concept to the motion saliency from videos. In contrast to image-based saliency extraction, video-based saliency extraction is more challenging due to the complicated distractors, e.g., the background dynamics and shadows. In this paper, we propose a novel saliency extraction method by fusing temporal and spatial cues. In specific, the long-term and short-term variations are comprehensively fused to extract the temporal cue, which is then utilized to establish the background guidance for generating the spatial cue. Herein, the long-term variations and spatial cues jointly highlight the contrast between objects and the background, which can solve the problem caused by shadows. The short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparing it to both unsupervised and supervised methods. Moreover, this novel saliency extraction model is applied in the video compression tasks, helping to accelerate the video compression task and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors
引用
收藏
页码:177 / 185
页数:8
相关论文
共 50 条
  • [41] Video copy detection based on spatiotemporal fusion model
    Li, Jianmin
    Liang, Yingyu
    Zhang, Bo
    Tsinghua Science and Technology, 2012, 17 (01) : 51 - 59
  • [42] Video saliency detection via bagging-based prediction and spatiotemporal propagation
    Zhou, Xiaofei
    Liu, Zhi
    Li, Kai
    Sun, Guangling
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2018, 51 : 131 - 143
  • [43] Spatiotemporal Saliency Detection Based on Maximum Consistency Superpixels Merging for Video Analysis
    Zhang, Jianhua
    Chen, Jingbo
    Wang, Qichao
    Chen, Shengyong
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (01) : 606 - 614
  • [44] Stereoscopic video saliency detection based on spatiotemporal correlation and depth confidence optimization
    Zhang, Ping
    Liu, Jingwen
    Wang, Xiaoyang
    Pu, Tian
    Fei, Chun
    Guo, Zhengkui
    NEUROCOMPUTING, 2020, 377 : 256 - 268
  • [45] Novelty-based Spatiotemporal Saliency Detection for Prediction of Gaze in Egocentric Video
    Polatsek, Patrik
    Benesova, Wanda
    Paletta, Lucas
    Perko, Roland
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (03) : 394 - 398
  • [46] Parallax information fusion-based for dance moving image posture extraction
    Lyu, Yin
    Teng, Lin
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (36):
  • [47] Extraction of Refined Deep Feature and Its Application in Saliency Detection
    Fang Z.
    Cao T.
    Zheng Y.
    Yang J.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (02): : 324 - 331
  • [48] SALI360: Design and Implementation of Saliency based Video Compression for 360° Video Streaming
    Baek, Duin
    Kang, Hangil
    Ryoo, Jihoon
    MMSYS'20: PROCEEDINGS OF THE 2020 MULTIMEDIA SYSTEMS CONFERENCE, 2020, : 141 - 152
  • [49] Fusion-Based Approach for Respiratory Rate Recognition From Facial Video Images
    Fiedler, Marc-Andre
    Rapczynski, Micha
    Al-Hamadi, Ayoub
    IEEE ACCESS, 2020, 8 (08): : 130036 - 130047
  • [50] Multiple feature fusion-based video face tracking for IoT big data
    Liu, Zhifeng
    Ou, Jiayu
    Huo, Wenxiao
    Yan, Yejin
    Li, Tianping
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (12) : 10650 - 10669