Spatiotemporal cue fusion-based saliency extraction and its application in video compression

被引:0
|
作者
Li K. [1 ]
Luo Z. [2 ]
Zhang T. [1 ]
Ruan Y. [2 ]
Zhou D. [1 ]
机构
[1] School of Mechanical and Electrical Engineering, Nanchang Institute of Technology, Nanchang
[2] School of Electronics and Information, Nanchang Institute of Technology, Nanchang
来源
Cognitive Robotics | 2022年 / 2卷
关键词
H.264; Saliency detection; Spatiotemporal information fusion; Video compression;
D O I
10.1016/j.cogr.2022.06.003
中图分类号
学科分类号
摘要
Extracting salient regions plays an important role in computer vision tasks, e.g., object detection, recognition and video compression. Previous saliency detection study is mostly conducted on individual frames and tends to extract saliency with spatial cues. The development of various motion feature further extends the saliency concept to the motion saliency from videos. In contrast to image-based saliency extraction, video-based saliency extraction is more challenging due to the complicated distractors, e.g., the background dynamics and shadows. In this paper, we propose a novel saliency extraction method by fusing temporal and spatial cues. In specific, the long-term and short-term variations are comprehensively fused to extract the temporal cue, which is then utilized to establish the background guidance for generating the spatial cue. Herein, the long-term variations and spatial cues jointly highlight the contrast between objects and the background, which can solve the problem caused by shadows. The short-term variations contribute to the removal of background dynamics. Spatiotemporal cues are fully exploited to constrain the saliency extraction across frames. The saliency extraction performance of our method is demonstrated by comparing it to both unsupervised and supervised methods. Moreover, this novel saliency extraction model is applied in the video compression tasks, helping to accelerate the video compression task and achieve a larger PSNR value for the region of interest (ROI). © 2022 The Authors
引用
收藏
页码:177 / 185
页数:8
相关论文
共 50 条
  • [31] Spatiotemporal Saliency Detection for Video Sequences Based on Random Walk With Restart
    Kim, Hansang
    Kim, Youngbae
    Sim, Jae-Young
    Kim, Chang-Su
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (08) : 2552 - 2564
  • [32] Fabric Defect Detection via Multi-scale Feature Fusion-Based Saliency
    Liu, Zhoufeng
    Huang, Ning
    Li, Chunlei
    Guo, Zijing
    Gao, Chengli
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 240 - 251
  • [33] Beyond Fusion: The Application of Fusion-Based Microwave Technology to Other Industries
    Anderson, James P.
    Doane, John L.
    Grunloh, Howard J.
    Brookman, Michael W.
    2019 44TH INTERNATIONAL CONFERENCE ON INFRARED, MILLIMETER, AND TERAHERTZ WAVES (IRMMW-THZ), 2019,
  • [34] Video compression by computer and its application
    Hu Yu
    TRENDS IN BUILDING MATERIALS RESEARCH, PTS 1 AND 2, 2012, 450-451 : 1293 - 1296
  • [35] The effects of fusion-based feature extraction for fabric defect classification
    Ciklacandir, Fatma Gunseli Yasar
    Utku, Semih
    Ozdemir, Hakan
    TEXTILE RESEARCH JOURNAL, 2023, 93 (23-24) : 5448 - 5460
  • [36] Video object segmentation with shape cue based on spatiotemporal superpixel neighbourhood
    Tian, Zhiqiang
    Zheng, Nanning
    Xue, Jianru
    Lan, Xuguang
    Li, Ce
    Zhou, Gang
    IET COMPUTER VISION, 2014, 8 (01) : 16 - 25
  • [37] Target Tracking Method in Aerial Video Based on Saliency Fusion
    Han, Jie
    Guo, Baolong
    Sun, Wei
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2015, 8 : 723 - 727
  • [38] Fusion-based Spatiotemporal Convolutions with Constant Temporal Snapshots for Sign Language Recognition
    Han, Yiming
    Fan, Xiaocong
    Bhosale, Radhika
    Sundaram, Ramakrishnan
    Liaw, Jonathan
    2021 16TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2021), 2021,
  • [39] Saliency Prediction using Scene Motion for JND based Video Compression
    Wang, Ruei-Jiun
    Chiu, Ching-Te
    2011 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2011, : 73 - 77
  • [40] Video Copy Detection Based on Spatiotemporal Fusion Model
    Jianmin Li
    Tsinghua Science and Technology, 2012, 17 (01) : 51 - 59