Video Saliency Detection Using Deep Convolutional Neural Networks

被引:2
|
作者
Zhou, Xiaofei [1 ,2 ,3 ]
Liu, Zhi [2 ,3 ]
Gong, Chen [4 ]
Li, Gongyang [2 ,3 ]
Huang, Mengke [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou, Peoples R China
[2] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Minist Educ, Key Lab Intelligent Percept & Syst High Dimens In, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Video saliency; Convolutional neural networks; Feature aggregation; VISUAL-ATTENTION; SEGMENTATION; IMAGE; MODEL;
D O I
10.1007/978-3-030-03335-4_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous deep learning based efforts have been done for image saliency detection, and thus, it is a natural idea that we can construct video saliency model on basis of these image saliency models in an effective way. Besides, as for the limited number of training videos, existing video saliency model is trained with large-scale synthetic video data. In this paper, we construct video saliency model based on existing image saliency model and perform training on the limited video data. Concretely, our video saliency model consists of three steps including feature extraction, feature aggregation and spatial refinement. Firstly, the concatenation of current frame and its optical flow image is fed into the feature extraction network, yielding feature maps. Then, a tensor, which consists of the generated feature maps and the original information including the current frame and the optical flow image, is passed to the aggregation network, in which the original information can provide complementary information for aggregation. Finally, in order to obtain a high-quality saliency map with well-defined boundaries, the output of aggregation network and the current frame are used to perform spatial refinement, yielding the final saliency map for the current frame. The extensive qualitative and quantitative experiments on two challenging video datasets show that the proposed model consistently outperforms the state-of-the-art saliency models for detecting salient objects in videos.
引用
收藏
页码:308 / 319
页数:12
相关论文
共 50 条
  • [31] Multiorgan structures detection using deep convolutional neural networks
    Onieva, Jorge Onieva
    Serrano, German Gonzalez
    Young, Thomas P.
    Washko, George R.
    Ledesma Carbayo, Maria Jesus
    Estepar, Raul San Jose
    MEDICAL IMAGING 2018: IMAGE PROCESSING, 2018, 10574
  • [32] Deep3DSaliency: Deep Stereoscopic Video Saliency Detection Model by 3D Convolutional Networks
    Fang, Yuming
    Ding, Guanqun
    Li, Jia
    Fang, Zhijun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2305 - 2318
  • [33] Context-aware saliency detection for image retargeting using convolutional neural networks
    Ahmadi, Mahdi
    Karimi, Nader
    Samavi, Shadrokh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 11917 - 11941
  • [34] Context-aware saliency detection for image retargeting using convolutional neural networks
    Mahdi Ahmadi
    Nader Karimi
    Shadrokh Samavi
    Multimedia Tools and Applications, 2021, 80 : 11917 - 11941
  • [35] Deep green function convolution for improving saliency in convolutional neural networks
    Dominique Beaini
    Sofiane Achiche
    Alexandre Duperré
    Maxime Raison
    The Visual Computer, 2021, 37 : 227 - 244
  • [36] Deep green function convolution for improving saliency in convolutional neural networks
    Beaini, Dominique
    Achiche, Sofiane
    Duperre, Alexandre
    Raison, Maxime
    VISUAL COMPUTER, 2021, 37 (02): : 227 - 244
  • [37] Smoke Detection on Video Sequences Using Convolutional and Recurrent Neural Networks
    Filonenko, Alexander
    Kurnianggoro, Laksono
    Jo, Kang-Hyun
    COMPUTATIONAL COLLECTIVE INTELLIGENCE, ICCCI 2017, PT II, 2017, 10449 : 558 - 566
  • [38] Optic Disc Detection Based on Saliency Detection and Attention Convolutional Neural Networks
    Wang, Ying
    Yu, Xiaosheng
    Wu, Chengdong
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2021, E104A (09) : 1370 - 1374
  • [39] Two-Stream Recurrent Convolutional Neural Networks for Video Saliency Estimation
    Wei, Xiao
    Song, Li
    Xie, Rong
    Zhang, Wenjun
    2017 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2017, : 419 - 423
  • [40] An Optimized Framework of Video Compression Using Deep Convolutional Neural Networks (DCNN)
    Sreelatha, M.
    Tulasi, R. Lakshmi
    Kumar, K. Siva
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 515 - 522