Video Saliency Detection Using Deep Convolutional Neural Networks

被引:2
|
作者
Zhou, Xiaofei [1 ,2 ,3 ]
Liu, Zhi [2 ,3 ]
Gong, Chen [4 ]
Li, Gongyang [2 ,3 ]
Huang, Mengke [2 ,3 ]
机构
[1] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou, Peoples R China
[2] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Minist Educ, Key Lab Intelligent Percept & Syst High Dimens In, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Video saliency; Convolutional neural networks; Feature aggregation; VISUAL-ATTENTION; SEGMENTATION; IMAGE; MODEL;
D O I
10.1007/978-3-030-03335-4_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Numerous deep learning based efforts have been done for image saliency detection, and thus, it is a natural idea that we can construct video saliency model on basis of these image saliency models in an effective way. Besides, as for the limited number of training videos, existing video saliency model is trained with large-scale synthetic video data. In this paper, we construct video saliency model based on existing image saliency model and perform training on the limited video data. Concretely, our video saliency model consists of three steps including feature extraction, feature aggregation and spatial refinement. Firstly, the concatenation of current frame and its optical flow image is fed into the feature extraction network, yielding feature maps. Then, a tensor, which consists of the generated feature maps and the original information including the current frame and the optical flow image, is passed to the aggregation network, in which the original information can provide complementary information for aggregation. Finally, in order to obtain a high-quality saliency map with well-defined boundaries, the output of aggregation network and the current frame are used to perform spatial refinement, yielding the final saliency map for the current frame. The extensive qualitative and quantitative experiments on two challenging video datasets show that the proposed model consistently outperforms the state-of-the-art saliency models for detecting salient objects in videos.
引用
收藏
页码:308 / 319
页数:12
相关论文
共 50 条
  • [41] Evaluation of bottom-up saliency model using deep features pretrained by deep convolutional neural networks
    Mahdi, Ali
    Qin, Jun
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (03)
  • [42] Deep Convolutional Neural Networks for DGA Detection
    Catania, Carlos
    Garcia, Sebastian
    Torres, Pablo
    COMPUTER SCIENCE - CACIC 2018, 2019, 995 : 327 - 340
  • [43] Deep Convolutional Neural Networks for pedestrian detection
    Tome, D.
    Monti, F.
    Baroffio, L.
    Bondi, L.
    Tagliasacchi, M.
    Tubaro, S.
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 : 482 - 489
  • [44] Stenosis Detection with Deep Convolutional Neural Networks
    Antczak, Karol
    Liberadzki, Lukasz
    22ND INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATIONS AND COMPUTERS (CSCC 2018), 2018, 210
  • [45] Improving pedestrian detection using convolutional neural network and saliency detection
    Errami, Mounir
    Rziza, Mohammed
    Haboub, Abdelmoula
    FOURTEENTH INTERNATIONAL CONFERENCE ON QUALITY CONTROL BY ARTIFICIAL VISION, 2019, 11172
  • [46] Vessel Detection in Ultrasound Images Using Deep Convolutional Neural Networks
    Smistad, Erik
    Lovstakken, Lasse
    DEEP LEARNING AND DATA LABELING FOR MEDICAL APPLICATIONS, 2016, 10008 : 30 - 38
  • [47] A Deep Learning Approach for Early Detection of Facial Palsy in Video Using Convolutional Neural Networks: A Computational Study
    Arora, Anuja
    Zaeem, Jasir Mohammad
    Garg, Vibhor
    Jayal, Ambikesh
    Akhtar, Zahid
    COMPUTERS, 2024, 13 (08)
  • [48] Robot grasp detection using multimodal deep convolutional neural networks
    Wang, Zhichao
    Li, Zhiqi
    Wang, Bin
    Liu, Hong
    ADVANCES IN MECHANICAL ENGINEERING, 2016, 8 (09) : 1 - 12
  • [49] Pulmonary Tuberculosis Detection Using Deep Learning Convolutional Neural Networks
    Norval, Michael
    Wang, Zenghui
    Sun, Yanxia
    ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 47 - 51
  • [50] Automated Road Crack Detection Using Deep Convolutional Neural Networks
    Mandal, Vishal
    Uong, Lan
    Adu-Gyamfi, Yaw
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5212 - 5215