Video Saliency Detection Using Deep Convolutional Neural Networks

Cited by: 2
Authors
Zhou, Xiaofei [1, 2, 3]
Liu, Zhi [2, 3]
Gong, Chen [4]
Li, Gongyang [2, 3]
Huang, Mengke [2, 3]
Affiliations
[1] Hangzhou Dianzi Univ, Inst Informat & Control, Hangzhou, Peoples R China
[2] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Shanghai, Peoples R China
[3] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
[4] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Minist Educ, Key Lab Intelligent Percept & Syst High Dimens In, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Video saliency; Convolutional neural networks; Feature aggregation; VISUAL-ATTENTION; SEGMENTATION; IMAGE; MODEL;
DOI
10.1007/978-3-030-03335-4_27
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Numerous deep learning based methods have been proposed for image saliency detection, so it is natural to build a video saliency model on top of these image saliency models. Because the number of training videos is limited, existing video saliency models are trained with large-scale synthetic video data. In this paper, we construct a video saliency model based on an existing image saliency model and train it on the limited video data. Concretely, our video saliency model consists of three steps: feature extraction, feature aggregation, and spatial refinement. First, the concatenation of the current frame and its optical flow image is fed into the feature extraction network, yielding feature maps. Then, a tensor consisting of the generated feature maps and the original information (the current frame and the optical flow image) is passed to the aggregation network, where the original information provides complementary cues for aggregation. Finally, to obtain a high-quality saliency map with well-defined boundaries, the output of the aggregation network and the current frame are used to perform spatial refinement, yielding the final saliency map for the current frame. Extensive qualitative and quantitative experiments on two challenging video datasets show that the proposed model consistently outperforms state-of-the-art saliency models for detecting salient objects in videos.
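The data flow of the three steps can be illustrated with a minimal sketch. This is not the authors' implementation: each trained network is replaced by a hypothetical random 1x1 convolution (`conv1x1`), the optical flow image is assumed to be a 3-channel color-coded image, and names such as `video_saliency_sketch` and `num_features` are invented for illustration. Only the order of the concatenations follows the abstract.

```python
import numpy as np


def conv1x1(x, out_channels, rng):
    """Hypothetical stand-in for a learned network: random 1x1 conv + ReLU.

    x has shape (channels, height, width).
    """
    weights = rng.standard_normal((out_channels, x.shape[0])) * 0.1
    return np.maximum(np.einsum("oc,chw->ohw", weights, x), 0.0)


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def video_saliency_sketch(frame, flow_image, num_features=8, seed=0):
    """Trace the three-step data flow for one frame.

    frame, flow_image: arrays of shape (3, H, W); the 3-channel flow
    image is an assumption, not stated in the abstract.
    Returns a saliency map of shape (H, W) with values in (0, 1).
    """
    rng = np.random.default_rng(seed)

    # Step 1: feature extraction on the frame concatenated with its flow image.
    extraction_input = np.concatenate([frame, flow_image], axis=0)   # (6, H, W)
    features = conv1x1(extraction_input, num_features, rng)          # (F, H, W)

    # Step 2: aggregation over the feature maps plus the original inputs.
    aggregation_input = np.concatenate([features, frame, flow_image], axis=0)
    coarse = conv1x1(aggregation_input, 1, rng)                      # (1, H, W)

    # Step 3: spatial refinement from the aggregation output and the frame.
    refinement_input = np.concatenate([coarse, frame], axis=0)       # (4, H, W)
    weights = rng.standard_normal((1, refinement_input.shape[0])) * 0.1
    saliency = sigmoid(np.einsum("oc,chw->ohw", weights, refinement_input))
    return saliency[0]


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    frame = rng.random((3, 32, 48))
    flow_image = rng.random((3, 32, 48))
    print(video_saliency_sketch(frame, flow_image).shape)
```

The sketch makes one point from the abstract concrete: the current frame is reused at every stage, first as network input, then as complementary information for aggregation, and finally as the boundary cue for refinement.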
Pages: 308-319
Page count: 12
Related papers
50 in total
  • [1] MESH SALIENCY DETECTION USING CONVOLUTIONAL NEURAL NETWORKS
    Nousias, Stavros
    Arvanitis, Gerasimos
    Lalos, Aris S.
    Moustakas, Konstantinos
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [2] Video-Based Fire Detection with Saliency Detection and Convolutional Neural Networks
    Shi, Lifeng
    Long, Fei
    Lin, ChenHan
    Zhao, Yihan
    ADVANCES IN NEURAL NETWORKS, PT II, 2017, 10262 : 299 - 309
  • [3] Landmark Detection with Surprise Saliency Using Convolutional Neural Networks
    Tang, Feng
    Lyons, Damian M.
    Leeds, Daniel D.
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2016, : 204 - 211
  • [4] Visibility Loss Detection for Video Camera Using Deep Convolutional Neural Networks
    Ivanov, Alexey
    Yudin, Dmitry
    PROCEEDINGS OF THE THIRD INTERNATIONAL SCIENTIFIC CONFERENCE INTELLIGENT INFORMATION TECHNOLOGIES FOR INDUSTRY (IITI'18), VOL 1, 2019, 874 : 434 - 443
  • [5] Artifact Detection in Endoscopic Video with Deep Convolutional Neural Networks
    Zhang, Chenxi
    Zhang, Ning
    Wang, Dechun
    Cao, Yu
    Liu, Benyuan
    2020 SECOND INTERNATIONAL CONFERENCE ON TRANSDISCIPLINARY AI (TRANSAI 2020), 2020, : 1 - 8
  • [6] Efficient saliency detection using convolutional neural networks with feature selection
    Cao, Feilong
    Liu, Yuehua
    Wang, Dianhui
    INFORMATION SCIENCES, 2018, 456 : 34 - 49
  • [7] Depth-aware saliency detection using convolutional neural networks
    Ding, Yu
    Liu, Zhi
    Huang, Mengke
    Shi, Ran
    Wang, Xiangyang
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 61 : 1 - 9
  • [8] Deep-fake video detection approaches using convolutional - recurrent neural networks
    Suratkar, Shraddha
    Bhiungade, Sayali
    Pitale, Jui
    Soni, Komal
    Badgujar, Tushar
    Kazi, Faruk
    JOURNAL OF CONTROL AND DECISION, 2023, 10 (02) : 198 - 214
  • [9] Light Field Saliency Detection With Deep Convolutional Networks
    Zhang, Jun
    Liu, Yamei
    Zhang, Shengping
    Poppe, Ronald
    Wang, Meng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 4421 - 4434
  • [10] Object Detection Using Deep Convolutional Neural Networks
    Qian, Huimin
    Xu, Jiawei
    Zhou, Jun
    2018 CHINESE AUTOMATION CONGRESS (CAC), 2018, : 1151 - 1156