Two-Stream Recurrent Convolutional Neural Networks for Video Saliency Estimation

被引:0
|
作者
Wei, Xiao [1 ,2 ]
Song, Li [1 ,2 ]
Xie, Rong [1 ,2 ]
Zhang, Wenjun [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
[2] Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
关键词
Saliency estimation; Video processing; Optical flow; CNN; Recurrent connections;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, research has emphasized the need for video saliency estimation since its application covers a large domain. Traditional saliency prediction methods for video based on hand-crafted visual features lead to slow speed and ineffective results. In this paper, we propose a real-time end-to-end saliency estimation model combining two-stream convolutional neural networks from global-view to local-view. In global view, the temporal stream CNN extracts the inter-frame features from optical flow map, and spatial stream CNN extracts the intraframe information. In local view, we adopt the recurrent connnections to refine the local details through correcting the saliency map step by step. We test our model TSRCNN on three datasets in video saliency estimation, and it shows not only exceedingly commendable performance but almostly real-time GPU processing time of 0.088s compared to other state-of-art methods.
引用
收藏
页码:419 / 423
页数:5
相关论文
共 50 条
  • [31] Video Saliency Prediction Based on Spatial-Temporal Two-Stream Network
    Zhang, Kao
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (12) : 3544 - 3557
  • [32] Video Recognition of American Sign Language Using Two-Stream Convolution Neural Networks
    Nugraha, Fikri
    Djamal, Esmeralda C.
    PROCEEDING OF 2019 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI), 2019, : 400 - 405
  • [33] Two-Stream Convolutional Networks for Blind Image Quality Assessment
    Yan, Qingsen
    Gong, Dong
    Zhang, Yanning
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (05) : 2200 - 2211
  • [34] Temporal and Spectral Feature Learning With Two-Stream Convolutional Neural Networks for Appliance Recognition in NILM
    Chen, Junfeng
    Wang, Xue
    Zhang, Xiaotian
    Zhang, Weihang
    IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (01) : 762 - 772
  • [35] Rat Grooming Behavior Detection with Two-stream Convolutional Networks
    Lee, Chien-Cheng
    Wei-Wei, Gao
    Lui, Ping -Wing
    2019 NINTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2019,
  • [36] A Two-Stream Network Based on Capsule Networks and Sliced Recurrent Neural Networks for DGA Botnet Detection
    Xinjun Pei
    Shengwei Tian
    Long Yu
    Huanhuan Wang
    Yongfang Peng
    Journal of Network and Systems Management, 2020, 28 : 1694 - 1721
  • [37] Spatiotemporal Remote Sensing Image Fusion Using Multiscale Two-Stream Convolutional Neural Networks
    Chen, Yuehong
    Shi, Kaixin
    Ge, Yong
    Zhou, Ya'Nan
    IEEE Transactions on Geoscience and Remote Sensing, 2022, 60
  • [38] A Two-Stream Network Based on Capsule Networks and Sliced Recurrent Neural Networks for DGA Botnet Detection
    Pei, Xinjun
    Tian, Shengwei
    Yu, Long
    Wang, Huanhuan
    Peng, Yongfang
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2020, 28 (04) : 1694 - 1721
  • [39] Video Saliency Detection Using Deep Convolutional Neural Networks
    Zhou, Xiaofei
    Liu, Zhi
    Gong, Chen
    Li, Gongyang
    Huang, Mengke
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 308 - 319
  • [40] Spatiotemporal Remote Sensing Image Fusion Using Multiscale Two-Stream Convolutional Neural Networks
    Chen, Yuehong
    Shi, Kaixin
    Ge, Yong
    Zhou, Ya'nan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60