Two-Stream Recurrent Convolutional Neural Networks for Video Saliency Estimation

被引:0
|
作者
Wei, Xiao [1 ,2 ]
Song, Li [1 ,2 ]
Xie, Rong [1 ,2 ]
Zhang, Wenjun [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Shanghai, Peoples R China
[2] Cooperat Medianet Innovat Ctr, Shanghai, Peoples R China
关键词
Saliency estimation; Video processing; Optical flow; CNN; Recurrent connections;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, research has emphasized the need for video saliency estimation since its application covers a large domain. Traditional saliency prediction methods for video based on hand-crafted visual features lead to slow speed and ineffective results. In this paper, we propose a real-time end-to-end saliency estimation model combining two-stream convolutional neural networks from global-view to local-view. In global view, the temporal stream CNN extracts the inter-frame features from optical flow map, and spatial stream CNN extracts the intraframe information. In local view, we adopt the recurrent connnections to refine the local details through correcting the saliency map step by step. We test our model TSRCNN on three datasets in video saliency estimation, and it shows not only exceedingly commendable performance but almostly real-time GPU processing time of 0.088s compared to other state-of-art methods.
引用
收藏
页码:419 / 423
页数:5
相关论文
共 50 条
  • [21] Two-Stream Convolutional Networks for Hyperspectral Target Detection
    Zhu, Dehui
    Du, Bo
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08): : 6907 - 6921
  • [22] Two-Stream Multirate Recurrent Neural Network for Video-Based Pedestrian Reidentification
    Zeng, Zhiqiang
    Li, Zhihui
    Cheng, De
    Zhang, Huaxiang
    Zhan, Kun
    Yang, Yi
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (07) : 3179 - 3186
  • [23] Convolutional Two-Stream Network Fusion for Video Action Recognition
    Feichtenhofer, Christoph
    Pinz, Axel
    Zisserman, Andrew
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1933 - 1941
  • [24] Two-Stream Convolutional Networks for Action Recognition in Videos
    Simonyan, Karen
    Zisserman, Andrew
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [25] Two-stream convolutional networks for skin cancer classification
    Mohammed Aloraini
    Multimedia Tools and Applications, 2024, 83 : 30741 - 30753
  • [26] Two-stream Graph Attention Convolutional for Video Action Recognition
    Zhang, Deyuan
    Gao, Hongwei
    Dai, Hailong
    Shi, Xiangbin
    2021 IEEE 15TH INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (BIGDATASE 2021), 2021, : 23 - 27
  • [27] Pornographic Video Detection with Convolutional Two-Stream Network Fusion
    Lee, Wonjae
    Kim, Junghak
    Lee, Nam Kyung
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1273 - 1275
  • [28] Fusing two-stream convolutional neural networks for RGB-T object tracking
    Li, Chenglong
    Wu, Xiaohao
    Zhao, Nan
    Cao, Xiaochun
    Tang, Jin
    NEUROCOMPUTING, 2018, 281 : 78 - 85
  • [29] Crop disease diagnosis and prediction using two-stream hybrid convolutional neural networks
    Hong, Pengxiang
    Luo, Xi
    Bao, Lingxin
    CROP PROTECTION, 2024, 184
  • [30] SCANet: Sensor-based Continuous Authentication with Two-stream Convolutional Neural Networks
    Li, Yantao
    Hu, Hailong
    Zhu, Zhangqian
    Zhou, Gang
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2020, 16 (03)