Attention-based video object segmentation algorithm

被引:0
|
作者
Cao, Ying [1 ]
Sun, Lijuan [2 ,3 ]
Han, Chong [2 ,3 ]
Guo, Jian [2 ,3 ]
机构
[1] Henan Univ, Kaifeng, Henan, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1049/ipr2.12135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve the segmentation performance on videos with large object motion or deformation, a novel scheme is proposed which has two branches. In one branch, the attention mechanism is first utilized to highlight objects-related features. Then, to well consider the temporal coherence of videos, Conv3D is integrated to capture short-term temporal features, and the designed attention residual convolutional long-short-term memory is adopted to capture the long-short-term temporal information of objects under the interference of redundant video frames. Meanwhile, considering the negative effect of background motion, in another branch, the optical flow-based prediction model is introduced to predict objects regions in subsequent video frames with the annotated initial frame. At last, based on the fused results of two branches, the global thresholds and noising area clean method are employed to obtain segmented objects. The experiments on DAVIS2016 and CDnet2014 exhibit the competitive performance of the proposed scheme.
引用
收藏
页码:1668 / 1678
页数:11
相关论文
共 50 条
  • [41] Attention-based video summarisation in rushes collection
    Ren, Reede
    Swamy, Punitha Puttu
    Jose, Joemon M.
    Urban, Jana
    Proceedings of the ACM International Multimedia Conference and Exhibition, 2007, : 89 - 93
  • [42] Residual attention-based LSTM for video captioning
    Li, Xiangpeng
    Zhou, Zhilong
    Chen, Lijiang
    Gao, Lianli
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2019, 22 (02): : 621 - 636
  • [43] Video Object Segmentation Based on Disparity
    Xingming, Ouyang
    Wei, Wei
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 36 - 44
  • [44] Attention-based fusion factor in FPN for object detection
    Li, Yuancheng
    Zhou, Shenglong
    Chen, Hui
    APPLIED INTELLIGENCE, 2022, 52 (13) : 15547 - 15556
  • [45] Attention-Based Transformers for Instance Segmentation of Cells in Microstructures
    Prangemeier, Tim
    Reich, Christoph
    Koeppl, Heinz
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 700 - 707
  • [46] Saliency-based dual-attention network for unsupervised video object segmentation
    Zhang, Guifang
    Wong, Hon-Cheng
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (04): : 4996 - 5010
  • [47] Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps
    Heo, Yuk
    Koh, Yeong Jun
    Kim, Chang-Su
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7318 - 7326
  • [48] Attention-based Weighted Fusion Network for Object Detection
    Yu, Ruixing
    Wang, Chuyin
    Tang, Yifei
    JOURNAL OF IMAGING SCIENCE AND TECHNOLOGY, 2024, 68 (06) : 1 - 18
  • [49] Where and What: Driver Attention-based Object Detection
    Rong Y.
    Kassautzki N.-R.
    Fuhl W.
    Kasneci E.
    Proceedings of the ACM on Human-Computer Interaction, 2022, 6 (ETRA)
  • [50] Saliency-based dual-attention network for unsupervised video object segmentation
    Guifang Zhang
    Hon-Cheng Wong
    The Journal of Supercomputing, 2024, 80 (4) : 4996 - 5010