Attention-based video object segmentation algorithm

被引:0
|
作者
Cao, Ying [1 ]
Sun, Lijuan [2 ,3 ]
Han, Chong [2 ,3 ]
Guo, Jian [2 ,3 ]
机构
[1] Henan Univ, Kaifeng, Henan, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1049/ipr2.12135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve the segmentation performance on videos with large object motion or deformation, a novel scheme is proposed which has two branches. In one branch, the attention mechanism is first utilized to highlight objects-related features. Then, to well consider the temporal coherence of videos, Conv3D is integrated to capture short-term temporal features, and the designed attention residual convolutional long-short-term memory is adopted to capture the long-short-term temporal information of objects under the interference of redundant video frames. Meanwhile, considering the negative effect of background motion, in another branch, the optical flow-based prediction model is introduced to predict objects regions in subsequent video frames with the annotated initial frame. At last, based on the fused results of two branches, the global thresholds and noising area clean method are employed to obtain segmented objects. The experiments on DAVIS2016 and CDnet2014 exhibit the competitive performance of the proposed scheme.
引用
收藏
页码:1668 / 1678
页数:11
相关论文
共 50 条
  • [21] Improved semantic video object segmentation algorithm
    Ren, He
    Hua, Chazhen
    Jisuanji Gongcheng/Computer Engineering, 2002, 28 (08):
  • [22] An Attention-based Activity Recognition for Egocentric Video
    Matsuo, Kenji
    Yamada, Kentaro
    Ueno, Satoshi
    Naito, Sei
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 565 - +
  • [23] Attention-Based Convolutional LSTM for Describing Video
    Liu, Zhongyu
    Chen, Tian
    Ding, Enjie
    Liu, Yafeng
    Yu, Wanli
    IEEE Access, 2020, 8 : 133713 - 133724
  • [24] Multimodal attention-based transformer for video captioning
    Hemalatha Munusamy
    Chandra Sekhar C
    Applied Intelligence, 2023, 53 : 23349 - 23368
  • [25] Attention-Based Convolutional LSTM for Describing Video
    Liu, Zhongyu
    Chen, Tian
    Ding, Enjie
    Liu, Yafeng
    Yu, Wanli
    IEEE ACCESS, 2020, 8 : 133713 - 133724
  • [26] A novel automatic video object segmentation algorithm
    Zhang Guangyu
    Gong Guangzhen
    Zhu Weile
    CHINESE JOURNAL OF ELECTRONICS, 2007, 16 (01): : 115 - 118
  • [27] A fast algorithm for video segmentation and object tracking
    Giusto, DD
    Massidda, F
    Perra, C
    DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 697 - 700
  • [28] Video object segmentation using the EM algorithm
    Doulamis, ND
    Doulamis, AD
    Kollias, SD
    ADVANCES IN INTELLIGENT SYSTEMS: CONCEPTS, TOOLS AND APPLICATIONS, 1999, 21 : 321 - 332
  • [29] Residual Attention-based Fusion for Video Classification
    Pouyanfar, Samira
    Wang, Tianyi
    Chen, Shu-Ching
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 478 - 480
  • [30] Multimodal attention-based transformer for video captioning
    Munusamy, Hemalatha
    Sekhar, C. Chandra
    APPLIED INTELLIGENCE, 2023, 53 (20) : 23349 - 23368