Attention-based video object segmentation algorithm

Cited by: 0
Authors
Cao, Ying [1]
Sun, Lijuan [2,3]
Han, Chong [2,3]
Guo, Jian [2,3]
Affiliations
[1] Henan Univ, Kaifeng, Henan, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Nanjing, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Jiangsu High Technol Res Key Lab Wireless Sensor, Nanjing, Peoples R China
Funding
National Natural Science Foundation of China
DOI
10.1049/ipr2.12135
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To improve segmentation performance on videos with large object motion or deformation, a novel two-branch scheme is proposed. In one branch, an attention mechanism is first used to highlight object-related features. Then, to account for the temporal coherence of videos, Conv3D is integrated to capture short-term temporal features, and a purpose-designed attention residual convolutional long short-term memory is adopted to capture the long- and short-term temporal information of objects despite interference from redundant video frames. Meanwhile, to counter the negative effect of background motion, the other branch introduces an optical flow-based prediction model that predicts object regions in subsequent video frames from the annotated initial frame. Finally, based on the fused results of the two branches, global thresholding and a noisy-area cleaning method are employed to obtain the segmented objects. Experiments on DAVIS2016 and CDnet2014 show that the proposed scheme achieves competitive performance.
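The final stage described in the abstract (fusing the two branch outputs, applying a global threshold, and cleaning noisy areas) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `fuse_and_segment`, the averaging fusion rule, the threshold of 0.5, and the rule of dropping 4-connected regions smaller than `min_area` are all assumptions made here for illustration, since the abstract does not specify them.

```python
from collections import deque


def fuse_and_segment(attn_prob, flow_prob, threshold=0.5, min_area=3):
    """Fuse two per-pixel probability maps, binarize, and drop small regions.

    attn_prob, flow_prob: equally sized 2D lists of floats in [0, 1],
    standing in for the outputs of the two branches (hypothetical inputs).
    """
    h, w = len(attn_prob), len(attn_prob[0])

    # Assumed fusion rule: plain average of the two branch outputs,
    # followed by a single global threshold.
    mask = [[(attn_prob[y][x] + flow_prob[y][x]) / 2.0 >= threshold
             for x in range(w)] for y in range(h)]

    # Assumed cleaning rule: remove 4-connected foreground regions
    # whose pixel count is below min_area.
    seen = [[False] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if not mask[y][x] or seen[y][x]:
                continue
            region, queue = [], deque([(y, x)])
            seen[y][x] = True
            while queue:
                cy, cx = queue.popleft()
                region.append((cy, cx))
                for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                               (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(region) < min_area:
                for ry, rx in region:
                    mask[ry][rx] = False

    return [[1 if v else 0 for v in row] for row in mask]


# Toy example: a 2x2 object plus one isolated noisy pixel.
attn = [[0.9, 0.9, 0.1, 0.1],
        [0.9, 0.9, 0.1, 0.1],
        [0.1, 0.1, 0.1, 0.9]]
flow = [[0.8, 0.7, 0.2, 0.0],
        [0.9, 0.6, 0.0, 0.1],
        [0.0, 0.2, 0.1, 0.8]]
segmented = fuse_and_segment(attn, flow)
```

In this toy input the isolated pixel in the bottom-right corner passes the threshold but is discarded by the area check, while the 2x2 object is kept. A real pipeline would more likely operate on array data and tune both parameters per dataset.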
Pages: 1668-1678
Number of pages: 11
Related papers
50 in total
  • [31] A video object segmentation algorithm based on temporal edge memory compensation
    Zhu, Shi-Ping
    Ma, Li
    Hou, Yang-Shuan
    Guangdianzi Jiguang/Journal of Optoelectronics Laser, 2010, 21 (08): 1241-1246
  • [32] A STEREO VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON MOTION DETECTION AND DISPARITY
    Wang, Lingyun
    Li, Zhaohui
    Li, Dongmei
    PROCEEDINGS OF THE 3RD IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2012), 2012: 356-359
  • [33] Attention-Based Multimodal Fusion for Video Description
    Hori, Chiori
    Hori, Takaaki
    Lee, Teng-Yok
    Zhang, Ziming
    Harsham, Bret
    Hershey, John R.
    Marks, Tim K.
    Sumi, Kazuhiko
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017: 4203-4212
  • [34] A VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON THE FEATURE LEARNING AND SHAPE TRACKING
    Lee, Sang Hak
    Koo, Hyung Il
    Cho, Nam Ik
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010: 4673-4676
  • [35] Automatic video object segmentation based on clone algorithm and fuzzy mathematics
    Zhang Guangyu
    Gong Guangzhen
    Zhu Weile
    CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (03): 482-486
  • [36] Attention-based Video Virtual Try-On
    Tsai, Wen-Jiin
    Tien, Yi-Cheng
    PROCEEDINGS OF THE 2023 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2023, 2023: 209-216
  • [37] Attention-based interpolation network for video deblurring
    Zhang, Xiaoqin
    Jiang, Runhua
    Wang, Tao
    Huang, Pengcheng
    Zhao, Li
    NEUROCOMPUTING, 2021, 453: 865-875
  • [38] Describing Video With Attention-Based Bidirectional LSTM
    Bin, Yi
    Yang, Yang
    Shen, Fumin
    Xie, Ning
    Shen, Heng Tao
    Li, Xuelong
    IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (07): 2631-2641
  • [39] The effect of using video title in attention-based video summarization
    Li, Changwei
    Yeh, Zhi-Ting
    Gunuganti, Jeshmitha
    Chang, Jia-Bin
    Norouzi, Mehdi
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024
  • [40] Residual attention-based LSTM for video captioning
    Xiangpeng Li
    Zhilong Zhou
    Lijiang Chen
    Lianli Gao
    World Wide Web, 2019, 22: 621-636