A temporal attention based appearance model for video object segmentation

被引:2
|
作者
Wang, Hui [1 ]
Liu, Weibin [1 ]
Xing, Weiwei [2 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Sch Software Engn, Beijing 100044, Peoples R China
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Video object segmentation; Convolutional neural networks; Appearance model; Mixture loss;
D O I
10.1007/s10489-021-02547-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
More and more researchers have recently paid attention to video object segmentation because it is an important building block for numerous computer vision applications. Although many algorithms promote its development, there are still some open challenges. Efficient and robust pipelines are needed to address appearance changes and the distraction from similar background objects in the video object segmentation. This paper proposes a novel neural network that integrates a temporal attention based appearance model and a boundary-aware loss. The appearance model fuses the appearance information of the first frame, the previous frame, and the current frame in the feature space, which assists the proposed method to learn a discriminative and robust target representation and avoid the drift problem of traditional propagation schemes. Moreover, the boundary-aware loss is employed for network training. Equipped with the boundary-aware loss, the proposed method achieves more accurate segmentation results with clear boundaries. The proposed method is compared with several recent state-of-the-art algorithms on popular benchmark datasets. Comprehensive experiments show that the proposed method achieves favorable performance with a high frame rate.
引用
收藏
页码:2290 / 2300
页数:11
相关论文
共 50 条
  • [1] A temporal attention based appearance model for video object segmentation
    Hui Wang
    Weibin Liu
    Weiwei Xing
    Applied Intelligence, 2022, 52 : 2290 - 2300
  • [2] Appearance-consistent Video Object Segmentation Based on a Multinomial Event Model
    Chen, Yadang
    Hao, Chuanyan
    Liu, Alex X.
    Wu, Enhua
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2019, 15 (02)
  • [3] Attention-based video object segmentation algorithm
    Cao, Ying
    Sun, Lijuan
    Han, Chong
    Guo, Jian
    IET IMAGE PROCESSING, 2021, 15 (08) : 1668 - 1678
  • [4] Attention-Guided Memory Model for Video Object Segmentation
    Lin, Yunjian
    Tan, Yihua
    Communications in Computer and Information Science, 2022, 1566 CCIS : 67 - 85
  • [5] A Generative Appearance Model for End-to-end Video Object Segmentation
    Johnander, Joakim
    Danelljan, Martin
    Brissman, Emil
    Khan, Fahad Shahbaz
    Felsberg, Michael
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8945 - 8954
  • [6] BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation
    Yu, Ye
    Yuan, Jialin
    Mittal, Gaurav
    Li Fuxin
    Chen, Mei
    COMPUTER VISION, ECCV 2022, PT XXIX, 2022, 13689 : 612 - 629
  • [7] Visual Attention Guided Video Object Segmentation
    Liang, Hao
    Tan, Yihua
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 345 - 349
  • [8] Efficient Long-Short Temporal Attention network for unsupervised Video Object Segmentation
    Li, Ping
    Zhang, Yu
    Yuan, Li
    Xiao, Huaxin
    Lin, Binbin
    Xu, Xianghua
    PATTERN RECOGNITION, 2024, 146
  • [9] Dual Attention Based Network with Hierarchical ConvLSTM for Video Object Segmentation
    Zhao, Zongji
    Zhao, Sanyuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 323 - 335
  • [10] Video object segmentation based on Gaussian mixture model
    School of Electronics and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
    Hsi An Chiao Tung Ta Hsueh, 2006, 6 (724-728):