Video object segmentation based on dynamic perception update and feature fusion

被引:0
|
作者
Hou, Zhiqiang [1 ,2 ]
Li, Fucheng [1 ,2 ]
Dong, Jiale [1 ,2 ]
Dai, Nan [1 ,2 ]
Ma, Sugang [1 ,2 ]
Fan, Jiulun [3 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Peoples R China
[2] Key Lab Network Data Anal & Intelligent Proc Shaan, Xian 710121, Peoples R China
[3] Xian Univ Posts & Telecommun, Sch Commun & Informat Engn, Xian 710121, Peoples R China
基金
中国国家自然科学基金;
关键词
Memory network; History frames; Dynamic perception update; Feature fusion; Mask refinement;
D O I
10.1016/j.imavis.2024.105156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current popular video object segmentation algorithms based on memory network indiscriminately update the frame information to the memory pool, fails to make reasonable use of the historical frame information, causing frame information redundancy in the memory pool, resulting in the increase of the computation amount. At the same time, the mask refinement method is relatively rough, resulting in blurred edges of the generated mask. To solve these problems, This paper proposes a video object segmentation algorithm based on dynamic perception update and feature fusion. In order to reasonably utilize the historical frame information, a dynamic perception update module is proposed to selectively update the segmentation frame mask. Meanwhile, a mask refinement module is established to enhance the detail information of the shallow features of the backbone network. This module uses a double kernels fusion block to fuse the different scale information of the features, and finally uses the Laplacian operator to sharpen the edges of the mask. The experimental results show that on the public datasets DAVIS2016, DAVIS2017 and YouTube-VOS18, the comprehensive performance of the algorithm in this paper reaches 86.9%, 79.3% and 71.6%, respectively, and the segmentation speed reaches 15FPS on the DAVIS2016 dataset. Compared with many mainstream algorithms in recent years, it has obvious advantages in performance.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Video segmentation using multiscale feature extraction and fusion
    Tsai, T.-H. (han@dsp.ee.ncu.edu.tw), Circuits and Systems Society, IEEE CASS; Science Council of Japan; The Inst. of Electronics, Inf. and Communication Engineers, IEICE; The Institute of Electrical and Electronics Engineers, Inc., IEEE (Institute of Electrical and Electronics Engineers Inc.):
  • [32] FFAVOD: Feature fusion architecture for video object detection
    Perreault, Hughes
    Bilodeau, Guillaume-Alexandre
    Saunier, Nicolas
    Heritier, Maguelonne
    PATTERN RECOGNITION LETTERS, 2021, 151 : 294 - 301
  • [33] Video segmentation using multiscale feature extraction and fusion
    Tsai, TH
    Lin, CY
    2005 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), VOLS 1-6, CONFERENCE PROCEEDINGS, 2005, : 3223 - 3226
  • [34] Video Object Segmentation Based on Disparity
    Xingming, Ouyang
    Wei, Wei
    ADVANCES IN WEB AND NETWORK TECHNOLOGIES, AND INFORMATION MANAGEMENT, 2009, 5731 : 36 - 44
  • [35] Object tracking from airborne video using particle filters algorithm on dynamic feature fusion
    Jiao, Yang
    Zhao, Song
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2016, 19 (03): : 787 - 799
  • [36] Video object segmentation based on multi-level target models and feature integration
    Gao, Bocong
    Zhao, Yuqian
    Zhang, Fan
    Luo, Biao
    Yang, Chunhua
    Neurocomputing, 2022, 492 : 396 - 407
  • [37] Video object segmentation based on multi-level target models and feature integration
    Gao, Bocong
    Zhao, Yuqian
    Zhang, Fan
    Luo, Biao
    Yang, Chunhua
    NEUROCOMPUTING, 2022, 492 : 396 - 407
  • [38] Multiple feature clustering algorithm for automatic video object segmentation
    Wei, W
    Ngan, KN
    Habili, N
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS, 2004, : 625 - 628
  • [39] Video object segmentation via couple streams and feature memory
    Liang, Yun
    Xiao, Xinjie
    Qiu, Shaojian
    Zhang, Yuqing
    Su, Zhuo
    IET IMAGE PROCESSING, 2024, 18 (09) : 2257 - 2272
  • [40] Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation
    Pei, Gensheng
    Shen, Fumin
    Yao, Yazhou
    Xie, Guo-Sen
    Tang, Zhenmin
    Tang, Jinhui
    COMPUTER VISION, ECCV 2022, PT XXXIV, 2022, 13694 : 596 - 613