FAMINet: Learning Real-Time Semisupervised Video Object Segmentation With Steepest Optimized Optical Flow

被引:0
|
作者
Liu, Ziyang [1 ]
Liu, Jingmeng [1 ]
Chen, Weihai [2 ]
Wu, Xingming [1 ]
Li, Zhengguo [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
[3] Inst Infocomm Res, SRO Dept, Singapore 138632, Singapore
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Optical imaging; Integrated optics; Motion segmentation; Feature extraction; Adaptive optics; Optical network units; Streaming media; Online memorizing; optical flow; real time; relaxed steepest descent; semisupervised video object segmentation (VOS);
D O I
10.1109/TIM.2021.3133003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semisupervised video object segmentation (VOS) aims to segment a few moving objects in a video sequence, where these objects are specified by annotation of the first frame. The optical flow has been considered in many existing semisupervised VOS methods to improve the segmentation accuracy. However, the optical flow-based semisupervised VOS methods cannot run in real time due to high complexity of optical flow estimation. A FAMINet, which consists of a feature extraction network (F), an appearance network (A), a motion network (M), and an integration network (I), is proposed in this study to address the above-mentioned problem. The appearance network outputs an initial segmentation result based on static appearances of objects. The motion network estimates the optical flow via very few parameters, which are optimized rapidly by an online memorizing algorithm named relaxed steepest descent. The integration network refines the initial segmentation result using the optical flow. Extensive experiments demonstrate that the FAMINet outperforms other state-of-the-art semisupervised VOS methods on the DAVIS and YouTube-VOS benchmarks and achieves a good trade-off between accuracy and efficiency. Our code is available at https://github.com/liuziyang123/FAMINet.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] SwiftNet: Real-time Video Object Segmentation
    Wang, Haochen
    Jiang, Xiaolong
    Ren, Haibing
    Hu, Yao
    Bai, Song
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 1296 - 1305
  • [2] Real-time video object segmentation for MPEG encoded video sequences
    Porikli, F
    REAL-TIME IMAGING VIII, 2004, 5297 : 195 - 203
  • [3] MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation
    Miles, Roy
    Yucel, Mehmet Kerim
    Manganelli, Bruno
    Saa-Garriga, Albert
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10480 - 10490
  • [4] DMVOS: Discriminative Matching for real-time Video Object Segmentation
    Wen, Peisong
    Yang, Ruolin
    Xu, Qianqian
    Qian, Chen
    Huang, Qingming
    Cong, Runming
    Si, Jianlou
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2048 - 2056
  • [5] Real-time video object segmentation using HSV space
    Li, N
    Bu, JJ
    Chen, C
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2002, : 85 - 88
  • [6] SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
    Cheng, Jingchun
    Tsai, Yi-Hsuan
    Wang, Shengjin
    Yang, Ming-Hsuan
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 686 - 695
  • [7] Real-time and multi-video-object segmentation for compressed video sequences
    Fu Wenxiu
    Wang Bin
    Liu Ming
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 3, PROCEEDINGS, 2007, : 747 - +
  • [8] VLSI architecture study of a real-time scalable optical flow processor for video segmentation
    Minegishi, N
    Miyakoshi, J
    Kuroda, Y
    Katagiri, T
    Fukuyama, Y
    Yamamoto, R
    Miyama, M
    Imamura, K
    Hashimoto, H
    Yoshimoto, M
    IEICE TRANSACTIONS ON ELECTRONICS, 2006, E89C (03): : 230 - 242
  • [9] Real-time video segmentation
    Dibos, F
    Pelletier, S
    Koep, G
    AVSS 2005: ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2005, : 382 - 387
  • [10] Fast Real-Time Video Object Segmentation with a Tangled Memory Network
    Mei, Jianbiao
    Wang, Mengmeng
    Yang, Yu
    Li, Yanjun
    Liu, Yong
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (03)