FAMINet: Learning Real-Time Semisupervised Video Object Segmentation With Steepest Optimized Optical Flow

被引:0
|
作者
Liu, Ziyang [1 ]
Liu, Jingmeng [1 ]
Chen, Weihai [2 ]
Wu, Xingming [1 ]
Li, Zhengguo [3 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
[3] Inst Infocomm Res, SRO Dept, Singapore 138632, Singapore
基金
中国国家自然科学基金; 北京市自然科学基金;
关键词
Optical imaging; Integrated optics; Motion segmentation; Feature extraction; Adaptive optics; Optical network units; Streaming media; Online memorizing; optical flow; real time; relaxed steepest descent; semisupervised video object segmentation (VOS);
D O I
10.1109/TIM.2021.3133003
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semisupervised video object segmentation (VOS) aims to segment a few moving objects in a video sequence, where these objects are specified by annotation of the first frame. The optical flow has been considered in many existing semisupervised VOS methods to improve the segmentation accuracy. However, the optical flow-based semisupervised VOS methods cannot run in real time due to high complexity of optical flow estimation. A FAMINet, which consists of a feature extraction network (F), an appearance network (A), a motion network (M), and an integration network (I), is proposed in this study to address the above-mentioned problem. The appearance network outputs an initial segmentation result based on static appearances of objects. The motion network estimates the optical flow via very few parameters, which are optimized rapidly by an online memorizing algorithm named relaxed steepest descent. The integration network refines the initial segmentation result using the optical flow. Extensive experiments demonstrate that the FAMINet outperforms other state-of-the-art semisupervised VOS methods on the DAVIS and YouTube-VOS benchmarks and achieves a good trade-off between accuracy and efficiency. Our code is available at https://github.com/liuziyang123/FAMINet.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Real-time object classification in video surveillance based on appearance learning
    Zhang, Lun
    Li, Stan Z.
    Yuan, Xiaotong
    Xiang, Shiming
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3766 - +
  • [22] Real Time Video Object Segmentation in Compressed Domain
    Tan, Zhentao
    Liu, Bin
    Chu, Qi
    Zhong, Hangshi
    Wu, Yue
    Li, Weihai
    Yu, Nenghai
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 175 - 188
  • [23] Real-Time Tracking Combined with Object Segmentation
    Wang, Hongzhi
    Sang, Nong
    Yan, Yi
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4098 - 4103
  • [24] Real-Time Object-Based Video Segmentation Using Colour Segmentation and Connected Component Labeling
    Jau, U. L.
    Teh, C. S.
    VISUAL INFORMATICS: BRIDGING RESEARCH AND PRACTICE, 2009, 5857 : 110 - 121
  • [25] Real-time object segmentation based on GPU
    Lee, Sun-Ju
    Jeong, Chang-Sung
    2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 739 - 742
  • [26] Real-Time Moving Object Segmentation and Classification From HEVC Compressed Surveillance Video
    Zhao, Liang
    He, Zhihai
    Cao, Wenming
    Zhao, Debin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) : 1346 - 1357
  • [27] REAL-TIME VIDEO OBJECT SEGMENTATION ALGORITHM BASED ON CHANGE DETECTION AND BACKGROUND UPDATING
    Chen, Tsong-Yi
    Chen, Thou-Ho
    Wang, Da-Jinn
    Chiou, Yung-Chuen
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2009, 5 (07): : 1797 - 1810
  • [28] Real-time moving object detection and segmentation in H.264 video streams
    Konda, Krishna Reddy
    Tefera, Yonas Teodros
    Conci, Nicola
    De Natale, Francesco G. B.
    2017 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2017, : 314 - 319
  • [29] Automatic Video Segmentation and Object Tracking with Real-Time RGB-D Data
    Chen, I-Kuei
    Hsu, Szu-Lu
    Chi, Chung-Yu
    Chen, Liang-Gee
    2014 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2014, : 488 - 489
  • [30] An FPGA-Optimized Architecture of Real-time Farneback Optical Flow
    Pan, Zhe
    Jin, Yuruo
    Jiang, Xiaohong
    Wu, Jian
    28TH IEEE INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2020, : 223 - 223