FAMINet: Learning Real-Time Semisupervised Video Object Segmentation With Steepest Optimized Optical Flow

Authors
Liu, Ziyang [1]
Liu, Jingmeng [1]
Chen, Weihai [2]
Wu, Xingming [1]
Li, Zhengguo [3]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
[3] Inst Infocomm Res, SRO Dept, Singapore 138632, Singapore
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation;
Keywords
Optical imaging; Integrated optics; Motion segmentation; Feature extraction; Adaptive optics; Optical network units; Streaming media; Online memorizing; optical flow; real time; relaxed steepest descent; semisupervised video object segmentation (VOS);
DOI
10.1109/TIM.2021.3133003
Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology];
Subject classification codes
0808 ; 0809 ;
Abstract
Semisupervised video object segmentation (VOS) aims to segment a few moving objects in a video sequence, where these objects are specified by annotation of the first frame. The optical flow has been considered in many existing semisupervised VOS methods to improve the segmentation accuracy. However, the optical flow-based semisupervised VOS methods cannot run in real time due to high complexity of optical flow estimation. A FAMINet, which consists of a feature extraction network (F), an appearance network (A), a motion network (M), and an integration network (I), is proposed in this study to address the above-mentioned problem. The appearance network outputs an initial segmentation result based on static appearances of objects. The motion network estimates the optical flow via very few parameters, which are optimized rapidly by an online memorizing algorithm named relaxed steepest descent. The integration network refines the initial segmentation result using the optical flow. Extensive experiments demonstrate that the FAMINet outperforms other state-of-the-art semisupervised VOS methods on the DAVIS and YouTube-VOS benchmarks and achieves a good trade-off between accuracy and efficiency. Our code is available at https://github.com/liuziyang123/FAMINet.
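The abstract names relaxed steepest descent as the optimizer for the motion network but gives no details. As a rough illustration of the general technique only, the sketch below applies a textbook relaxed steepest descent to a small quadratic problem: each step uses the exact line-search step size scaled by a relaxation factor. The function name, the fixed factor `theta`, and the toy 2x2 system are assumptions made for this example; it is not the FAMINet implementation and does not estimate optical flow.

```python
def matvec(A, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]


def dot(u, v):
    """Inner product of two vectors."""
    return sum(a * b for a, b in zip(u, v))


def relaxed_steepest_descent(A, b, x0, theta=0.8, iters=200, tol=1e-12):
    """Minimize f(x) = 0.5 * x^T A x - b^T x for symmetric positive definite A.

    theta is a hypothetical fixed relaxation factor in (0, 2);
    theta = 1 gives classical steepest descent with exact line search.
    """
    x = list(x0)
    for _ in range(iters):
        # Gradient of the quadratic: g = A x - b
        g = [ax - bi for ax, bi in zip(matvec(A, x), b)]
        gg = dot(g, g)
        if gg < tol:
            break
        # Exact line-search step size along -g
        alpha = gg / dot(g, matvec(A, g))
        # Relaxed update: scale the exact step by theta
        x = [xi - theta * alpha * gi for xi, gi in zip(x, g)]
    return x


# Toy system (assumed for illustration): 3x + y = 1, x + 2y = 1
A = [[3.0, 1.0], [1.0, 2.0]]
b = [1.0, 1.0]
x = relaxed_steepest_descent(A, b, [0.0, 0.0])
print(x)  # close to the exact solution [0.2, 0.4]
```

For a fixed `theta` in (0, 2) every step strictly decreases the quadratic objective, so the iteration converges on problems of this kind; how the paper selects the relaxation factor is not stated in the abstract.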
Pages: 16