Toward High Quality Multi-Object Tracking and Segmentation Without Mask Supervision

被引:1
|
作者
Cheng, Wensheng [1 ]
Wu, Yi [2 ]
Wu, Zhenyu [2 ]
Ling, Haibin [1 ]
Hua, Gang [2 ]
机构
[1] SUNY Stony Brook, Dept Comp Sci, New York 11794, NY USA
[2] Wormpex AI Res LLC, Bellevue, WA 98004 USA
关键词
Image segmentation; Task analysis; Feature extraction; Training; Trajectory; Optical flow; Pipelines; Multi-object tracking and segmentation; temporal information; pairwise consistency; pair-based sampling;
D O I
10.1109/TIP.2024.3403497
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently studies have shown the potential of weakly supervised multi-object tracking and segmentation, but the drawbacks of coarse pseudo mask label and limited utilization of temporal information remain to be unresolved. To address these issues, we present a framework that directly uses box label to supervise the segmentation network without resorting to pseudo mask label. In addition, we propose to fully exploit the temporal information from two perspectives. Firstly, we integrate optical flow-based pairwise consistency to ensure mask consistency across frames, thereby improving mask quality for segmentation. Secondly, we propose a temporally adjacent pair-based sampling strategy to adapt instance embedding learning for data association in tracking. We combine these techniques into an end-to-end deep model, named BoxMOTS, which requires only box annotation without mask supervision. Extensive experiments demonstrate that our model surpasses current state-of-the-art by a large margin, and produces promising results on KITTI MOTS and BDD100K MOTS. The source code is available at https://github.com/Spritea/BoxMOTS.
引用
收藏
页码:3369 / 3384
页数:16
相关论文
共 50 条
  • [1] MOTS: Multi-Object Tracking and Segmentation
    Voigtlaender, Paul
    Krause, Michael
    Osep, Aljosa
    Luiten, Jonathon
    Sekar, Berin Balachandar Gnana
    Geiger, Andreas
    Leibe, Bastian
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7934 - 7943
  • [2] Weakly Supervised Multi-Object Tracking and Segmentation
    Ruiz, Idoia
    Porzi, Lorenzo
    Bulo, Samuel Rota
    Kontschieder, Peter
    Serrat, Joan
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2021), 2021, : 125 - 133
  • [3] Multi-object tracking by mutual supervision of CNN and particle filter
    Xia Y.
    Qu S.
    Goudos S.
    Bai Y.
    Wan S.
    Personal and Ubiquitous Computing, 2021, 25 (6) : 979 - 988
  • [4] A Framework to Combine Multi-Object Video Segmentation and Tracking
    Nadeem, Sehr
    Rahman, Anis
    Butt, Asad A.
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 525 - 531
  • [5] Leveraging Weak Segmentation for Multi-object Tracking System
    Wang, JiaXin
    Ma, CuiXia
    Wang, Hao
    Wang, HongAn
    2017 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2017, : 63 - 68
  • [6] Multi-Object Tracking Based on Segmentation and Collision Avoidance
    Meng Zhao
    Junhui Wang
    Maoyong Cao
    Peirui Bai
    Hongyan Gu
    Mingtao Pei
    Journal of Beijing Institute of Technology, 2018, 27 (02) : 213 - 219
  • [7] Multi-Object Tracking, Segmentation and Validation in Thermal Images
    Muresan, Mircea Paul
    Danescu, Radu
    Nedevschi, Sergiu
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [8] An Object Point Set Inductive Tracker for Multi-Object Tracking and Segmentation
    Gao, Yan
    Xu, Haojun
    Zheng, Yu
    Li, Jie
    Gao, Xinbo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6083 - 6096
  • [9] Segmentation, Ordering and Multi-Object Tracking using Graphical Models
    Wang, Chaohui
    de La Gorce, Martin
    Paragios, Nikos
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 747 - 754
  • [10] Learning Multi-Object Tracking and Segmentation from Automatic Annotations
    Porzi, Lorenzo
    Hofinger, Markus
    Ruiz, Idoia
    Serrat, Joan
    Bulo, Samuel Rota
    Kontschieder, Peter
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 6845 - 6854