Toward High Quality Multi-Object Tracking and Segmentation Without Mask Supervision

Cited by: 1
Authors
Cheng, Wensheng [1]
Wu, Yi [2]
Wu, Zhenyu [2]
Ling, Haibin [1]
Hua, Gang [2]
Affiliations
[1] SUNY Stony Brook, Dept Comp Sci, New York 11794, NY USA
[2] Wormpex AI Res LLC, Bellevue, WA 98004 USA
Keywords
Image segmentation; Task analysis; Feature extraction; Training; Trajectory; Optical flow; Pipelines; Multi-object tracking and segmentation; temporal information; pairwise consistency; pair-based sampling;
DOI
10.1109/TIP.2024.3403497
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Recent studies have shown the potential of weakly supervised multi-object tracking and segmentation, but the drawbacks of coarse pseudo mask labels and limited use of temporal information remain unresolved. To address these issues, we present a framework that directly uses box labels to supervise the segmentation network without resorting to pseudo mask labels. In addition, we propose to fully exploit temporal information from two perspectives. First, we integrate optical flow-based pairwise consistency to ensure mask consistency across frames, thereby improving mask quality for segmentation. Second, we propose a temporally adjacent pair-based sampling strategy to adapt instance embedding learning for data association in tracking. We combine these techniques into an end-to-end deep model, named BoxMOTS, which requires only box annotations without mask supervision. Extensive experiments demonstrate that our model surpasses the current state of the art by a large margin and produces promising results on KITTI MOTS and BDD100K MOTS. The source code is available at https://github.com/Spritea/BoxMOTS.
Pages: 3369-3384
Number of pages: 16
Related papers
50 in total
  • [31] TrackFormer: Multi-Object Tracking with Transformers
    Meinhardt, Tim
    Kirillov, Alexander
    Leal-Taixe, Laura
    Feichtenhofer, Christoph
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8834 - 8844
  • [32] Interacting Tracklets for Multi-Object Tracking
    Lan, Long
    Wang, Xinchao
    Zhang, Shiliang
    Tao, Dacheng
    Gao, Wen
    Huang, Thomas S.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (09) : 4585 - 4597
  • [33] Engineering statistics for multi-object tracking
    Mahler, R
    2001 IEEE WORKSHOP ON MULTI-OBJECT TRACKING, PROCEEDINGS, 2001, : 53 - 60
  • [34] Multi-object tracking for horse racing
    Ng, Wing W. Y.
    Liu, Xuyu
    Yan, Xuli
    Tian, Xing
    Zhong, Cankun
    Kwong, Sam
    INFORMATION SCIENCES, 2023, 638
  • [35] Relational Prior for Multi-Object Tracking
    Moskalev, Artem
    Sosnovik, Ivan
    Smeulders, Arnold
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1081 - 1085
  • [36] Multi-Object Tracking with Distributed Sensing
    Dias, Ricardo
    Lau, Nuno
    Silva, Joao
    Lim, Gi Hyun
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS (MFI), 2016, : 564 - 569
  • [37] MeMOT: Multi-Object Tracking with Memory
    Cai, Jiarui
    Xu, Mingze
    Li, Wei
    Xiong, Yuanjun
    Xia, Wei
    Tu, Zhuowen
    Soatto, Stefano
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8080 - 8090
  • [38] A Robust Framework for Multi-object Tracking
    Jalal, Anand Singh
    Singh, Vrijendra
    ADVANCES IN COMPUTING AND COMMUNICATIONS, PT 4, 2011, 193 : 329 - 338
  • [39] HumanTop: a multi-object tracking tabletop
    Soto Candela, Emilio
    Ortega Perez, Mario
    Marin Romero, Clemente
    Perez Lopez, David C.
    Salvador Herranz, Gustavo
    Contero, Manuel
    Alcaniz Raya, Mariano
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 70 (03) : 1837 - 1868
  • [40] SiamMOT: Siamese Multi-Object Tracking
    Shuai, Bing
    Berneshawi, Andrew
    Li, Xinyu
    Modolo, Davide
    Tighe, Joseph
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12367 - 12377