TensorMask: A Foundation for Dense Object Segmentation

被引:257
|
作者
Chen, Xinlei [1 ]
Girshick, Ross [1 ]
He, Kaiming [1 ]
Dollar, Piotr [1 ]
机构
[1] Facebook AI Res FAIR, Menlo Pk, CA 94025 USA
关键词
D O I
10.1109/ICCV.2019.00215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sliding-window object detectors that generate bounding-box object predictions over a dense, regular grid have advanced rapidly and proven popular. In contrast, modern instance segmentation approaches are dominated by methods that first detect object bounding boxes, and then crop and segment these regions, as popularized by Mask R-CNN. In this work, we investigate the paradigm of dense sliding-window instance segmentation, which is surprisingly under-explored. Our core observation is that this task is fundamentally different than other dense prediction tasks such as semantic segmentation or bounding-box object detection, as the output at every spatial location is itself a geometric structure with its own spatial dimensions. To formalize this, we treat dense instance segmentation as a prediction task over 4D tensors and present a general framework called TensorMask that explicitly captures this geometry and enables novel operators on 4D tensors. We demonstrate that the tensor view leads to large gains over baselines that ignore this structure, and leads to results comparable to Mask R-CNN. These promising results suggest that TensorMask can serve as a foundation for novel advances in dense mask prediction and a more complete understanding of the task. Code will be made available.
引用
收藏
页码:2061 / 2069
页数:9
相关论文
共 50 条
  • [21] Bidirectionally Learning Dense Spatio-temporal Feature Propagation Network for Unsupervised Video Object Segmentation
    Fan, Jiaqing
    Su, Tiankang
    Zhang, Kaihua
    Liu, Qingshan
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3646 - 3655
  • [22] Multi-appearance segmentation and extended 0-1 programming for dense small object tracking
    Chen, Longtao
    Ren, Mingwu
    PLOS ONE, 2018, 13 (10):
  • [23] Moving object Segmentation
    Zinbi, Youssef
    Chahir, Youssef
    Elmoataz, Abder
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 1132 - 1136
  • [24] DENSE CONVOLUTION FOR SEMANTIC SEGMENTATION
    Han, Chaoyi
    Tao, Xiaoming
    Duan, Yiping
    Lu, Jianhua
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2222 - 2226
  • [25] Foundation of the taxonomic object system
    Zendler, AM
    INFORMATION AND SOFTWARE TECHNOLOGY, 1998, 40 (09) : 475 - 492
  • [26] Object recognition in dense clutter
    Mary J. Bravo
    Hany Farid
    Perception & Psychophysics, 2006, 68 : 911 - 918
  • [27] Bayesian object segmentation
    Reno, AL
    INTERNATIONAL CONFERENCE ON IMAGING SCIENCE, SYSTEMS, AND TECHNOLOGY, PROCEEDINGS, 1999, : 35 - 41
  • [28] Object recognition in dense clutter
    Bravo, Mary J.
    Farid, Hany
    PERCEPTION & PSYCHOPHYSICS, 2006, 68 (06): : 911 - 918
  • [29] Segmentation of dense leukocyte clusters
    Nilsson, B
    Heyden, A
    IEEE WORKSHOP ON MATHEMATICAL METHODS IN BIOMEDICAL IMAGE ANALYSIS, PROCEEDINGS, 2001, : 221 - 227
  • [30] Moving Object Tracking Using Object Segmentation
    Singh, Sanjay
    Dunga, Srinivasa Murali
    Mandal, A. S.
    Shekhar, Chandra
    Vohra, Anil
    INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 691 - +