TensorMask: A Foundation for Dense Object Segmentation

被引:257
|
作者
Chen, Xinlei [1 ]
Girshick, Ross [1 ]
He, Kaiming [1 ]
Dollar, Piotr [1 ]
机构
[1] Facebook AI Res FAIR, Menlo Pk, CA 94025 USA
关键词
D O I
10.1109/ICCV.2019.00215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sliding-window object detectors that generate bounding-box object predictions over a dense, regular grid have advanced rapidly and proven popular. In contrast, modern instance segmentation approaches are dominated by methods that first detect object bounding boxes, and then crop and segment these regions, as popularized by Mask R-CNN. In this work, we investigate the paradigm of dense sliding-window instance segmentation, which is surprisingly under-explored. Our core observation is that this task is fundamentally different than other dense prediction tasks such as semantic segmentation or bounding-box object detection, as the output at every spatial location is itself a geometric structure with its own spatial dimensions. To formalize this, we treat dense instance segmentation as a prediction task over 4D tensors and present a general framework called TensorMask that explicitly captures this geometry and enables novel operators on 4D tensors. We demonstrate that the tensor view leads to large gains over baselines that ignore this structure, and leads to results comparable to Mask R-CNN. These promising results suggest that TensorMask can serve as a foundation for novel advances in dense mask prediction and a more complete understanding of the task. Code will be made available.
引用
收藏
页码:2061 / 2069
页数:9
相关论文
共 50 条
  • [31] Object proposals for salient object segmentation in videos
    Kalboussi, Rahma
    Azaza, Aymen
    van de Weijer, Joost
    Abdellaoui, Mehrez
    Douik, Ali
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (13-14) : 8677 - 8693
  • [32] Object proposals for salient object segmentation in videos
    Rahma Kalboussi
    Aymen Azaza
    Joost van de Weijer
    Mehrez Abdellaoui
    Ali Douik
    Multimedia Tools and Applications, 2020, 79 : 8677 - 8693
  • [33] Learning Object Context for Dense Captioning
    Li, Xiangyang
    Jiang, Shuqiang
    Han, Jungong
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8650 - 8657
  • [34] A foundation for the concept of role in object modelling
    Genilloud, G
    Wegmann, A
    FOURTH INTERNATIONAL ENTERPRISE DISTRIBUTED OBJECT COMPUTING CONFERENCE - PROCEEDINGS, 2000, : 76 - 85
  • [35] DWSD: Dense waste segmentation dataset
    Ali, Asfak
    Acharjee, Suvojit
    Manarul, S. K. Md.
    Alharthi, Salman Z.
    Chaudhuri, Sheli Sinha
    Akhunzada, Adnan
    DATA IN BRIEF, 2025, 59
  • [36] Dense Unsupervised Learning for Video Segmentation
    Araslanov, Nikita
    Schaub-Meyer, Simone
    Roth, Stefan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [37] Disentangle Your Dense Object Detector
    Chen, Zehui
    Yang, Chenhongyi
    Li, Qiaofei
    Zhao, Feng
    Zha, Zheng-Jun
    Wu, Feng
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4939 - 4948
  • [38] DENSE DECONVOLUTIONAL NETWORK FOR SEMANTIC SEGMENTATION
    Yang, Wenbin
    Zhou, Quan
    Lu, Jingnan
    Wu, Xiaofu
    Zhang, Suofei
    Latecki, Longin Jan
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1573 - 1577
  • [39] Dense Segmentation-aware Descriptors
    Trulls, Eduard
    Kokkinos, Iasonas
    Sanfeliu, Alberto
    Moreno-Noguer, Francesc
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2890 - 2897
  • [40] Dense Receptive Field for Object Detection
    Yao, Yongqiang
    Dong, Yuan
    Huang, Zesang
    Bai, Hongliang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1815 - 1820