TensorMask: A Foundation for Dense Object Segmentation

被引:257
|
作者
Chen, Xinlei [1 ]
Girshick, Ross [1 ]
He, Kaiming [1 ]
Dollar, Piotr [1 ]
机构
[1] Facebook AI Res FAIR, Menlo Pk, CA 94025 USA
关键词
D O I
10.1109/ICCV.2019.00215
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sliding-window object detectors that generate bounding-box object predictions over a dense, regular grid have advanced rapidly and proven popular. In contrast, modern instance segmentation approaches are dominated by methods that first detect object bounding boxes, and then crop and segment these regions, as popularized by Mask R-CNN. In this work, we investigate the paradigm of dense sliding-window instance segmentation, which is surprisingly under-explored. Our core observation is that this task is fundamentally different than other dense prediction tasks such as semantic segmentation or bounding-box object detection, as the output at every spatial location is itself a geometric structure with its own spatial dimensions. To formalize this, we treat dense instance segmentation as a prediction task over 4D tensors and present a general framework called TensorMask that explicitly captures this geometry and enables novel operators on 4D tensors. We demonstrate that the tensor view leads to large gains over baselines that ignore this structure, and leads to results comparable to Mask R-CNN. These promising results suggest that TensorMask can serve as a foundation for novel advances in dense mask prediction and a more complete understanding of the task. Code will be made available.
引用
收藏
页码:2061 / 2069
页数:9
相关论文
共 50 条
  • [1] Video Object Segmentation Via Dense Trajectories
    Chen, Lin
    Shen, Jianbing
    Wang, Wenguan
    Ni, Bingbing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (12) : 2225 - 2234
  • [2] Joint Optimization for Object Class Segmentation and Dense Stereo Reconstruction
    Lubor Ladický
    Paul Sturgess
    Chris Russell
    Sunando Sengupta
    Yalin Bastanlar
    William Clocksin
    Philip H. S. Torr
    International Journal of Computer Vision, 2012, 100 : 122 - 133
  • [3] Joint Optimization for Object Class Segmentation and Dense Stereo Reconstruction
    Ladicky, Lubor
    Sturgess, Paul
    Russell, Chris
    Sengupta, Sunando
    Bastanlar, Yalin
    Clocksin, William
    Torr, Philip H. S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (02) : 122 - 133
  • [4] Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions
    Zhang, Dong
    Javed, Omar
    Shah, Mubarak
    2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 628 - 635
  • [5] Efficient, Dense, Object-based Segmentation from RGBD Video
    Ghafarianzadeh, Mahsa
    Blaschko, Matthew B.
    Sibley, Gabe
    2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 2310 - 2317
  • [6] Foreground Object Segmentation from Dense Multi-view Images
    Fan Liangzhong
    Yu Xin
    Shu Zhenyu
    2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL I, 2009, : 473 - 476
  • [7] Object recognition and segmentation by non-rigid quasi-dense matching
    Kannala, Juho
    Rahtu, Esa
    Brandt, Sarni S.
    Heikkila, Janne
    2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 1006 - 1013
  • [8] Deep-dense Conditional Random Fields for Object Co-segmentation
    Yuan, Zehuan
    Lu, Tong
    Wu, Yirui
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3371 - 3377
  • [9] Dense estimation and object-based segmentation of the optical flow with robust techniques
    Memin, E
    Perez, P
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (05) : 703 - 719
  • [10] Towards Dense Moving Object Segmentation based Robust Dense RGB-D SLAM in Dynamic Scenarios
    Wang, Youbing
    Huang, Shoudong
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 1841 - 1846