Bottom-Up Shift and Reasoning for Referring Image Segmentation

Cited by: 58
Authors:
Yang, Sibei [1 ]
Xia, Meng [2 ]
Li, Guanbin [2 ]
Zhou, Hong-Yu [3 ]
Yu, Yizhou [3 ,4 ]
Affiliations:
[1] ShanghaiTech Univ, Shanghai, Peoples R China
[2] Sun Yat Sen Univ, Guangzhou, Peoples R China
[3] Univ Hong Kong, Hong Kong, Peoples R China
[4] Deepwise AI Lab, Beijing, Peoples R China
Funding:
National Natural Science Foundation of China;
Keywords:
DOI:
10.1109/CVPR46437.2021.01111
Chinese Library Classification (CLC):
TP18 [Artificial Intelligence Theory];
Discipline Codes:
081104; 0812; 0835; 1405;
Abstract:
Referring image segmentation aims to segment the referent, i.e., the object or stuff in an image referred to by a natural language expression. Its main challenge lies in effectively and efficiently differentiating the referent from other objects of the same category. In this paper, we tackle this challenge by jointly performing compositional visual reasoning and accurate segmentation in a single stage via the proposed novel Bottom-Up Shift (BUS) and Bidirectional Attentive Refinement (BIAR) modules. Specifically, BUS progressively locates the referent along hierarchical reasoning steps implied by the expression. At each step, it locates the corresponding visual region by disambiguating between similar regions, where the disambiguation is based on the relationships between regions. Through this explainable visual reasoning, BUS explicitly aligns linguistic components with visual regions so that it can identify all the entities mentioned in the expression. BIAR fuses multi-level features via two-way attentive message passing, which captures the visual details relevant to the referent to refine segmentation results. Experimental results demonstrate that the proposed method, consisting of the BUS and BIAR modules, not only consistently surpasses all existing state-of-the-art algorithms on common benchmark datasets but also visualizes interpretable reasoning steps for stepwise segmentation. Code is available at https://github.com/incredibleXM/BUSNet.
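The abstract describes BUS as iteratively re-grounding attention over candidate regions, one linguistic component at a time, using inter-region relationships to disambiguate between look-alike regions. The following is a minimal PyTorch sketch of one such reasoning step, written only to illustrate that idea; the class name, tensor shapes, and the specific relationship encoding are assumptions for illustration and do not reproduce the authors' released implementation (see the linked repository for that).

```python
import torch
import torch.nn as nn


class BottomUpShiftStep(nn.Module):
    """Conceptual sketch of one reasoning step: shift attention from the
    regions grounded at the previous step to the regions named by the
    current expression component, using relationship-aware features to
    disambiguate between visually similar regions."""

    def __init__(self, dim: int):
        super().__init__()
        self.rel = nn.Linear(2 * dim, dim)   # encodes region/context relationships
        self.score = nn.Linear(dim, 1)       # scores phrase-region compatibility

    def forward(self, regions: torch.Tensor, phrase: torch.Tensor,
                prev_attn: torch.Tensor) -> torch.Tensor:
        # regions:   (N, D) visual region features
        # phrase:    (D,)   embedding of the current linguistic component
        # prev_attn: (N,)   attention over regions from the previous step
        n, d = regions.shape
        context = prev_attn @ regions                              # (D,) already-grounded context
        pair = torch.cat([regions, context.expand(n, d)], dim=-1)  # (N, 2D)
        rel = torch.tanh(self.rel(pair))                           # (N, D) relationship-aware features
        logits = self.score(rel * phrase).squeeze(-1)              # (N,) matching scores
        return torch.softmax(logits, dim=0)                        # shifted attention over regions


# Toy usage: chain steps over parsed components of an expression such as
# "umbrella held by the woman", starting from uniform attention.
regions = torch.randn(20, 256)                 # 20 candidate region features
step = BottomUpShiftStep(dim=256)
attn = torch.full((20,), 1.0 / 20)
for phrase in torch.randn(3, 256):             # one embedding per component
    attn = step(regions, phrase, attn)
```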
Pages: 11261-11270
Page count: 10