Delving into Shape-aware Zero-shot Semantic Segmentation

被引:8
|
作者
Liu, Xinyu [1 ,2 ]
Tian, Beiwen [2 ,4 ]
Wang, Zhen [3 ]
Wang, Rui [3 ]
Sheng, Kehua [3 ]
Zhang, Bo [3 ]
Zhao, Hao [2 ]
Zhou, Guyue [2 ]
机构
[1] Xidian Univ, Xian, Peoples R China
[2] Didi Chuxing, Beijing, Peoples R China
[3] Tsinghua Univ, lnst Ind Res AIR, Beijing, Peoples R China
[4] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.00293
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thanks to the impressive progress of large-scale vision-language pretraining, recent recognition models can classify arbitrary objects in a zero-shot and open-set manner, with a surprisingly high accuracy. However, translating this success to semantic segmentation is not trivial, because this dense prediction task requires not only accurate semantic understanding but also fine shape delineation and existing vision-language models are trained with image-level language descriptions. To bridge this gap, we pursue shape-aware zero-shot semantic segmentation in this study. Inspired by classical spectral methods in the image segmentation literature, we propose to leverage the eigen vectors of Laplacian matrices constructed with self-supervised pixel-wise features to promote shape-awareness. Despite that this simple and effective technique does not make use of the masks of seen classes at all, we demonstrate that it outperforms a state-of-the-art shape-aware formulation that aligns ground truth and predicted edges during training. We also delve into the performance gains achieved on different datasets using different backbones and draw several interesting and conclusive observations: the benefits of promoting shape-awareness highly relates to mask compactness and language embedding locality. Finally, our method sets new state-of-the-art performance for zero-shot semantic segmentation on both Pascal and COCO, with significant margins. Code and models will be accessed at SAZS.
引用
收藏
页码:2999 / 3009
页数:11
相关论文
共 50 条
  • [1] Zero-Shot Semantic Segmentation
    Bucher, Maxime
    Vu, Tuan-Hung
    Cord, Matthieu
    Perez, Patrick
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [2] Uncertainty-Aware Learning for Zero-Shot Semantic Segmentation
    Hu, Ping
    Sclaroff, Stan
    Saenko, Kate
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [3] Context-aware Feature Generation for Zero-shot Semantic Segmentation
    Gu, Zhangxuan
    Zhou, Siyuan
    Niu, Li
    Zhao, Zihan
    Zhang, Liqing
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1921 - 1929
  • [4] Decoupling Zero-Shot Semantic Segmentation
    Ding, Jian
    Xue, Nan
    Xia, Gui-Song
    Dai, Dengxin
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11573 - 11582
  • [5] Zero-Shot Point Cloud Segmentation by Semantic-Visual Aware Synthesis
    Yang, Yuwei
    Hayat, Munawar
    Jin, Zhao
    Zhu, Hongyuan
    Lei, Yinjie
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 11552 - 11562
  • [6] Recursive Training for Zero-Shot Semantic Segmentation
    Wang, Ce
    Farazi, Moshiur
    Barnes, Nick
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] A meaningful learning method for zero-shot semantic segmentation
    Liu, Xianglong
    Bai, Shihao
    An, Shan
    Wang, Shuo
    Liu, Wei
    Zhao, Xiaowei
    Ma, Yuqing
    SCIENCE CHINA-INFORMATION SCIENCES, 2023, 66 (11)
  • [8] A meaningful learning method for zero-shot semantic segmentation
    Xianglong LIU
    Shihao BAI
    Shan AN
    Shuo WANG
    Wei LIU
    Xiaowei ZHAO
    Yuqing MA
    Science China(Information Sciences), 2023, 66 (11) : 35 - 53
  • [9] A meaningful learning method for zero-shot semantic segmentation
    Xianglong Liu
    Shihao Bai
    Shan An
    Shuo Wang
    Wei Liu
    Xiaowei Zhao
    Yuqing Ma
    Science China Information Sciences, 2023, 66
  • [10] Zero-shot Semantic Segmentation Using Relation Network
    Zhang, Yindong
    Khriyenko, Oleksiy
    PROCEEDINGS OF THE 28TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION FRUCT, 2021, : 516 - 527