Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation

被引:13
|
作者
Chen, Yadang [1 ,2 ]
Jiang, Ren [1 ,2 ]
Zheng, Yuhui [3 ,4 ]
Sheng, Bin [5 ]
Yang, Zhi-Xin [6 ]
Wu, Enhua [7 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
[3] Qinghai Normal Univ, Coll Comp, Xining 810016, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[6] Univ Macau, Dept Electromech Engn, State Key Lab Internet Things Smart City, Macau, Peoples R China
[7] Chinese Acad Sci, State Key Lab Comp Sci, Inst Software, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypes; Training; Semantics; Semantic segmentation; Self-supervised learning; Feature extraction; Measurement; Few-shot learning; semantic segmentation; contrastive learning; metric learning; NETWORK;
D O I
10.1109/TIP.2024.3364056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few -shot semantic segmentation aims to segment novel -class objects in a query image with only a few annotated examples in support images. Although progress has been made recently by combining prototype -based metric learning, existing methods still face two main challenges. First, various intra-class objects between the support and query images or semantically similar inter -class objects can seriously harm the segmentation performance due to their poor feature representations. Second, the latent novel classes are treated as the background in most methods, leading to a learning bias, whereby these novel classes are difficult to correctly segment as foreground. To solve these problems, we propose a dual -branch learning method. The class -specific branch encourages representations of objects to be more distinguishable by increasing the inter -class distance while decreasing the intra-class distance. In parallel, the class -agnostic branch focuses on minimizing the foreground class feature distribution and maximizing the features between the foreground and background, thus increasing the generalizability to novel classes in the test stage. Furthermore, to obtain more representative features, pixel -level and prototype -level semantic learning are both involved in the two branches. The method is evaluated on PASCAL -5(i) 1 -shot, PASCAL -5(i) 5 -shot, COCO-20(i) 1 -shot, and COCO-20(i) 5 -shot, and extensive experiments show that our approach is effective for few -shot semantic segmentation despite its simplicity.
引用
收藏
页码:1432 / 1447
页数:16
相关论文
共 50 条
  • [41] Few-Shot Semantic Segmentation with Cyclic Memory Network
    Xie, Guo-Sen
    Xiong, Huan
    Liu, Jie
    Yao, Yazhou
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7273 - 7282
  • [42] Query semantic reconstruction for background in few-shot segmentation
    Haoyan Guan
    Michael Spratling
    The Visual Computer, 2024, 40 (2) : 799 - 810
  • [43] Few-Shot Semantic Segmentation via Mask Aggregation
    Ao, Wei
    Zheng, Shunyi
    Meng, Yan
    Yang, Yang
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [44] Query semantic reconstruction for background in few-shot segmentation
    Guan, Haoyan
    Spratling, Michael
    VISUAL COMPUTER, 2024, 40 (02): : 799 - 810
  • [45] Deep Reasoning Network for Few-shot Semantic Segmentation
    Zhuge, Yunzhi
    Shen, Chunhua
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5344 - 5352
  • [46] Incorporating Depth Information into Few-Shot Semantic Segmentation
    Zhang, Yifei
    Sidibe, Desire
    Morel, Olivier
    Meriaudeau, Fabrice
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3582 - 3588
  • [47] Dynamic Extension Nets for Few-shot Semantic Segmentation
    Liu, Lizhao
    Cao, Junyi
    Liu, Minqian
    Guo, Yong
    Chen, Qi
    Tan, Mingkui
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1441 - 1449
  • [48] Few-shot semantic segmentation: a review on recent approaches
    Zhaobin Chang
    Yonggang Lu
    Xingcheng Ran
    Xiong Gao
    Xiangwen Wang
    Neural Computing and Applications, 2023, 35 : 18251 - 18275
  • [49] Few-Shot Semantic Segmentation for Complex Driving Scenes
    Zhou, Jingxing
    Chen, Ruei-Bo
    Beyerer, Juergen
    2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 695 - 702
  • [50] Prediction Calibration for Generalized Few-Shot Semantic Segmentation
    Lu, Zhihe
    He, Sen
    Li, Da
    Song, Yi-Zhe
    Xiang, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3311 - 3323