Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation

被引:13
|
作者
Chen, Yadang [1 ,2 ]
Jiang, Ren [1 ,2 ]
Zheng, Yuhui [3 ,4 ]
Sheng, Bin [5 ]
Yang, Zhi-Xin [6 ]
Wu, Enhua [7 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
[3] Qinghai Normal Univ, Coll Comp, Xining 810016, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[6] Univ Macau, Dept Electromech Engn, State Key Lab Internet Things Smart City, Macau, Peoples R China
[7] Chinese Acad Sci, State Key Lab Comp Sci, Inst Software, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypes; Training; Semantics; Semantic segmentation; Self-supervised learning; Feature extraction; Measurement; Few-shot learning; semantic segmentation; contrastive learning; metric learning; NETWORK;
D O I
10.1109/TIP.2024.3364056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few -shot semantic segmentation aims to segment novel -class objects in a query image with only a few annotated examples in support images. Although progress has been made recently by combining prototype -based metric learning, existing methods still face two main challenges. First, various intra-class objects between the support and query images or semantically similar inter -class objects can seriously harm the segmentation performance due to their poor feature representations. Second, the latent novel classes are treated as the background in most methods, leading to a learning bias, whereby these novel classes are difficult to correctly segment as foreground. To solve these problems, we propose a dual -branch learning method. The class -specific branch encourages representations of objects to be more distinguishable by increasing the inter -class distance while decreasing the intra-class distance. In parallel, the class -agnostic branch focuses on minimizing the foreground class feature distribution and maximizing the features between the foreground and background, thus increasing the generalizability to novel classes in the test stage. Furthermore, to obtain more representative features, pixel -level and prototype -level semantic learning are both involved in the two branches. The method is evaluated on PASCAL -5(i) 1 -shot, PASCAL -5(i) 5 -shot, COCO-20(i) 1 -shot, and COCO-20(i) 5 -shot, and extensive experiments show that our approach is effective for few -shot semantic segmentation despite its simplicity.
引用
收藏
页码:1432 / 1447
页数:16
相关论文
共 50 条
  • [31] Harnessing Multi-Semantic Hypergraph for Few-Shot Learning
    Chen, Hao
    Li, Linyan
    Xia, Zhenping
    Lyu, Fan
    Zhao, Liuqing
    Huang, Kaizhu
    Feng, Wei
    Hu, Fuyuan
    PATTERN RECOGNITION AND COMPUTER VISION, PT I, PRCV 2022, 2022, 13534 : 232 - 244
  • [32] Dual-Branch Multi-Scale Relation Networks with Tutorial Learning for Few-Shot Learning
    Xu, Chuanyun
    Wang, Hang
    Zhang, Yang
    Zhou, Zheng
    Li, Gang
    APPLIED SCIENCES-BASEL, 2024, 14 (04):
  • [33] Learning Non-target Knowledge for Few-shot Semantic Segmentation
    Liu, Yuanwei
    Liu, Nian
    Cao, Qinglong
    Yao, Xiwen
    Han, Junwei
    Shao, Ling
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11563 - 11572
  • [34] Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation
    Wen, Chunlin
    Huang, Hui
    Ma, Yan
    Yuan, Feiniu
    Zhu, Hongqing
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 8874 - 8888
  • [35] PRIOR SEMANTIC HARMONIZATION NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Yang, Xinhao
    Ma, Liyan
    Zhou, Yang
    Peng, Yan
    Xie, Shaorong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1126 - 1130
  • [36] MIINet: a multi-branch information interaction network for few-shot segmentation
    Zhang, Zhaopeng
    Xu, Zhijie
    Zhang, Jianqin
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (12) : 9081 - 9094
  • [37] DSMF-Net: Dual Semantic Metric Learning Fusion Network for Few-Shot Aerial Image Semantic Segmentation
    Qi, Xiyu
    Zhang, Yidan
    Wang, Lei
    Wu, Yifan
    Xin, Yi
    Chen, Zhan
    Ge, Yunping
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 853 - 864
  • [38] Multi-level semantic information guided image generation for few-shot steel surface defect classification
    Hao, Liang
    Shen, Pei
    Pan, Zhiwei
    Xu, Yong
    FRONTIERS IN PHYSICS, 2023, 11
  • [39] Multi-Level Matching and Aggregation Network for Few-Shot Relation Classification
    Ye, Zhi-Xiu
    Ling, Zhen-Hua
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2872 - 2881
  • [40] Self-support Few-Shot Semantic Segmentation
    Fan, Qi
    Pei, Wenjie
    Tai, Yu-Wing
    Tang, Chi-Keung
    COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 : 701 - 719