Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation

被引:13
|
作者
Chen, Yadang [1 ,2 ]
Jiang, Ren [1 ,2 ]
Zheng, Yuhui [3 ,4 ]
Sheng, Bin [5 ]
Yang, Zhi-Xin [6 ]
Wu, Enhua [7 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
[3] Qinghai Normal Univ, Coll Comp, Xining 810016, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[6] Univ Macau, Dept Electromech Engn, State Key Lab Internet Things Smart City, Macau, Peoples R China
[7] Chinese Acad Sci, State Key Lab Comp Sci, Inst Software, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Prototypes; Training; Semantics; Semantic segmentation; Self-supervised learning; Feature extraction; Measurement; Few-shot learning; semantic segmentation; contrastive learning; metric learning; NETWORK;
D O I
10.1109/TIP.2024.3364056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few -shot semantic segmentation aims to segment novel -class objects in a query image with only a few annotated examples in support images. Although progress has been made recently by combining prototype -based metric learning, existing methods still face two main challenges. First, various intra-class objects between the support and query images or semantically similar inter -class objects can seriously harm the segmentation performance due to their poor feature representations. Second, the latent novel classes are treated as the background in most methods, leading to a learning bias, whereby these novel classes are difficult to correctly segment as foreground. To solve these problems, we propose a dual -branch learning method. The class -specific branch encourages representations of objects to be more distinguishable by increasing the inter -class distance while decreasing the intra-class distance. In parallel, the class -agnostic branch focuses on minimizing the foreground class feature distribution and maximizing the features between the foreground and background, thus increasing the generalizability to novel classes in the test stage. Furthermore, to obtain more representative features, pixel -level and prototype -level semantic learning are both involved in the two branches. The method is evaluated on PASCAL -5(i) 1 -shot, PASCAL -5(i) 5 -shot, COCO-20(i) 1 -shot, and COCO-20(i) 5 -shot, and extensive experiments show that our approach is effective for few -shot semantic segmentation despite its simplicity.
引用
收藏
页码:1432 / 1447
页数:16
相关论文
共 50 条
  • [1] Multi-level semantic adaptation for few-shot segmentation on cardiac image sequences
    Guo, Saidi
    Xu, Lin
    Feng, Cheng
    Xiong, Huahua
    Gao, Zhifan
    Zhang, Heye
    MEDICAL IMAGE ANALYSIS, 2021, 73
  • [2] Few-shot semantic segmentation via multi-level feature extraction and multi-prototype localization
    Hegui Zhu
    Jiayi Wang
    Yange Zhou
    Zhan Gao
    Libo Zhang
    Multimedia Tools and Applications, 2024, 83 : 50921 - 50953
  • [3] Few-shot semantic segmentation via multi-level feature extraction and multi-prototype localization
    Zhu, Hegui
    Wang, Jiayi
    Zhou, Yange
    Gao, Zhan
    Zhang, Libo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50921 - 50953
  • [4] Multi-level semantic-assisted prototype learning for Few-Shot Action Recognition
    Liu, Dan
    Xia, Qing
    Meng, Fanrong
    Ye, Mao
    Zhang, Jianwei
    NEUROCOMPUTING, 2025, 636
  • [5] LEARNING WITH MEMORY FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lu, Hongchao
    Wei, Chao
    Deng, Zhidong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 629 - 633
  • [6] Multi-Level Semantic Fusion Optimization for Few-shot Relation Classification
    Li, Peihong
    Cai, Fei
    Liu, Dengfeng
    Wang, Siyuan
    Liu, Shixian
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 206 - 212
  • [7] Multi-level Attention Feature Network for Few-shot Learning
    Wang Ronggui
    Han Mengya
    Yang Juan
    Xue Lixia
    Hu Min
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (03) : 772 - 778
  • [8] Multi-Level Second-Order Few-Shot Learning
    Zhang, Hongguang
    Li, Hongdong
    Koniusz, Piotr
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 2111 - 2126
  • [9] Multi-level Attention Feature Network for Few-shot Learning
    Wang R.
    Han M.
    Yang J.
    Xue L.
    Hu M.
    Yang, Juan (yangjuan@hfut.edu.cn), 1600, Science Press (42): : 772 - 778
  • [10] Multi-level Metric Learning for Few-Shot Image Recognition
    Chen, Haoxing
    Li, Huaxiong
    Li, Yaohui
    Chen, Chunlin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 243 - 254