Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation

Cited by: 13
Authors
Chen, Yadang [1, 2]
Jiang, Ren [1, 2]
Zheng, Yuhui [3, 4]
Sheng, Bin [5]
Yang, Zhi-Xin [6]
Wu, Enhua [7]
Affiliations
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Minist Educ, Engn Res Ctr Digital Forens, Nanjing 210044, Peoples R China
[3] Qinghai Normal Univ, Coll Comp, Xining 810016, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[5] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[6] Univ Macau, Dept Electromech Engn, State Key Lab Internet Things Smart City, Macau, Peoples R China
[7] Chinese Acad Sci, State Key Lab Comp Sci, Inst Software, Beijing 100190, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Prototypes; Training; Semantics; Semantic segmentation; Self-supervised learning; Feature extraction; Measurement; Few-shot learning; semantic segmentation; contrastive learning; metric learning; NETWORK;
DOI
10.1109/TIP.2024.3364056
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Few-shot semantic segmentation aims to segment novel-class objects in a query image with only a few annotated examples in support images. Although progress has been made recently by combining prototype-based metric learning, existing methods still face two main challenges. First, various intra-class objects between the support and query images or semantically similar inter-class objects can seriously harm the segmentation performance due to their poor feature representations. Second, the latent novel classes are treated as the background in most methods, leading to a learning bias, whereby these novel classes are difficult to correctly segment as foreground. To solve these problems, we propose a dual-branch learning method. The class-specific branch encourages representations of objects to be more distinguishable by increasing the inter-class distance while decreasing the intra-class distance. In parallel, the class-agnostic branch focuses on minimizing the foreground class feature distribution and maximizing the features between the foreground and background, thus increasing the generalizability to novel classes in the test stage. Furthermore, to obtain more representative features, pixel-level and prototype-level semantic learning are both involved in the two branches. The method is evaluated on PASCAL-5(i) 1-shot, PASCAL-5(i) 5-shot, COCO-20(i) 1-shot, and COCO-20(i) 5-shot, and extensive experiments show that our approach is effective for few-shot semantic segmentation despite its simplicity.
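As an illustration of the prototype-based metric learning that the abstract builds on, the sketch below shows the generic idea only: average the support features under the mask to get a foreground and a background prototype, then label each query pixel by its nearest prototype under cosine similarity. It is not the paper's dual-branch method, which this record does not describe in detail. The function names (`masked_prototypes`, `segment_query`), the flattened per-pixel feature layout, and the toy 2-D features are all assumptions made for the example.

```python
import math


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity between two vectors (0.0 if either has zero norm)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


def masked_prototypes(features, mask):
    """Masked average pooling over flattened support pixels.

    Assumes the mask contains at least one foreground (1) and one
    background (0) pixel; this is a hypothetical helper, not the paper's code.
    """
    foreground = [f for f, m in zip(features, mask) if m == 1]
    background = [f for f, m in zip(features, mask) if m == 0]
    return mean_vector(foreground), mean_vector(background)


def segment_query(query_features, fg_proto, bg_proto):
    """Label a query pixel 1 if it is closer to the foreground prototype."""
    return [
        1 if cosine(q, fg_proto) > cosine(q, bg_proto) else 0
        for q in query_features
    ]


# Toy 1-shot episode: four support pixels and three query pixels, 2-D features.
support_features = [[1.0, 0.1], [0.9, 0.0], [0.1, 1.0], [0.0, 0.8]]
support_mask = [1, 1, 0, 0]
query_features = [[0.8, 0.2], [0.2, 0.9], [1.0, 0.0]]

fg_proto, bg_proto = masked_prototypes(support_features, support_mask)
prediction = segment_query(query_features, fg_proto, bg_proto)
print(prediction)  # [1, 0, 1]
```

In a real system the features would come from a convolutional or transformer backbone, and for the 5-shot setting a common choice is to average the prototypes over the five support images. The contrastive objectives that the abstract attributes to the two branches would act on these same features and prototypes during training.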
Pages: 1432 - 1447
Number of pages: 16
Related papers
50 in total
  • [21] Multi-Level Feature-Guided Network for Few-shot Medical Image Segmentation
    Shen, Yue
    Fan, Wanshu
    Han, Zhongbin
    Zhou, Dongsheng
    PROCEEDINGS OF THE 2024 27TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1346 - 1351
  • [22] Multi-level adaptive few-shot learning network combined with vision transformer
    Zhu H.
    Cai X.
    Dou J.
    Gao Z.
    Zhang L.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12477 - 12491
  • [23] Few-shot Segmentation and Semantic Segmentation for Underwater Imagery
    Kabir, Imran
    Shaurya, Shubham
    Maigur, Vijayalaxmi
    Thakurdesai, Nikhil
    Latnekar, Mahesh
    Raunak, Mayank
    Crandall, David
    Reza, Md Alimoor
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 11451 - 11457
  • [24] A Dual Attention Network with Semantic Embedding for Few-Shot Learning
    Yan, Shipeng
    Zhang, Songyang
    He, Xuming
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9079 - 9086
  • [25] Multi-level Semantic Fusion Network For Few-shot Multimedia Image Recognition In Education Management
    Yuan, Chunlin
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2025, 28 (02) : 227 - 235
  • [26] Multi-Level Correlation Network For Few-Shot Image Classification
    Dang, Yunkai
    Sun, Meijun
    Zhang, Min
    Chen, Zhengyu
    Zhang, Xinliang
    Wang, Zheng
    Wang, Donglin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2909 - 2914
  • [27] Multi-level alignment for few-shot temporal action localization
    Keisham, Kanchan
    Jalali, Amin
    Kim, Jonghong
    Lee, Minho
    INFORMATION SCIENCES, 2023, 650
  • [28] Learning Meta-class Memory for Few-Shot Semantic Segmentation
    Wu, Zhonghua
    Shi, Xiangxi
    Lin, Guosheng
    Cai, Jianfei
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 497 - 506
  • [29] Differentiable Meta-Learning Model for Few-Shot Semantic Segmentation
    Tian, Pinzhuo
    Wu, Zhangkai
    Qi, Lei
    Wang, Lei
    Shi, Yinghuan
    Gao, Yang
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 12087 - 12094
  • [30] Quaternion-Valued Correlation Learning for Few-Shot Semantic Segmentation
    Zheng, Zewen
    Huang, Guoheng
    Yuan, Xiaochen
    Pun, Chi-Man
    Liu, Hongrui
    Ling, Wing-Kuen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (05) : 2102 - 2115