SMALNet: Segment Anything Model Aided Lightweight Network for Infrared Image Segmentation

被引:1
|
作者
Ding, Kun [1 ]
Xiang, Shiming [1 ]
Pan, Chunhong [2 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence S, Beijing 100190, Peoples R China
[2] Chinese Acad Sci, Inst Automat, Res Ctr Aerosp Informat, Beijing 100190, Peoples R China
关键词
Infrared image; Semantic segmentation; Lightweight design; Segment anything model;
D O I
10.1016/j.infrared.2024.105540
中图分类号
TH7 [仪器、仪表];
学科分类号
0804 ; 080401 ; 081102 ;
摘要
Infrared based visual perception is important for night vision of autonomous vehicles, unmanned aerial vehicles (UAVs), etc. Semantic segmentation based on deep learning is one of the key techniques for infrared vision- based perception systems. Currently, most of the advanced methods are based on Transformers, which can achieve favorable segmentation accuracy. However, the high complexity of Transformers prevents them from meeting the real-time requirement of inference speed in resource constrained applications. In view of this, we suggest several lightweight designs that significantly reduce existing computational complexity. In order to maintain the segmentation accuracy, we further introduce the recent vision big model - Segment Anything Model (SAM) to supply auxiliary supervisory signals while training models. Based on these designs, we propose a lightweight segmentation network termed SMALNet (Segment Anything Model Aided Lightweight Network). Compared to existing state-of-the-art method, SegFormer, it reduces 64% FLOPs while maintaining the accuracy to a large extent on two commonly-used benchmarks. The proposed SMALNet can be used in various infrared based vision perception systems with limited hardware resources.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] MAET-SAM: Magneto-Acousto-Electrical Tomography segmentation network based on the segment anything model
    Bu, Shuaiyu
    Li, Yuanyuan
    Liu, Guoqiang
    Li, Yifan
    Mathematical Biosciences and Engineering, 2025, 22 (03) : 585 - 603
  • [42] IRSAM: Advancing Segment Anything Model for Infrared Small Target Detection
    Zhang, Mingjin
    Wang, Yuchun
    Guo, Jie
    Li, Yunsong
    Gao, Xinbo
    Zhang, Jing
    COMPUTER VISION - ECCV 2024, PT LXVII, 2025, 15125 : 233 - 249
  • [43] LANet: Lightweight Attention Network for Medical Image Segmentation
    Tang, Yi
    Pertsau, Dmitry
    Zhao, Di
    Kupryianava, Dziana
    Tatur, Mikhail
    INFORMATION TECHNOLOGIES AND THEIR APPLICATIONS, PT II, ITTA 2024, 2025, 2226 : 213 - 227
  • [44] A Novel Universal Image Forensics Localization Model Based on Image Noise and Segment Anything Model
    Su, Yang
    Tan, Shunquan
    Huang, Jiwu
    PROCEEDINGS OF THE 2024 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2024, 2024, : 149 - 158
  • [45] BiASAM: Bidirectional-Attention Guided Segment Anything Model for Very Few-Shot Medical Image Segmentation
    Zhou, Wei
    Guan, Guilin
    Cui, Wei
    Yi, Yugen
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 246 - 250
  • [46] SEMPNet: enhancing few-shot remote sensing image semantic segmentation through the integration of the segment anything model
    Ao, Wei
    Zheng, Shunyi
    Meng, Yan
    GISCIENCE & REMOTE SENSING, 2024, 61 (01)
  • [47] S-SAM: SVD-Based Fine-Tuning of Segment Anything Model for Medical Image Segmentation
    Paranjape, Jay N.
    Sikder, Shameema
    Vedula, S. Swaroop
    Patel, Vishal M.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XII, 2024, 15012 : 720 - 730
  • [48] Intraoperative Stereovision Cortical Surface Segmentation Using Fast Segment Anything Model
    Li, Chengpei
    Fan, Xiaoyao
    Duke, Ryan
    Chen, Kristen
    Evans, Linton T.
    Paulsen, Keith
    IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, MEDICAL IMAGING 2024, 2024, 12928
  • [49] Combining Segment Anything Model with Domain-Specific Knowledge for Semi-Supervised Learning in Medical Image Segmentation
    Zhang, Yizhe
    Zhou, Tao
    Wu, Ye
    Gu, Pengfei
    Wang, Shuo
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT XIV, 2025, 15044 : 343 - 357
  • [50] WebSAM-Adapter: Adapting Segment Anything Model for Web Page Segmentation
    Ren, Bowen
    Qian, Zefeng
    Sun, Yuchen
    Gao, Chao
    Zhang, Chongyang
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 439 - 454