ORSI Salient Object Detection via Cross-Scale Interaction and Enlarged Receptive Field

被引:8
|
作者
Zheng, Jianwei [1 ]
Quan, Yueqian [1 ]
Zheng, Hang [1 ]
Wang, Yibin [1 ]
Pan, Xiang [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
基金
中国国家自然科学基金;
关键词
Power capacitors; Feature extraction; Semantics; Optical sensors; Shape; Optical imaging; Computer architecture; Cross-scale interaction; receptive field; remote sensing image (RSI); salient object detection (SOD); NETWORK;
D O I
10.1109/LGRS.2023.3249764
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Due to the diversity of scales and shapes, the uncertainty of object position, and the complexity of edge details, the recent merging problem of salient object detection in optical remote sensing image (RSI-SOD) is a considerably challenging topic. To cope with the challenges, we propose a new cross-scale interaction network (CIFNet) equipped with the enlarged receptive field, which mainly contains three modules in an encoder-decoder architecture, including a furcate skip connection module (FSCM), a global leading attention module, and an expansion-integration module (EIM). First, the FSCM uses dilated convolutions to enlarge the receptive field and furcate skip connections to capture more multiscale contextual information, both of which facilitate the adaptability of the model to different sizes, shapes, and quantities of the target objects. Second, on the low-resolution branch, a global leading attention module (GLM) locates the potentially significant object positions in the feature map from a global semantic perspective. Finally, through an attention-guided cascade structure, the EIM seeks more delicate characteristics by refining the features in a coarse-to-fine fashion. Extensive experiments are conducted on two RSI-SOD datasets, from which superior results can be achieved by our CIFNet, outperforming the other state-of-the-art methods. Compared with the second-best method, the performance gain of our method reaches 3.45% on mean absolute error (MAE) and 1.38% on F-beta(adp). Notably, the proposed CIF-Net runs with 40.40-M parameters, 14.8-GFLOPs computational complexity, and 58-frames/s inference speed, which guarantees high efficiency.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] MULTI-SCALE GRAPH CONVOLUTIONAL INTERACTION NETWORK FOR SALIENT OBJECT DETECTION
    Che, Wenqi
    Sun, Luoyi
    Xie, Zhifeng
    Ding, Youdong
    Han, Kaili
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 679 - 683
  • [32] MPI: Multi-receptive and parallel integration for salient object detection
    Sun, Han
    Cen, Jun
    Liu, Ningzhong
    Liang, Dong
    Zhou, Huiyu
    IET IMAGE PROCESSING, 2021, 15 (13) : 3281 - 3291
  • [33] Video Salient Object Detection via Multiple Time-scale Analysis
    Chen, Yuhuan
    Huang, Limin
    Zou, Wenbin
    Li, Xia
    Qiu, Guoping
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2184 - 2189
  • [34] Cross-Modal and Cross-Level Attention Interaction Network for Salient Object Detection
    Wang F.
    Su Y.
    Wang R.
    Sun J.
    Sun F.
    Li H.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (06): : 2907 - 2920
  • [35] VIDEO SALIENT OBJECT DETECTION VIA CROSS-FRAME CELLULAR AUTOMATA
    Guo, Jingfan
    Ren, Tongwei
    Huang, Lei
    Liu, Xingyu
    Cheng, Ming-Ming
    Wu, Gangshan
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 325 - 330
  • [36] TS-BiT: Two-Stage Binary Transformer for ORSI Salient Object Detection
    Zhang, Jinfeng
    Liu, Tianpeng
    Zhang, Jiehua
    Liu, Li
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2025, 22
  • [37] Using hidden scale for salient object detection
    Chalmond, Bernard
    Francesconi, Benjamin
    Herbin, Stephane
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (09) : 2644 - 2656
  • [38] RGBD Salient Object Detection via Disentangled Cross-Modal Fusion
    Chen, Hao
    Deng, Yongjian
    Li, Youfu
    Hung, Tzu-Yi
    Lin, Guosheng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 8407 - 8416
  • [39] Cross-Scale Haptic Object Recognition for Intelligent Robot
    Li, Ang
    Lu, Wei
    Wang, Haoyu
    Wei, Xin
    2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC, 2023, : 205 - 209
  • [40] Learnable Cross-Scale Sparse Attention Guided Feature Fusion for UAV Object Detection
    Zuo, Xin
    Qi, Chenhui
    Chen, Yifei
    Shen, Jifeng
    Fan, Heng
    Yang, Wankou
    IEEE ACCESS, 2024, 12 : 114212 - 114226