Frequency-Adaptive Dilated Convolution for Semantic Segmentation

被引:7
|
作者
Chen, Linwei [1 ]
Gu, Lin [2 ,3 ]
Zheng, Dezhi [1 ]
Fu, Ying [1 ]
机构
[1] Beijing Inst Technol, Beijing, Peoples R China
[2] RIKEN, Wako, Saitama, Japan
[3] Univ Tokyo, Tokyo, Japan
基金
中国国家自然科学基金;
关键词
NETWORK;
D O I
10.1109/CVPR52733.2024.00328
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dilated convolution, which expands the receptive field by inserting gaps between its consecutive elements, is widely employed in computer vision. In this study, we propose three strategies to improve individual phases of dilated convolution from the perspective of spectrum analysis. Departing from the conventional practice of fixing a global dilation rate as a hyperparameter, we introduce Frequency-Adaptive Dilated Convolution (FADC), which dynamically adjusts dilation rates spatially based on local frequency components. Subsequently, we design two plug-in modules to directly enhance effective bandwidth and receptive field size. The Adaptive Kernel (AdaKern) module decomposes convolution weights into low-frequency and high-frequency components, dynamically adjusting the ratio between these components on a per-channel basis. By increasing the high-frequency part of convolution weights, AdaKern captures more high-frequency components, thereby improving effective bandwidth. The Frequency Selection (FreqSelect) module optimally balances high- and low-frequency components in feature representations through spatially variant reweighting. It suppresses high frequencies in the background to encourage FADC to learn a larger dilation, thereby increasing the receptive field for an expanded scope. Extensive experiments on segmentation and object detection consistently validate the efficacy of our approach. The code is made publicly available at https://github.com/ying-fu/FADC.
引用
收藏
页码:3414 / 3425
页数:12
相关论文
共 50 条
  • [1] MCDCNet: Mask Classification Combined with Adaptive Dilated Convolution for Image Semantic Segmentation
    Wei, Geng
    Wang, Junbo
    Shi, Bingxian
    Zhu, Xiaolin
    Cao, Bo
    Liu, Tong
    APPLIED SCIENCES-BASEL, 2025, 15 (04):
  • [2] Gaussian Dilated Convolution for Semantic Image Segmentation
    Shen, Falong
    Zeng, Gang
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 324 - 334
  • [3] Transformable Dilated Convolution by Distance for LiDAR Semantic Segmentation
    Lee, Jae-Seol
    Park, Tae-Hyoung
    IEEE ACCESS, 2022, 10 : 125102 - 125111
  • [4] Frequency Dynamic Convolution: Frequency-Adaptive Pattern Recognition for Sound Event Detection
    Nam, Hyeonuk
    Kim, Seong-Hu
    Ko, Byeong-Yun
    Park, Yong-Hwa
    INTERSPEECH 2022, 2022, : 2763 - 2767
  • [5] MULTIPLE SKIP CONNECTIONS OF DILATED CONVOLUTION NETWORK FOR SEMANTIC SEGMENTATION
    Yamashita, Takayoshi
    Furukawa, Hironori
    Fujiyoshi, Hironobu
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 1593 - 1597
  • [6] End-to-end dilated convolution network for document image semantic segmentation
    Xu Can-hui
    Shi Cao
    Chen Yi-nong
    JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2021, 28 (06) : 1765 - 1774
  • [7] DENSE CONVOLUTION FOR SEMANTIC SEGMENTATION
    Han, Chaoyi
    Tao, Xiaoming
    Duan, Yiping
    Lu, Jianhua
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2222 - 2226
  • [8] Understanding Convolution for Semantic Segmentation
    Wang, Panqu
    Chen, Pengfei
    Yuan, Ye
    Liu, Ding
    Huang, Zehua
    Hou, Xiaodi
    Cottrell, Garrison
    2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1451 - 1460
  • [9] CONTROL OF FREQUENCY-ADAPTIVE HEART PACEMAKERS
    LAMPADIUS, M
    WIRTZFELD, A
    STANGL, K
    THIEL, H
    ZEITSCHRIFT FUR KARDIOLOGIE, 1986, 75 : 61 - 61
  • [10] Semantic Segmentation by Multi-Scale Feature Extraction Based on Grouped Dilated Convolution Module
    Kim, Dong Seop
    Kim, Yu Hwan
    Park, Kang Ryoung
    MATHEMATICS, 2021, 9 (09)