DECOUPLE THE HIGH-FREQUENCY AND LOW-FREQUENCY INFORMATION OF IMAGES FOR SEMANTIC SEGMENTATION

Cited by: 10
Authors
Shan, Lianlei [1]
Li, Xiaobin [1]
Wang, Weiqiang [1]
Affiliations
[1] Chinese Acad Sci, China Comp Vis & Multimedia Technol Lab, Univ Chinese Acad Sci, Beijing, Peoples R China
Keywords
Semantic segmentation; decouple images; Fourier transform; low-frequency and high-frequency components; multi-branch network
DOI
10.1109/ICASSP39728.2021.9414019
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
As a special kind of signal processing technology, image processing has developed rapidly since the advent of convolutional neural networks (CNNs). Current semantic segmentation methods are all based on CNNs and ignore the advantages of traditional image processing techniques. We combine the two so that they reinforce each other. We assume that the high-frequency component of an image represents the edges and the low-frequency component represents the body of objects. Based on this assumption, we use the Fourier transform to obtain the high- and low-frequency components of images. We then design a multi-branch parallel network in which the high- and low-frequency components are fed into two separate branches to obtain the edge and body features, respectively. Finally, the two sets of features are fused through a deep feature fusion step to produce the final output. In this way, edge information and body information are decoupled on the original images and extracted separately, which not only ensures the consistency of information within objects but also strengthens supervision of the edges, the most error-prone area in semantic segmentation. Results on Cityscapes and KITTI demonstrate the effectiveness of our work.
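The Fourier-based decoupling described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the filter, so the circular cutoff mask, the function name `split_frequencies`, and the `radius` parameter are all assumptions chosen for illustration, and the example operates on a single grayscale channel.

```python
import numpy as np


def split_frequencies(image, radius):
    """Split a 2-D grayscale image into low- and high-frequency parts.

    Assumption: an ideal circular low-pass mask with a hypothetical
    cutoff `radius` (in frequency bins); the paper may use another filter.
    """
    h, w = image.shape
    # Centre the spectrum so the zero-frequency term sits at (h // 2, w // 2).
    spectrum = np.fft.fftshift(np.fft.fft2(image))

    # Distance of every frequency bin from the spectrum centre.
    yy, xx = np.ogrid[:h, :w]
    dist = np.sqrt((yy - h // 2) ** 2 + (xx - w // 2) ** 2)
    low_mask = dist <= radius

    # Inverse transforms of the masked spectra; keep the real part.
    low = np.fft.ifft2(np.fft.ifftshift(spectrum * low_mask)).real
    high = np.fft.ifft2(np.fft.ifftshift(spectrum * ~low_mask)).real
    return low, high


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    img = rng.random((64, 64))
    low, high = split_frequencies(img, radius=8)
    # The two components sum back to the original image.
    print(np.allclose(low + high, img))
```

Because the two masks are complementary, the low- and high-frequency images sum back to the input. Under the abstract's assumption, the low-frequency image would feed the body branch and the high-frequency image the edge branch.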
Pages: 1805-1809
Number of pages: 5