Geometry-Aware Segmentation of Remote Sensing Images via Joint Height Estimation

被引:28
|
作者
Li, Xiang [1 ,2 ]
Wen, Congcong [3 ]
Wang, Lingjing [1 ,2 ]
Fang, Yi [1 ,2 ]
机构
[1] NYU Tandon, Multimedia & Visual Comp Lab, Abu Dhabi, U Arab Emirates
[2] NYU Tandon, Multimedia & Visual Comp Lab, New York, NY 11201 USA
[3] NYU, Dept Elect & Comp Engn, New York, NY 11201 USA
关键词
Semantics; Data models; Image segmentation; Labeling; Decoding; Estimation; Convolution; Feature fusion; geometry-aware convolution (GAC); height estimation; semantic segmentation; SEMANTIC SEGMENTATION; AERIAL;
D O I
10.1109/LGRS.2021.3058168
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recent studies have shown the benefits of using additional elevation data [e.g., digital surface model (DSM) or normalized DSM (nDSM)] for enhancing the performance of the semantic labeling of aerial images. However, previous methods mostly adopt 3-D elevation information as additional inputs, while, in many real-world applications, one does not have the corresponding DSM images at hand, and the spatial resolution of acquired DSM images usually does not match the aerial images. To alleviate this data constraint and also take advantage of 3-D elevation information, in this letter, a geometry-aware segmentation model is introduced to achieve accurate semantic labeling of aerial images via joint height estimation. Instead of using a single-stream encoder-decoder network for semantic labeling, we design a separate decoder branch to predict the height map and use the DSM images as side supervision to train this newly designed decoder branch. With the newly designed decoder branch, our model can distill the 3-D geometric features from 2-D appearance features under the supervision of ground-truth DSM images. Moreover, we develop a new geometry-aware convolution module that fuses the 3-D geometric features from the height decoder branch and the 2-D contextual features from the semantic segmentation branch. The fused feature embeddings can produce geometry-aware segmentation maps with enhanced performance. Our model is trained with DSM images as side supervision, while, in the inference stage, it does not require DSM data and directly predicts the semantic labels. Experiments on International Society for Photogrammetry and Remote Sensing (ISPRS) Vaihingen and Potsdam data sets demonstrate the effectiveness of the proposed method for the semantic segmentation of aerial images.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Geometry-Aware Deep Video Deblurring via Recurrent Feature Refinement
    Kim, Taeoh
    Lee, Sangyoun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1176 - 1189
  • [32] A Geometry-Aware Registration Algorithm for Multiview High-Resolution SAR Images
    Xiang, Yuming
    Jiao, Niangang
    Liu, Rui
    Wang, Feng
    You, Hongjian
    Qiu, Xiaolan
    Fu, Kun
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [33] Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE
    Liu, Jiaxu
    Yi, Xinping
    Wu, Sihao
    Yin, Xiangyu
    Zhang, Tianle
    Huang, Xiaowei
    Jin, Shi
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT III, ECML PKDD 2024, 2024, 14943 : 231 - 249
  • [34] TransPose: 6D object pose estimation with geometry-aware Transformer
    Lin, Xiao
    Wang, Deming
    Zhou, Guangliang
    Liu, Chengju
    Chen, Qijun
    NEUROCOMPUTING, 2024, 589
  • [35] SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation
    Li, Xuewei
    Wu, Tao
    Qi, Zhongang
    Wang, Gaoang
    Shan, Ying
    Li, Xi
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1125 - 1133
  • [36] CFM-UNet:A Joint CNN and Transformer Network via Cross Feature Modulation for Remote Sensing Images Segmentation
    Min WANG
    Peidong WANG
    JournalofGeodesyandGeoinformationScience, 2023, 6 (04) : 40 - 47
  • [37] Estimation of the height of monocular satellite remote sensing images based on semantic information guidance
    He, Meiyang
    Li, Dongjie
    Ma, Lei
    Wan, Ling
    REMOTE SENSING LETTERS, 2025, 16 (05) : 526 - 536
  • [38] GOPE: Geometry-Aware Optimal Viewpoint Path Estimation Using a Monocular Camera
    Kim, Nuri
    Choi, Yunho
    Kang, Minjae
    Oh, Songhwai
    2020 20TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2020, : 1062 - 1067
  • [39] TREE CANOPY HEIGHT ESTIMATION AND ACCURACY ANALYSIS BASED ON UAV REMOTE SENSING IMAGES
    Hao, J.
    Fang, Z.
    Wu, B.
    Liu, S.
    Ma, Y.
    Pan, Y.
    XXIV ISPRS CONGRESS IMAGING TODAY, FORESEEING TOMORROW, COMMISSION II, 2022, 43-B2 : 129 - 134
  • [40] Unsupervised Geometry-Aware Representation for 3D Human Pose Estimation
    Rhodin, Helge
    Salzmann, Mathieu
    Fua, Pascal
    COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 765 - 782