THE Benchmark: Transferable Representation Learning for Monocular Height Estimation

被引:4
|
作者
Xiong, Zhitong [1 ]
Huang, Wei [1 ]
Hu, Jingtao [2 ]
Zhu, Xiao Xiang [1 ]
机构
[1] Tech Univ Munich TUM, Chair Data Sci Earth Observat, D-80333 Munich, Germany
[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
关键词
Benchmark; cross-dataset transfer; remote sensing; synthetic data; transfer learning; Transformer;
D O I
10.1109/TGRS.2023.3311764
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Generating 3-D city models rapidly is crucial for many applications. Monocular height estimation (MHE) is one of the most efficient and timely ways to obtain large-scale geometric information. However, existing works focus primarily on training and testing models using unbiased datasets, which does not align well with real-world applications. Therefore, we propose a new benchmark dataset to study the transferability of height estimation models in a cross-dataset setting. To this end, we first design and construct a large-scale benchmark dataset for cross-dataset transfer learning on the height estimation task. This benchmark dataset includes a newly proposed large-scale synthetic dataset, a newly collected real-world dataset, and four existing datasets from different cities. Next, a new experimental protocol, few-shot cross-dataset transfer, is designed. Furthermore, in this article, we propose a scale-deformable convolution (SDC) module to enhance the window-based Transformer for handling the scale-variation problem in the height estimation task. Experimental results have demonstrated the effectiveness of the proposed methods in traditional and cross-dataset transfer settings.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] SOFT WEIGHTED ORDINAL CLASSIFICATION FOR MONOCULAR HEIGHT ESTIMATION IN REMOTE SENSING IMAGE
    Feng, Yingchao
    Sun, Xian
    Diao, Wenhui
    Li, Jihao
    Xu, Tao
    Gao, Xin
    Fu, Kun
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 2750 - 2753
  • [32] CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer
    Mu, Yao
    Chen, Shoufa
    Ding, Mingyu
    Chen, Jianyu
    Chen, Runjian
    Luo, Ping
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [33] ADAPTIVE BINS FOR MONOCULAR HEIGHT ESTIMATION FROM SINGLE REMOTE SENSING IMAGES
    Chen, Sining
    Shi, Yilei
    Xiong, Zhitong
    Zhu, Xiao Xiang
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 7015 - 7018
  • [34] Removal of Redundant Information via Discrete Representation for Monocular Depth Estimation
    Du, Hao
    Liu, Xinzhi
    Cheng, Guoan
    Matsune, Ai
    Xu, Liangfeng
    Zhan, Shu
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (12)
  • [35] Monocular depth estimation based on deep learning: An overview
    ZHAO ChaoQiang
    SUN Qi Yu
    ZHANG ChongZhen
    TANG Yang
    QIAN Feng
    Science China(Technological Sciences), 2020, (09) : 1612 - 1627
  • [36] Learning Regularizer for Monocular Depth Estimation with Adversarial Guidance
    Shen, Guibao
    Zhang, Yingkui
    Li, Jialu
    Wei, Mingqiang
    Wang, Qiong
    Chen, Guangyong
    Heng, Pheng-Ann
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5222 - 5230
  • [37] Deep Learning Based Monocular Depth Estimation: A Survey
    Jiang J.-J.
    Li Z.-Y.
    Liu X.-M.
    Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (06): : 1276 - 1307
  • [38] Monocular Depth Estimation Based on Deep Learning:A Survey
    Ruan Xiaogang
    Yan Wenjing
    Huang Jing
    Guo Peiyuan
    Guo Wei
    2020 CHINESE AUTOMATION CONGRESS (CAC 2020), 2020, : 2436 - 2440
  • [39] Monocular depth estimation based on deep learning: An overview
    ZHAO ChaoQiang
    SUN Qi Yu
    ZHANG ChongZhen
    TANG Yang
    QIAN Feng
    Science China(Technological Sciences), 2020, 63 (09) : 1612 - 1627
  • [40] Learning monocular depth estimation with unsupervised trinocular assumptions
    Poggi, Matteo
    Tosi, Fabio
    Mattoccia, Stefano
    2018 INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2018, : 324 - 333