THE Benchmark: Transferable Representation Learning for Monocular Height Estimation

被引:4
|
作者
Xiong, Zhitong [1 ]
Huang, Wei [1 ]
Hu, Jingtao [2 ]
Zhu, Xiao Xiang [1 ]
机构
[1] Tech Univ Munich TUM, Chair Data Sci Earth Observat, D-80333 Munich, Germany
[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
关键词
Benchmark; cross-dataset transfer; remote sensing; synthetic data; transfer learning; Transformer;
D O I
10.1109/TGRS.2023.3311764
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Generating 3-D city models rapidly is crucial for many applications. Monocular height estimation (MHE) is one of the most efficient and timely ways to obtain large-scale geometric information. However, existing works focus primarily on training and testing models using unbiased datasets, which does not align well with real-world applications. Therefore, we propose a new benchmark dataset to study the transferability of height estimation models in a cross-dataset setting. To this end, we first design and construct a large-scale benchmark dataset for cross-dataset transfer learning on the height estimation task. This benchmark dataset includes a newly proposed large-scale synthetic dataset, a newly collected real-world dataset, and four existing datasets from different cities. Next, a new experimental protocol, few-shot cross-dataset transfer, is designed. Furthermore, in this article, we propose a scale-deformable convolution (SDC) module to enhance the window-based Transformer for handling the scale-variation problem in the height estimation task. Experimental results have demonstrated the effectiveness of the proposed methods in traditional and cross-dataset transfer settings.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] URL: A Representation Learning Benchmark for Transferable Uncertainty Estimates
    Kirchhof, Michael
    Mucsanyi, Balint
    Oh, Seong Joon
    Kasneci, Enkelejda
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] HeightNet: Monocular Object Height Estimation
    Kim, In Su
    Kim, Hyeongbok
    Lee, Seungwon
    Jung, Soon Ki
    ELECTRONICS, 2023, 12 (02)
  • [3] Deep Learning-Based Monocular Estimation of Distance and Height for Edge Devices
    Gasienica-Jozkowy, Jan
    Cyganek, Boguslaw
    Knapik, Mateusz
    Glogowski, Szymon
    Przebinda, Lukasz
    INFORMATION, 2024, 15 (08)
  • [4] Learning Transferable Policies for Monocular Reactive MAV Control
    Daftry, Shreyansh
    Bagnell, J. Andrew
    Hebert, Martial
    2016 INTERNATIONAL SYMPOSIUM ON EXPERIMENTAL ROBOTICS, 2017, 1 : 3 - 11
  • [5] Transferable Representation Learning with Deep Adaptation Networks
    Long, Mingsheng
    Cao, Yue
    Cao, Zhangjie
    Wang, Jianmin
    Jordan, Michael
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (12) : 3071 - 3085
  • [6] Delivering Meaningful Representation for Monocular Depth Estimation
    Kim, Doyeon
    Joo, Donggyu
    Kim, Junmo
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7790 - 7795
  • [7] Temporal Representation Learning on Monocular Videos for 3D Human Pose Estimation
    Honari, Sina
    Constantin, Victor
    Rhodin, Helge
    Salzmann, Mathieu
    Fua, Pascal
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (05) : 6415 - 6427
  • [8] Monocular Height Estimation by Chronological Correction of Road Unevenness
    Kaneko, Alex M.
    Yamamoto, Kenjiro
    2016 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2016, : 31 - 38
  • [9] Towards Robust Monocular Depth Estimation: A New Baseline and Benchmark
    Xian, Ke
    Cao, Zhiguo
    Shen, Chunhua
    Lin, Guosheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (07) : 2401 - 2419
  • [10] Pseudo View Representation Learning for Monocular RGB-D Human Pose and Shape Estimation
    Zhu, Armando
    Li, Jiefeng
    Lu, Cewu
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 712 - 716