OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images

被引:6
|
作者
Li, Weijia [1 ]
Lai, Yawen [2 ]
Xu, Linning [3 ]
Xiangli, Yuanbo [3 ]
Yu, Jinhua [1 ]
He, Conghui [2 ,4 ]
Xia, Gui-Song [5 ]
Lin, Dahua [3 ,4 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
[5] Wuhan Univ, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents OmniCity, a new dataset for omnipotent city understanding from multi-level and multi-view images. More precisely, OmniCity contains multi-view satellite images as well as street-level panorama and monoview images, constituting over 100K pixel-wise annotated images that are well-aligned and collected from 25K geolocations in New York City. To alleviate the substantial pixel-wise annotation efforts, we propose an efficient street-view image annotation pipeline that leverages the existing label maps of satellite view and the transformation relations between different views (satellite, panorama, and mono-view). With the new OmniCity dataset, we provide benchmarks for a variety of tasks including building footprint extraction, height estimation, and building plane/instance/fine-grained segmentation. Compared with existing multi-level and multi-view benchmarks, OmniCity contains a larger number of images with richer annotation types and more views, provides more benchmark results of state-of-the-art models, and introduces a new task for fine-grained building instance segmentation on street-level panorama images. Moreover, OmniCity provides new problem settings for existing tasks, such as cross-view image matching, synthesis, segmentation, detection, etc., and facilitates the developing of new methods for large-scale city understanding, reconstruction, and simulation. The OmniCity dataset as well as the benchmarks will be released at https://city-super.github.io/omnicity/.
引用
收藏
页码:17397 / 17407
页数:11
相关论文
共 50 条
  • [1] Multi-view Fusion for Multi-level Robotic Scene Understanding
    Lin, Yunzhi
    Tremblay, Jonathan
    Tyree, Stephen
    Vela, Patricio A.
    Birchfield, Stan
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 6817 - 6824
  • [2] MCoCo: Multi-level Consistency Collaborative multi-view clustering
    Zhou, Yiyang
    Zheng, Qinghai
    Wang, Yifei
    Yan, Wenbiao
    Shi, Pengcheng
    Zhu, Jihua
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [3] Towards the Integration of Multi-Level and Multi-View Modelling for Interoperability
    Fu, Yuhong
    Grossmann, Georg
    Kaur, Karamjit
    Selway, Matt
    Stumptner, Markus
    2023 ACM/IEEE INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS COMPANION, MODELS-C, 2023, : 679 - 688
  • [4] Multi-level photo quality assessment with multi-view features
    Dong, Zhe
    Tian, Xinmei
    NEUROCOMPUTING, 2015, 168 : 308 - 319
  • [5] Multi-level and multi-view description framework of military requirements
    Deng P.-H.
    Bi Y.-M.
    Zhang O.-Y.
    Tian Y.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2010, 32 (11): : 2380 - 2384
  • [6] Multi-level Feature Learning for Contrastive Multi-view Clustering
    Xu, Jie
    Tang, Huayi
    Ren, Yazhou
    Peng, Liang
    Zhu, Xiaofeng
    He, Lifang
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 16030 - 16039
  • [7] Towards Deep Reactions in Multi-Level, Multi-View Modeling
    Weber, Thomas
    Ojha, Monalisha
    Sadeghi, Mohammad
    Konig, Lars
    Armbruster, Martin
    Lange, Arne
    Burger, Erik
    Atkinson, Colin
    ACM/IEEE 27TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS: COMPANION PROCEEDINGS, MODELS 2024, 2024, : 760 - 769
  • [8] Multi-level correlation learning for multi-view unsupervised feature selection
    Wu, Jian-Sheng
    Gong, Jun-Xiao
    Liu, Jing-Xin
    Min, Weidong
    KNOWLEDGE-BASED SYSTEMS, 2023, 281
  • [9] Multi-layer multi-level comprehensive learning for deep multi-view clustering
    Chen, Zhe
    Wu, Xiao-Jun
    Xu, Tianyang
    Li, Hui
    Kittler, Josef
    INFORMATION FUSION, 2025, 116
  • [10] Multi-view and Multi-level network for fault diagnosis accommodating feature transferability
    Lu, Na
    Cui, Zhiyan
    Hu, Huiyang
    Yin, Tao
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213