OmniCity: Omnipotent City Understanding with Multi-level and Multi-view Images

被引:6
|
作者
Li, Weijia [1 ]
Lai, Yawen [2 ]
Xu, Linning [3 ]
Xiangli, Yuanbo [3 ]
Yu, Jinhua [1 ]
He, Conghui [2 ,4 ]
Xia, Gui-Song [5 ]
Lin, Dahua [3 ,4 ]
机构
[1] Sun Yat Sen Univ, Guangzhou, Peoples R China
[2] SenseTime Res, Hong Kong, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
[4] Shanghai Artificial Intelligence Lab, Shanghai, Peoples R China
[5] Wuhan Univ, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.01669
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents OmniCity, a new dataset for omnipotent city understanding from multi-level and multi-view images. More precisely, OmniCity contains multi-view satellite images as well as street-level panorama and monoview images, constituting over 100K pixel-wise annotated images that are well-aligned and collected from 25K geolocations in New York City. To alleviate the substantial pixel-wise annotation efforts, we propose an efficient street-view image annotation pipeline that leverages the existing label maps of satellite view and the transformation relations between different views (satellite, panorama, and mono-view). With the new OmniCity dataset, we provide benchmarks for a variety of tasks including building footprint extraction, height estimation, and building plane/instance/fine-grained segmentation. Compared with existing multi-level and multi-view benchmarks, OmniCity contains a larger number of images with richer annotation types and more views, provides more benchmark results of state-of-the-art models, and introduces a new task for fine-grained building instance segmentation on street-level panorama images. Moreover, OmniCity provides new problem settings for existing tasks, such as cross-view image matching, synthesis, segmentation, detection, etc., and facilitates the developing of new methods for large-scale city understanding, reconstruction, and simulation. The OmniCity dataset as well as the benchmarks will be released at https://city-super.github.io/omnicity/.
引用
收藏
页码:17397 / 17407
页数:11
相关论文
共 50 条
  • [31] Content-Based Video Relevance Prediction with Multi-view Multi-level Deep Interest Network
    Chen, Zeyuan
    Xu, Kai
    Zhang, Wei
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2607 - 2611
  • [32] MULTI-VIEW STEREO USING MULTI-LUMINANCE IMAGES
    Feng, Xiaoduan
    Liu, Yebin
    Dai, Qionghai
    2009 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2009, : 109 - +
  • [33] JOINT RECONSTRUCTION OF COMPRESSED MULTI-VIEW IMAGES
    Chen, Xu
    Frossard, Pascal
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1005 - +
  • [34] Multi-view Subspace Clustering for Face Images
    Zhang, Xin
    Dinh Phung
    Venkatesh, Svetha
    Duc-Son Pham
    Liu, Wanquan
    2015 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2015, : 555 - 561
  • [35] An Improved Clustering Method for Multi-view Images
    Dong, Yang
    Fan, Dazhao
    Ma, Qiuhe
    Ji, Song
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 134 - 144
  • [36] ALIGNMENT OF UNCALIBRATED IMAGES FOR MULTI-VIEW CLASSIFICATION
    Arik, Sercan Omer
    Vural, Elif
    Frossard, Pascal
    2011 18TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2011,
  • [37] Efficient representation and compression of multi-view images
    Park, JI
    Yang, KH
    Iwadate, Y
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2000, E83D (12) : 2186 - 2188
  • [38] Generating Virtual Images for Multi-view Video
    Zhang Lijun
    Li Yue
    Zhu Qiuyu
    Li Mingqi
    CHINESE JOURNAL OF ELECTRONICS, 2017, 26 (04) : 810 - 813
  • [39] Highlight Removal of Multi-View Facial Images
    Su, Tong
    Zhou, Yu
    Yu, Yao
    Du, Sidan
    SENSORS, 2022, 22 (17)
  • [40] Anthropometric Measurements from Multi-View Images
    Li, Jie
    Sun, Mingui
    Chen, Hsin-Chen
    Li, Zhaoxin
    Jia, Wenyan
    2012 38TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE (NEBEC), 2012, : 426 - +