共 3 条
Global Streetscapes - A comprehensive dataset of 10 million street-level images across 688 cities for urban science and analytics
被引:15
|作者:
Hou, Yujun
[1
]
Quintana, Matias
[2
]
Khomiakov, Maxim
[1
,3
]
Yap, Winston
[1
]
Ouyang, Jiani
[1
,4
]
Ito, Koichi
[1
]
Wang, Zeyu
[1
]
Zhao, Tianhong
[5
]
Biljecki, Filip
[1
,6
]
机构:
[1] Natl Univ Singapore, Dept Architecture, 4 Architecture Dr, Singapore 117566, Singapore
[2] Singapore ETH Ctr, Future Cities Lab Global Programme, CREATE Campus,1 Create Way,06-01 CREATE Tower, Singapore 138602, Singapore
[3] Tech Univ Denmark, Dept Appl Math & Comp Sci, Richard Petersens Plads Bygning 324, DK-2800 Lyngby, Denmark
[4] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Luoyu Rd 129, Wuhan 430079, Peoples R China
[5] Shenzhen Technol Univ, Coll Big Data & Internet, 3002 Lantian Rd, Shenzhen 518118, Peoples R China
[6] Natl Univ Singapore, Dept Real Estate, 15 Kent Ridge Dr, Singapore 119245, Singapore
关键词:
Urban analytics;
Volunteered geographic information;
Data fusion;
GeoAI;
Machine learning;
Spatial data infrastructure;
MAPILLARY;
D O I:
10.1016/j.isprsjprs.2024.06.023
中图分类号:
P9 [自然地理学];
学科分类号:
0705 ;
070501 ;
摘要:
Street view imagery (SVI) is instrumental for sensing urban environments, benefitting numerous domains such as urban morphology, health, greenery, and accessibility. Billions of images worldwide have been made available by commercial services such as Google Street View and crowdsourcing services such as Mapillary and KartaView where anyone from anywhere can upload imagery while moving. However, while the data tend to be plentiful, have high coverage and quality, and are used to derive rich insights, they remain simple and limited in metadata as characteristics such as weather, quality, and lighting conditions remain unknown, making it difficult to evaluate the suitability of the images for specific analyses. We introduce Global Streetscapes - a dataset of 10 million crowdsourced and free-to-use SVIs sampled from 688 cities across 210 countries and territories, enriched with more than 300 camera, geographical, temporal, contextual, semantic, and perceptual attributes. The cities included are well balanced and diverse, and are home to about 10% of the world's population. Deep learning models are trained on a subset of manually labelled images for eight visual-contextual attributes pertaining to the usability of SVI - panoramic status, lighting condition, view direction, weather, platform, quality, presence of glare and reflections, achieving accuracy ranging from 68.3% to 99.9%, and used to automatically label the entire dataset. Thanks to its scale and pre-computed standard semantic information, the data can be readily used to benefit existing use cases and to unlock new applications, including multi-city comparative studies and longitudinal analyses, as affirmed by a couple of use cases in the paper. Moreover, the automated processes and open-source code facilitate the expansion and updates of the dataset and encourage users to create their own datasets. With the rich manual annotations, some of which are provided for the first time, and diverse conditions present in the images, the dataset also facilitates assessing the heterogeneous properties of crowdsourced SVIs and provides a benchmark for evaluating future computer vision models. We make the Global Streetscapes dataset and the code to reproduce and use it publicly available in https://github.com/ualsg/global-streetscapes.
引用
收藏
页码:216 / 238
页数:23
相关论文