Towards general-purpose representation learning of polygonal geometries

被引：18

作者：

Mai, Gengchen ^{[1
,2
,3
,4
]}

Jiang, Chiyu ^{[5
]}

Sun, Weiwei ^{[6
]}

Zhu, Rui ^{[3
,4
,7
]}

Xuan, Yao ^{[8
]}

Cai, Ling ^{[3
,4
]}

Janowicz, Krzysztof ^{[3
,4
,9
]}

Ermon, Stefano ^{[2
,10
]}

Lao, Ni ^{[11
]}

机构：

[1] Univ Georgia, Dept Geog, Spatially Explicit Artificial Intelligence Lab, Athens, GA 30602 USA

[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[3] Univ Calif Santa Barbara, STKO Lab, Santa Barbara, CA 93106 USA

[4] Univ Calif Santa Barbara, Ctr Spatial Studies, Santa Barbara, CA 93106 USA

[5] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA

[6] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada

[7] Univ Bristol, Sch Geog Sci, Bristol BS8 1TH, Avon, England

[8] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA

[9] Univ Vienna, Dept Geog & Reg Res, A-1040 Vienna, Austria

[10] Chan Zuckerberg Biohub, San Francisco, CA 94158 USA

[11] Google, Mountain View, CA 94043 USA

来源：

GEOINFORMATICA | 2023年 / 27卷 / 02期

基金：

美国国家科学基金会;

关键词：

Polygon encoding; Non-uniform fourier transformation; Shape classification; Spatial relation prediction; Spatially explicit artificial intelligence; OBJECT; APPEARANCE; PATTERNS; NETWORK;

D O I：

10.1007/s10707-022-00481-2

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Neural network representation learning for spatial data (e.g., points, polylines, polygons, and networks) is a common need for geographic artificial intelligence (GeoAI) problems. In recent years, many advancements have been made in representation learning for points, polylines, and networks, whereas little progress has been made for polygons, especially complex polygonal geometries. In this work, we focus on developing a general-purpose polygon encoding model, which can encode a polygonal geometry (with or without holes, single or multipolygons) into an embedding space. The result embeddings can be leveraged directly (or finetuned) for downstream tasks such as shape classification, spatial relation prediction, building pattern classification, cartographic building generalization, and so on. To achieve model generalizability guarantees, we identify a few desirable properties that the encoder should satisfy: loop origin invariance, trivial vertex invariance, part permutation invariance, and topology awareness. We explore two different designs for the encoder: one derives all representations in the spatial domain and can naturally capture local structures of polygons; the other leverages spectral domain representations and can easily capture global structures of polygons. For the spatial domain approach we propose ResNet1D, a 1D CNN-based polygon encoder, which uses circular padding to achieve loop origin invariance on simple polygons. For the spectral domain approach we develop NUFTspec based on Non-Uniform Fourier Transformation (NUFT), which naturally satisfies all the desired properties. We conduct experiments on two different tasks: 1) polygon shape classification based on the commonly used MNIST dataset; 2) polygon-based spatial relation prediction based on two new datasets (DBSR-46K and DBSR-cplx46K) constructed from OpenStreetMap and DBpedia. Our results show that NUFTspec and ResNet1D outperform multiple existing baselines with significant margins. While ResNet1D suffers from model performance degradation after shape-invariance geometry modifications, NUFTspec is very robust to these modifications due to the nature of the NUFT representation. NUFTspec is able to jointly consider all parts of a multipolygon and their spatial relations during prediction while ResNet1D can recognize the shape details which are sometimes important for classification. This result points to a promising research direction of combining spatial and spectral representations.

引用

页码：289 / 340

页数：52

共 50 条

[1] Towards general-purpose representation learning of polygonal geometries
Gengchen Mai
Chiyu Jiang
Weiwei Sun
Rui Zhu
Yao Xuan
Ling Cai
Krzysztof Janowicz
Stefano Ermon
Ni Lao
GeoInformatica, 2023, 27 : 289 - 340
[2] BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
Niizumi, Daisuke
Takeuchi, Daiki
Ohishi, Yasunori
Harada, Noboru
Kashino, Kunio
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[3] Bioinspired framework for general-purpose learning
de Toledo, SA
Barreiro, JM
FOUNDATIONS AND TOOLS FOR NEURAL MODELING, PROCEEDINGS, VOL I, 1999, 1606 : 507 - 516
[4] Synthetic Sensors: Towards General-Purpose Sensing
Laput, Gierad
Zhang, Yang
Harrison, Chris
PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17), 2017, : 3986 - 3999
[5] Towards General-Purpose Neural Network Computing
Eldridge, Schuyler
Appavoo, Jonathan
Joshi, Ajay
Waterland, Amos
Seltzer, Margo
2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT), 2015, : 99 - 112
[6] PolyMesher: a general-purpose mesh generator for polygonal elements written in Matlab
Talischi, Cameron
Paulino, Glaucio H.
Pereira, Anderson
Menezes, Ivan F. M.
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2012, 45 (03) : 309 - 328
[7] PolyMesher: a general-purpose mesh generator for polygonal elements written in Matlab
Cameron Talischi
Glaucio H. Paulino
Anderson Pereira
Ivan F. M. Menezes
Structural and Multidisciplinary Optimization, 2012, 45 : 309 - 328
[8] Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
Niizumi, Daisuke
Takeuchi, Daiki
Ohishi, Yasunori
Harada, Noboru
Kashino, Kunio
HEAR: HOLISTIC EVALUATION OF AUDIO REPRESENTATIONS, VOL 166, 2021, 166 : 1 - 24
[9] CONTRASTIVE LEARNING OF GENERAL-PURPOSE AUDIO REPRESENTATIONS
Saeed, Aaqib
Grangier, David
Zeghidour, Neil
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3875 - 3879
[10] LEARNING ON VLSI - A GENERAL-PURPOSE DIGITAL NEUROCHIP
DURANTON, M
SIRAT, JA
PHILIPS JOURNAL OF RESEARCH, 1990, 45 (01) : 1 - 17

← 1 2 3 4 5 →