Towards general-purpose representation learning of polygonal geometries

被引:18
|
作者
Mai, Gengchen [1 ,2 ,3 ,4 ]
Jiang, Chiyu [5 ]
Sun, Weiwei [6 ]
Zhu, Rui [3 ,4 ,7 ]
Xuan, Yao [8 ]
Cai, Ling [3 ,4 ]
Janowicz, Krzysztof [3 ,4 ,9 ]
Ermon, Stefano [2 ,10 ]
Lao, Ni [11 ]
机构
[1] Univ Georgia, Dept Geog, Spatially Explicit Artificial Intelligence Lab, Athens, GA 30602 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[3] Univ Calif Santa Barbara, STKO Lab, Santa Barbara, CA 93106 USA
[4] Univ Calif Santa Barbara, Ctr Spatial Studies, Santa Barbara, CA 93106 USA
[5] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94720 USA
[6] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1Z4, Canada
[7] Univ Bristol, Sch Geog Sci, Bristol BS8 1TH, Avon, England
[8] Univ Calif Santa Barbara, Dept Math, Santa Barbara, CA 93106 USA
[9] Univ Vienna, Dept Geog & Reg Res, A-1040 Vienna, Austria
[10] Chan Zuckerberg Biohub, San Francisco, CA 94158 USA
[11] Google, Mountain View, CA 94043 USA
基金
美国国家科学基金会;
关键词
Polygon encoding; Non-uniform fourier transformation; Shape classification; Spatial relation prediction; Spatially explicit artificial intelligence; OBJECT; APPEARANCE; PATTERNS; NETWORK;
D O I
10.1007/s10707-022-00481-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Neural network representation learning for spatial data (e.g., points, polylines, polygons, and networks) is a common need for geographic artificial intelligence (GeoAI) problems. In recent years, many advancements have been made in representation learning for points, polylines, and networks, whereas little progress has been made for polygons, especially complex polygonal geometries. In this work, we focus on developing a general-purpose polygon encoding model, which can encode a polygonal geometry (with or without holes, single or multipolygons) into an embedding space. The result embeddings can be leveraged directly (or finetuned) for downstream tasks such as shape classification, spatial relation prediction, building pattern classification, cartographic building generalization, and so on. To achieve model generalizability guarantees, we identify a few desirable properties that the encoder should satisfy: loop origin invariance, trivial vertex invariance, part permutation invariance, and topology awareness. We explore two different designs for the encoder: one derives all representations in the spatial domain and can naturally capture local structures of polygons; the other leverages spectral domain representations and can easily capture global structures of polygons. For the spatial domain approach we propose ResNet1D, a 1D CNN-based polygon encoder, which uses circular padding to achieve loop origin invariance on simple polygons. For the spectral domain approach we develop NUFTspec based on Non-Uniform Fourier Transformation (NUFT), which naturally satisfies all the desired properties. We conduct experiments on two different tasks: 1) polygon shape classification based on the commonly used MNIST dataset; 2) polygon-based spatial relation prediction based on two new datasets (DBSR-46K and DBSR-cplx46K) constructed from OpenStreetMap and DBpedia. Our results show that NUFTspec and ResNet1D outperform multiple existing baselines with significant margins. While ResNet1D suffers from model performance degradation after shape-invariance geometry modifications, NUFTspec is very robust to these modifications due to the nature of the NUFT representation. NUFTspec is able to jointly consider all parts of a multipolygon and their spatial relations during prediction while ResNet1D can recognize the shape details which are sometimes important for classification. This result points to a promising research direction of combining spatial and spectral representations.
引用
收藏
页码:289 / 340
页数:52
相关论文
共 50 条
  • [1] Towards general-purpose representation learning of polygonal geometries
    Gengchen Mai
    Chiyu Jiang
    Weiwei Sun
    Rui Zhu
    Yao Xuan
    Ling Cai
    Krzysztof Janowicz
    Stefano Ermon
    Ni Lao
    GeoInformatica, 2023, 27 : 289 - 340
  • [2] BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation
    Niizumi, Daisuke
    Takeuchi, Daiki
    Ohishi, Yasunori
    Harada, Noboru
    Kashino, Kunio
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Bioinspired framework for general-purpose learning
    de Toledo, SA
    Barreiro, JM
    FOUNDATIONS AND TOOLS FOR NEURAL MODELING, PROCEEDINGS, VOL I, 1999, 1606 : 507 - 516
  • [4] Synthetic Sensors: Towards General-Purpose Sensing
    Laput, Gierad
    Zhang, Yang
    Harrison, Chris
    PROCEEDINGS OF THE 2017 ACM SIGCHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI'17), 2017, : 3986 - 3999
  • [5] Towards General-Purpose Neural Network Computing
    Eldridge, Schuyler
    Appavoo, Jonathan
    Joshi, Ajay
    Waterland, Amos
    Seltzer, Margo
    2015 INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURE AND COMPILATION (PACT), 2015, : 99 - 112
  • [6] PolyMesher: a general-purpose mesh generator for polygonal elements written in Matlab
    Talischi, Cameron
    Paulino, Glaucio H.
    Pereira, Anderson
    Menezes, Ivan F. M.
    STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2012, 45 (03) : 309 - 328
  • [7] PolyMesher: a general-purpose mesh generator for polygonal elements written in Matlab
    Cameron Talischi
    Glaucio H. Paulino
    Anderson Pereira
    Ivan F. M. Menezes
    Structural and Multidisciplinary Optimization, 2012, 45 : 309 - 328
  • [8] Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation
    Niizumi, Daisuke
    Takeuchi, Daiki
    Ohishi, Yasunori
    Harada, Noboru
    Kashino, Kunio
    HEAR: HOLISTIC EVALUATION OF AUDIO REPRESENTATIONS, VOL 166, 2021, 166 : 1 - 24
  • [9] CONTRASTIVE LEARNING OF GENERAL-PURPOSE AUDIO REPRESENTATIONS
    Saeed, Aaqib
    Grangier, David
    Zeghidour, Neil
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3875 - 3879
  • [10] LEARNING ON VLSI - A GENERAL-PURPOSE DIGITAL NEUROCHIP
    DURANTON, M
    SIRAT, JA
    PHILIPS JOURNAL OF RESEARCH, 1990, 45 (01) : 1 - 17