Causal invariant geographic network representations with feature and structural distribution shifts

被引：0

作者：

Wang, Yuhan ^{[1
]}

He, Silu ^{[1
]}

Luo, Qinyao ^{[1
]}

Yuan, Hongyuan ^{[1
]}

Zhao, Ling ^{[1
]}

Zhu, Jiawei ^{[2
]}

Li, Haifeng ^{[1
]}

机构：

[1] Cent South Univ, Dept Geog Informat Syst, Changsha 410000, Hunan, Peoples R China

[2] Cent South Univ, Sch Architecture & Art, Changsha 410083, Hunan, Peoples R China

来源：

FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2025年 / 169卷

关键词：

Geographic network representation learning; Out-of-distribution generalisation; Casual inference;

D O I：

10.1016/j.future.2025.107814

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Relationships between geographic entities, including human-land and human-people relationships, can be naturally modelled by graph structures, and geographic network representation is an important theoretical issue. The existing methods learn geographic network representations through deep graph neural networks (GNNs) based on the i.i.d. assumption. However, the spatial heterogeneity and temporal dynamics of geographic data make the out-of-distribution (OOD) generalisation problem particularly salient. We classify geographic network representations into invariant representations that always stabilise the predicted labels under distribution shifts and background representations that vary with different distributions. The latter are particularly sensitive to distribution shifts (feature and structural shifts) between testing and training data and are the main causes of the out-of-distribution generalisation (OOD) problem. Spurious correlations are present between invariant and background representations due to selection biases/environmental effects, resulting in the model extremes being more likely to learn background representations. The existing approaches focus on background representation changes that are determined by shifts in the feature distributions of nodes in the training and test data while ignoring changes in the proportional distributions of heterogeneous and homogeneous neighbour nodes, which we refer to as structural distribution shifts. We propose a feature-structure mixed invariant representation learning (FSM-IRL) model that accounts for both feature distribution shifts and structural distribution shifts. To address structural distribution shifts, we introduce a sampling method based on causal attention, encouraging the model to identify nodes possessing strong causal relationships with labels or nodes that are more similar to the target node. This approach significantly enhances the invariance of the representations between the source and target domains while reducing the dependence on background representations that arise by chance or in specific patterns. Inspired by the Hilbert-Schmidt independence criterion, we implement a reweighting strategy to maximise the orthogonality of the node representations, thereby mitigating the spurious correlations among the node representations and suppressing the learning of background representations. In addition, we construct an educational-level geographic network dataset under out-of-distribution (OOD) conditions. Our experiments demonstrate that FSM-IRL exhibits strong learning capabilities on both geographic and social network datasets in OOD scenarios.

引用

页数：12

共 50 条

[31] Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
Zanin, Massimiliano
Menasalvas, Ernestina
Boccaletti, Stefano
Sousa, Pedro
PLOS ONE, 2013, 8 (08):
[32] General and feature-based semantic representations in the semantic network
Liuzzi, Antonietta Gabriella
Aglinskas, Aidas
Fairhall, Scott Laurence
SCIENTIFIC REPORTS, 2020, 10 (01)
[33] Oil Prices and Monetary Policy in Emerging Markets: Structural Shifts in Causal Linkages
Nazlioglu, Saban
Gormus, Alper
Soytas, Ugur
EMERGING MARKETS FINANCE AND TRADE, 2019, 55 (01) : 105 - 117
[34] How Does Diaspora Mobilization Become a Causal Feature of Structural Change?
Kim, MyungJa
JOURNAL OF ASIAN SECURITY AND INTERNATIONAL AFFAIRS, 2015, 2 (03) : 266 - 290
[35] Composite Fault Diagnosis of Rolling Bearings: A Feature Selection Approach Based on the Causal Feature Network
Gao, Kuo
Wu, Zongning
Yu, Chongchong
Li, Mengxiong
Liu, Sihan
APPLIED SCIENCES-BASEL, 2023, 13 (16):
[36] Invariant Node Representation Learning under Distribution Shifts with Multiple Latent Environments
Li, Haoyang
Zhang, Ziwei
Wang, Xin
Zhu, Wenwu
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (01)
[37] FIXC: A Method for Data Distribution Shifts Calibration via Feature Importance
Liu Zhendong
Jiang, Wenyu
Zhang, Yi
Wang, Chongjun
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 153 - 160
[38] Causal Graph Attention Network with Disentangled Representations for Complex Systems Fault Detection
Liu, Jie
Zheng, Shuwen
Wang, Chong
RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 235
[39] Learning Network Representations With Different Order Structural Information
Liu, Qidong
Zhou, Xin
Long, Cheng
Zhang, Jie
Xu, Mingliang
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (04) : 907 - 914
[40] Sequence Space Representations for Translation-Modulation Invariant Function and Distribution Spaces
Debrouwere, Andreas
Neyt, Lenny
JOURNAL OF FOURIER ANALYSIS AND APPLICATIONS, 2022, 28 (06)

← 1 2 3 4 5 →