Causal invariant geographic network representations with feature and structural distribution shifts

被引:0
|
作者
Wang, Yuhan [1 ]
He, Silu [1 ]
Luo, Qinyao [1 ]
Yuan, Hongyuan [1 ]
Zhao, Ling [1 ]
Zhu, Jiawei [2 ]
Li, Haifeng [1 ]
机构
[1] Cent South Univ, Dept Geog Informat Syst, Changsha 410000, Hunan, Peoples R China
[2] Cent South Univ, Sch Architecture & Art, Changsha 410083, Hunan, Peoples R China
关键词
Geographic network representation learning; Out-of-distribution generalisation; Casual inference;
D O I
10.1016/j.future.2025.107814
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Relationships between geographic entities, including human-land and human-people relationships, can be naturally modelled by graph structures, and geographic network representation is an important theoretical issue. The existing methods learn geographic network representations through deep graph neural networks (GNNs) based on the i.i.d. assumption. However, the spatial heterogeneity and temporal dynamics of geographic data make the out-of-distribution (OOD) generalisation problem particularly salient. We classify geographic network representations into invariant representations that always stabilise the predicted labels under distribution shifts and background representations that vary with different distributions. The latter are particularly sensitive to distribution shifts (feature and structural shifts) between testing and training data and are the main causes of the out-of-distribution generalisation (OOD) problem. Spurious correlations are present between invariant and background representations due to selection biases/environmental effects, resulting in the model extremes being more likely to learn background representations. The existing approaches focus on background representation changes that are determined by shifts in the feature distributions of nodes in the training and test data while ignoring changes in the proportional distributions of heterogeneous and homogeneous neighbour nodes, which we refer to as structural distribution shifts. We propose a feature-structure mixed invariant representation learning (FSM-IRL) model that accounts for both feature distribution shifts and structural distribution shifts. To address structural distribution shifts, we introduce a sampling method based on causal attention, encouraging the model to identify nodes possessing strong causal relationships with labels or nodes that are more similar to the target node. This approach significantly enhances the invariance of the representations between the source and target domains while reducing the dependence on background representations that arise by chance or in specific patterns. Inspired by the Hilbert-Schmidt independence criterion, we implement a reweighting strategy to maximise the orthogonality of the node representations, thereby mitigating the spurious correlations among the node representations and suppressing the learning of background representations. In addition, we construct an educational-level geographic network dataset under out-of-distribution (OOD) conditions. Our experiments demonstrate that FSM-IRL exhibits strong learning capabilities on both geographic and social network datasets in OOD scenarios.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Feature Selection in the Reconstruction of Complex Network Representations of Spectral Data
    Zanin, Massimiliano
    Menasalvas, Ernestina
    Boccaletti, Stefano
    Sousa, Pedro
    PLOS ONE, 2013, 8 (08):
  • [32] General and feature-based semantic representations in the semantic network
    Liuzzi, Antonietta Gabriella
    Aglinskas, Aidas
    Fairhall, Scott Laurence
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [33] Oil Prices and Monetary Policy in Emerging Markets: Structural Shifts in Causal Linkages
    Nazlioglu, Saban
    Gormus, Alper
    Soytas, Ugur
    EMERGING MARKETS FINANCE AND TRADE, 2019, 55 (01) : 105 - 117
  • [34] How Does Diaspora Mobilization Become a Causal Feature of Structural Change?
    Kim, MyungJa
    JOURNAL OF ASIAN SECURITY AND INTERNATIONAL AFFAIRS, 2015, 2 (03) : 266 - 290
  • [35] Composite Fault Diagnosis of Rolling Bearings: A Feature Selection Approach Based on the Causal Feature Network
    Gao, Kuo
    Wu, Zongning
    Yu, Chongchong
    Li, Mengxiong
    Liu, Sihan
    APPLIED SCIENCES-BASEL, 2023, 13 (16):
  • [36] Invariant Node Representation Learning under Distribution Shifts with Multiple Latent Environments
    Li, Haoyang
    Zhang, Ziwei
    Wang, Xin
    Zhu, Wenwu
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (01)
  • [37] FIXC: A Method for Data Distribution Shifts Calibration via Feature Importance
    Liu Zhendong
    Jiang, Wenyu
    Zhang, Yi
    Wang, Chongjun
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 153 - 160
  • [38] Causal Graph Attention Network with Disentangled Representations for Complex Systems Fault Detection
    Liu, Jie
    Zheng, Shuwen
    Wang, Chong
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2023, 235
  • [39] Learning Network Representations With Different Order Structural Information
    Liu, Qidong
    Zhou, Xin
    Long, Cheng
    Zhang, Jie
    Xu, Mingliang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (04) : 907 - 914
  • [40] Sequence Space Representations for Translation-Modulation Invariant Function and Distribution Spaces
    Debrouwere, Andreas
    Neyt, Lenny
    JOURNAL OF FOURIER ANALYSIS AND APPLICATIONS, 2022, 28 (06)