A Transformer-Based Multimodal Model for Urban-Rural Fringe Identification

被引:1
|
作者
Jia, Furong [1 ]
Dong, Quanhua [1 ]
Huang, Zhou [1 ]
Chen, Xiao-Jian [1 ]
Wang, Yi [1 ]
Peng, Xia [3 ]
Guo, Yuan [2 ]
Ma, Ruixian [4 ]
Zhang, Fan [1 ]
Liu, Yu [1 ,5 ]
机构
[1] Peking Univ, Sch Earth & Space Sci, Inst Remote Sensing & Geog Informat Syst, Beijing 100871, Peoples R China
[2] Wuhan Univ Technol, Sch Resources & Environm Engn, Wuhan 430070, Peoples R China
[3] Beijing Union Univ, Tourism Coll, Beijing 100101, Peoples R China
[4] MIT, Senseable City Lab, Cambridge, MA 02139 USA
[5] Int Res Ctr Big Data Sustainable Dev Goals, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Transformers; Visualization; Socioeconomics; Buildings; Labeling; Data models; Deep learning; social sensing; street view images (SVIs); urban rural fringe (URF); urbanization; LAND-USE; URBANIZATION; AREAS;
D O I
10.1109/JSTARS.2024.3439429
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As the frontier of urbanization, urban-rural fringes (URFs) transitionally connect urban construction regions to the rural hinterland, and its identification is significant for the study of urbanization-related socioeconomic changes and human dynamics. Previous research on URF identification has predominantly relied on remote sensing data, which often provides a uniform overhead perspective with limited spatial resolution. As an additional data source, street view images (SVIs) offer a valuable human-related perspective, efficiently capturing intricate transitions from urban to rural areas. However, the abundant visual information offered by SVIs has often been overlooked and multimodal techniques have seldom been explored to integrate multisource data for delineating URFs. To address this gap, this study proposes a transformed-based multimodal methodology for identifying URFs, which includes a street view panorama classifier and a remote sensing classification model. In the study area of Beijing, the experimental results indicate that an URF with a total area of 731.24 km(2) surrounds urban cores, primarily located between the fourth and sixth ring roads. The effectiveness of the proposed method is demonstrated through comparative experiments with traditional URF identification methods. In addition, a series of ablation studies demonstrate the efficacy of incorporating multisource data. Based on the delineated URFs in Beijing, this research introduced points of interest data and commuting data to analyze the socioeconomic characteristics of URFs. The findings indicate that URFs are characterized by longer commuting distances and less diverse restaurant consumption patterns compared to more urbanized regions. This study enables the accurate identification of URFs through the transform-based multimodal approach integrating SVIs. Furthermore, it provides a human-centric comprehension of URFs, which is essential for informing strategies of urban planning and development.
引用
收藏
页码:15041 / 15051
页数:11
相关论文
共 50 条
  • [21] Soil quality assessment of reclaimed land in the urban-rural fringe
    Li, Fangfang
    Zhang, Xinsheng
    Zhao, Ye
    Song, Mengjie
    Liang, Jia
    CATENA, 2023, 220
  • [22] Transformer-based models for multimodal irony detection
    Tomás D.
    Ortega-Bueno R.
    Zhang G.
    Rosso P.
    Schifanella R.
    Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (6) : 7399 - 7410
  • [23] Transformer-Based Multimodal Infusion Dialogue Systems
    Liu, Bo
    He, Lejian
    Liu, Yafei
    Yu, Tianyao
    Xiang, Yuejia
    Zhu, Li
    Ruan, Weijian
    ELECTRONICS, 2022, 11 (20)
  • [24] A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations
    Ma, Hui
    Wang, Jian
    Lin, Hongfei
    Zhang, Bo
    Zhang, Yijia
    Xu, Bo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 776 - 788
  • [25] Transformer-Based Intelligent Prediction Model for Multimodal Multi-Objective Optimization
    Dang, Qianlong
    Zhang, Guanghui
    Wang, Ling
    Yu, Yang
    Yang, Shuai
    He, Xiaoyu
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2025, 20 (01) : 34 - 49
  • [26] Psychological disorder detection: A multimodal approach using a transformer-based hybrid model
    Ghosh, Debadrita
    Karande, Hema
    Gite, Shilpa
    Pradhan, Biswajeet
    METHODSX, 2024, 13
  • [27] URBAN-RURAL FRINGE RECOGNITION WITH THE INTEGRATION OF OPTICAL AND NIGHTTIME LIGHTS DATA
    Chen, Xiaolin
    Jia, Xiuping
    Pickering, Mark
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 7435 - 7438
  • [28] FACTORS AFFECTING LAND-USE CHANGE AT THE URBAN-RURAL FRINGE
    LEE, L
    GROWTH AND CHANGE, 1979, 10 (04) : 25 - 31
  • [29] Civic Engagement and Governance in the Urban-Rural Fringe: Evidence from Ireland
    Mahon, Marie
    Fahy, Frances
    Cinneide, Micheal O.
    Gallagher, Brenda
    NATURE + CULTURE, 2009, 4 (01): : 57 - 77
  • [30] Conserving metapopulations in human-altered landscapes at the urban-rural fringe
    Bauer, Dana Marie
    Swallow, Stephen K.
    ECOLOGICAL ECONOMICS, 2013, 95 : 159 - 170