On the Opportunities and Challenges of Foundation Models for GeoAI (Vision Paper)

被引:12
|
作者
Mai, Gengchen [1 ]
Huang, Weiming [2 ]
Sun, Jin [3 ]
Song, Suhang [4 ]
Mishra, Deepak [5 ]
Liu, Ninghao [3 ]
Gao, Song [6 ]
Liu, Tianming [3 ]
Cong, Gao [2 ]
Hu, Yingjie [7 ]
Cundy, Chris [8 ]
Li, Ziyuan [9 ]
Zhu, Rui [10 ]
Lao, Ni [11 ]
机构
[1] Univ Georgia, Dept Geog, 210 Field St, Athens, GA USA
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Block N4,50 Nanyang Ave, Singapore, Singapore
[3] Univ Georgia Athens, Sch Comp, 415 Boyd Res & Educ Ctr, Athens, GA 30602 USA
[4] Univ Georgia, Coll Publ Hlth, Rhodes Hall,105 Spear Rd, Athens, GA 30602 USA
[5] Univ Georgia, Dept Geog, 210 Field St, Athens, GA 30602 USA
[6] Univ Wisconsin Madison, Dept Geog, Geospatial Data Sci Lab, Sci Hall,550 N Pk St, Madison, WI 53715 USA
[7] Univ Buffalo, Dept Geog, GeoAI Lab, Ste 105, Buffalo, NY 14261 USA
[8] Stanford Univ, Dept Comp Sci, 353 Jane Stanford Way, Stanford, CA 94305 USA
[9] Univ Connecticut, Sch Business, 2100 Hillside Rd, Storrs, CT 06269 USA
[10] Sch Geog Sci, Univ Rd, Bristol BS81SS, Avon, England
[11] Google, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
基金
美国国家科学基金会;
关键词
Foundation models; geospatial artificial intelligence; multimodal learning; GEOGRAPHICALLY WEIGHTED REGRESSION; URBAN LAND-USE; GEOSPATIAL SEMANTICS; HEALTH GEOGRAPHY; KNOWLEDGE GRAPH; TRAJECTORIES; LOCATION; CONTEXT; IMPACT; PLACE;
D O I
10.1145/3653070
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Large pre-trained models, also known as foundation models (FMs), are trained in a task-agnostic manner on large-scale data and can be adapted to a wide range of downstream tasks by fine-tuning, few-shot, or even zero-shot learning. Despite their successes in language and vision tasks, we have not yet seen an attempt to develop foundation models for geospatial artificial intelligence (GeoAI). In this work, we explore the promises and challenges of developing multimodal foundation models for GeoAI. We first investigate the potential of many existing FMs by testing their performances on seven tasks across multiple geospatial domains, including Geospatial Semantics, Health Geography, Urban Geography, and Remote Sensing. Our results indicate that on several geospatial tasks that only involve text modality, such as toponym recognition, location description recognition, and US state-level/county-level dementia time series forecasting, the task-agnostic large learning models (LLMs) can outperform task-specific fully supervised models in a zero-shot or few-shot learning setting. However, on other geospatial tasks, especially tasks that involve multiple data modalities (e.g., POI-based urban function classification, street view image-based urban noise intensity classification, and remote sensing image scene classification), existing FMs still underperform task-specific models. Based on these observations, we propose that one of the major challenges of developing an FM for GeoAI is to address the multimodal nature of geospatial tasks. After discussing the distinct challenges of each geospatial data modality, we suggest the possibility of a multimodal FM that can reason over various types of geospatial data through geospatial alignments. We conclude this article by discussing the unique risks and challenges to developing such a model for GeoAI.
引用
收藏
页数:46
相关论文
共 50 条
  • [41] DESPITE CHALLENGES, QD VISION LAID THE FOUNDATION FOR QUANTUM
    Patel, Prachi
    Information Display, 2022, 38 (03) : 42 - 44
  • [42] Towards a Foundation Model for Geospatial Artificial Intelligence (Vision Paper)
    Mai, Gengchen
    Cundy, Chris
    Choi, Kristy
    Hu, Yingjie
    Lao, Ni
    Ermon, Stefano
    30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 744 - 747
  • [43] Challenges and Opportunities in the Estimation of Dynamic Models
    Xu, Ran
    DeShon, Richard P.
    Dishop, Christopher R.
    ORGANIZATIONAL RESEARCH METHODS, 2020, 23 (04) : 595 - 619
  • [44] Large circuit models: opportunities and challenges
    Lei CHEN
    Yiqi CHEN
    Zhufei CHU
    Wenji FANG
    TsungYi HO
    Ru HUANG
    Yu HUANG
    Sadaf KHAN
    Min LI
    Xingquan LI
    Yu LI
    Yun LIANG
    Jinwei LIU
    Yi LIU
    Yibo LIN
    Guojie LUO
    Hongyang PAN
    Zhengyuan SHI
    Guangyu SUN
    Dimitrios TSARAS
    Runsheng WANG
    Ziyi WANG
    Xinming WEI
    Zhiyao XIE
    Qiang XU
    Chenhao XUE
    Junchi YAN
    Jun YANG
    Bei YU
    Mingxuan YUAN
    Evangeline FYYOUNG
    Xuan ZENG
    Haoyi ZHANG
    Zuodong ZHANG
    Yuxiang ZHAO
    HuiLing ZHEN
    Ziyang ZHENG
    Binwu ZHU
    Keren ZHU
    Sunan ZOU
    Science China(Information Sciences), 2024, 67 (10) : 25 - 66
  • [45] Large circuit models: opportunities and challenges
    Chen, Lei
    Chen, Yiqi
    Chu, Zhufei
    Fang, Wenji
    Ho, Tsung-Yi
    Huang, Ru
    Huang, Yu
    Khan, Sadaf
    Li, Min
    Li, Xingquan
    Li, Yu
    Liang, Yun
    Liu, Jinwei
    Liu, Yi
    Lin, Yibo
    Luo, Guojie
    Pan, Hongyang
    Shi, Zhengyuan
    Sun, Guangyu
    Tsaras, Dimitrios
    Wang, Runsheng
    Wang, Ziyi
    Wei, Xinming
    Xie, Zhiyao
    Xu, Qiang
    Xue, Chenhao
    Yan, Junchi
    Yang, Jun
    Yu, Bei
    Yuan, Mingxuan
    Young, Evangeline F. Y.
    Zeng, Xuan
    Zhang, Haoyi
    Zhang, Zuodong
    Zhao, Yuxiang
    Zhen, Hui-Ling
    Zheng, Ziyang
    Zhu, Binwu
    Zhu, Keren
    Zou, Sunan
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (10)
  • [46] RESEARCH ON BUSINESS MODELS: CHALLENGES AND OPPORTUNITIES
    Markides, Constantinos C.
    BUSINESS MODELS AND MODELLING, 2015, 33 : 133 - 147
  • [47] Toward Automated Vehicle Teleoperation: Vision, Opportunities, and Challenges
    Zhang, Tao
    IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (12) : 11347 - 11354
  • [48] Automated seed identification with computer vision: challenges and opportunities
    Zhao, Liang
    Haque, S. M. Rafizul
    Wang, Ruojing
    SEED SCIENCE AND TECHNOLOGY, 2022, 50 (01) : 75 - 102
  • [49] The emergence of cognitive digital twin: vision, challenges and opportunities
    Zheng, Xiaochen
    Lu, Jinzhi
    Kiritsis, Dimitris
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (24) : 7610 - 7632
  • [50] A vision of dew point evaporative cooling: Opportunities and challenges
    Lai, Lanbo
    Wang, Xiaolin
    Hu, Eric
    Ng, Kim Choon
    APPLIED THERMAL ENGINEERING, 2024, 244