Learning Multi-context Aware Location Representations from Large-scale Geotagged Images

Cited by: 4
Authors
Yin, Yifang [1 ]
Zhang, Ying [2 ]
Liu, Zhenguang [3 ]
Liang, Yuxuan [1 ]
Wang, Sheng [1 ,4 ]
Shah, Rajiv Ratn [5 ]
Zimmermann, Roger [1 ]
Affiliations
[1] Natl Univ Singapore, Singapore, Singapore
[2] Northwestern Polytech Univ, Xian, Peoples R China
[3] Zhejiang Gongshang Univ, Hangzhou, Peoples R China
[4] Alibaba Grp, Singapore, Singapore
[5] IIIT Delhi, Delhi, India
Source
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021
Keywords
Location representations; pre-trained neural networks; attention-based fusion; geo-aware applications; FEATURES;
DOI
10.1145/3474085.3475268
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
With the ubiquity of sensor-equipped smartphones, it is common to have multimedia documents uploaded to the Internet that have GPS coordinates associated with them. Utilizing such geotags as an additional feature is intuitively appealing for improving the performance of location-aware applications. However, raw GPS coordinates are fine-grained location indicators without any semantic information. Existing methods on geotag semantic encoding mostly extract hand-crafted, application-specific location representations that heavily depend on large-scale supplementary data and thus cannot perform efficiently on mobile devices. In this paper, we present a machine learning based approach, termed GPS2Vec+, which learns rich location representations by capitalizing on the world-wide geotagged images. Once trained, the model has no dependence on the auxiliary data anymore so it encodes geotags highly efficiently by inference. We extract visual and semantic knowledge from image content and user-generated tags, and transfer the information into locations by using geotagged images as a bridge. To adapt to different application domains, we further present an attention-based fusion framework that estimates the importance of the learnt location representations under different contexts for effective feature fusion. Our location representations yield significant performance improvements over the state-of-the-art geotag encoding methods on image classification and venue annotation.
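The attention-based fusion idea in the abstract can be illustrated with a minimal sketch. This is not the GPS2Vec+ implementation: the function names `softmax` and `attention_fuse`, the dot-product scoring rule, and the toy vectors are all assumptions made for illustration. The sketch only shows the general pattern of weighting several location representations by their relevance to a context vector and summing them.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_fuse(context, location_reps):
    """Fuse several location representations under one context.

    context:       list of floats (hypothetical context feature, e.g. from an image)
    location_reps: list of equally sized float lists (hypothetical learnt
                   representations, e.g. one visual and one semantic)
    Scoring by plain dot product is an assumption, not the paper's method.
    """
    # Relevance score of each representation with respect to the context
    scores = [sum(c * r for c, r in zip(context, rep)) for rep in location_reps]
    # Normalize the scores into attention weights that sum to 1
    weights = softmax(scores)
    # Weighted sum of the representations, dimension by dimension
    dim = len(location_reps[0])
    fused = [
        sum(w * rep[i] for w, rep in zip(weights, location_reps))
        for i in range(dim)
    ]
    return fused, weights


# Toy usage: the context aligns with the first representation,
# so that representation receives the larger weight.
fused, weights = attention_fuse([1.0, 0.0], [[2.0, 0.0], [0.0, 2.0]])
```

In a trained model the scoring function would typically have learnable parameters, so that the weights adapt to each application domain; the fixed dot product here stands in for that step.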
Pages: 899-907
Number of pages: 9