Learning Multi-context Aware Location Representations from Large-scale Geotagged Images

被引：4

作者：

Yin, Yifang ^{[1
]}

Zhang, Ying ^{[2
]}

Liu, Zhenguang ^{[3
]}

Liang, Yuxuan ^{[1
]}

Wang, Sheng ^{[1
,4
]}

Shah, Rajiv Ratn ^{[5
]}

Zimmermann, Roger ^{[1
]}

机构：

[1] Natl Univ Singapore, Singapore, Singapore

[2] Northwestern Polytech Univ, Xian, Peoples R China

[3] Zhejiang Gongshang Univ, Hangzhou, Peoples R China

[4] Alibaba Grp, Singapore, Singapore

[5] IIIT Delhi, Delhi, India

来源：

PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021 | 2021年

关键词：

Location representations; pre-trained neural networks; attentionbased; fusion; geo-aware applications; FEATURES;

D O I：

10.1145/3474085.3475268

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the ubiquity of sensor-equipped smartphones, it is common to have multimedia documents uploaded to the Internet that have GPS coordinates associated with them. Utilizing such geotags as an additional feature is intuitively appealing for improving the performance of location-aware applications. However, raw GPS coordinates are fine-grained location indicators without any semantic information. Existing methods on geotag semantic encoding mostly extract hand-crafted, application-specific location representations that heavily depend on large-scale supplementary data and thus cannot perform efficiently on mobile devices. In this paper, we present a machine learning based approach, termed GPS2Vec+, which learns rich location representations by capitalizing on the world-wide geotagged images. Once trained, the model has no dependence on the auxiliary data anymore so it encodes geotags highly efficiently by inference. We extract visual and semantic knowledge from image content and user-generated tags, and transfer the information into locations by using geotagged images as a bridge. To adapt to different application domains, we further present an attention-based fusion framework that estimates the importance of the learnt location representations under different contexts for effective feature fusion. Our location representations yield significant performance improvements over the state-of-the-art geotag encoding methods on image classification and venue annotation.

引用

页码：899 / 907

页数：9

共 50 条

[41] Large-Scale Detection and Categorization of Oil Spills from SAR Images with Deep Learning
Bianchi, Filippo Maria
Espeseth, Martine M.
Borch, Njal
REMOTE SENSING, 2020, 12 (14)
[42] A heuristic method for large-scale multi-facility location problems
Levin, Y
Ben-Israel, A
COMPUTERS & OPERATIONS RESEARCH, 2004, 31 (02) : 257 - 272
[43] Utilizing Web Analytics in the Context of Learning Analytics for Large-Scale Online Learning
Robloff, Tobias
Oldag, Soren
Renz, Jan
Meinel, Christoph
PROCEEDINGS OF 2019 IEEE GLOBAL ENGINEERING EDUCATION CONFERENCE (EDUCON), 2019, : 296 - 305
[44] Multi-context engaged learning and ethnographic fieldwork: some notes from the middle of the edge
Carlarne, John
INTERNATIONAL JOURNAL OF SOCIAL RESEARCH METHODOLOGY, 2011, 14 (02) : 135 - 152
[45] Cognitive Modeling With Representations From Large-Scale Digital Data
Bhatia, Sudeep
Aka, Ada
CURRENT DIRECTIONS IN PSYCHOLOGICAL SCIENCE, 2022, 31 (03) : 207 - 214
[46] Neural Word Representations from Large-Scale Commonsense Knowledge
Chen, Jiaqiang
Tandon, Niket
de Melo, Gerard
2015 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT), VOL 1, 2015, : 225 - 228
[47] Location and task effects on route learning in a large-scale virtual environment
Buechner, Simon
Wiener, Jan
Hoelscher, Christoph
COGNITIVE PROCESSING, 2009, 10 : S136 - S136
[48] Mining Salient Images from a Large-scale Blogosphere
Chen, Xian
Chen, Meilian
Shin, Hyoseop
Kim, Eun Yi
2013 8TH INTERNATIONAL CONFERENCE FOR INTERNET TECHNOLOGY AND SECURED TRANSACTIONS (ICITST), 2013, : 132 - 136
[49] A Deep Learning-Based Cluster Analysis Method for Large-Scale Multi-Label Images
Xu, Yanping
TRAITEMENT DU SIGNAL, 2022, 39 (03) : 931 - 937
[50] Learning from large-scale neural simulations
Serban, Maria
VITAL MODELS: THE MAKING AND USE OF MODELS IN THE BRAIN SCIENCES, 2017, 233 : 129 - 148

← 1 2 3 4 5 →