An Experimental Study of the k-MXT Algorithm with Applications to Clustering Geo-Tagged Data

被引:2
|
作者
Cooper, Colin [1 ]
Ngoc Vu [1 ]
机构
[1] Kings Coll London, Dept Informat., London, England
关键词
D O I
10.1007/978-3-319-92871-5_10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider a graph fragmentation process which can be described as follows. Each vertex v selects the k adjacent vertices which have the largest number of common of neighbours. For each selected neighbour u, we retain the edge (v, u) to form a the subgraph graph S of the input graph. The object of interest are the components of S, the k-Max-Triangle-Neighbour (k-MXT) subgraph, and the vertex clusters they produce in the original graph. We study the application of this process to clustering in the planted partition model, and on the geometric disk graph formed from geo-tagged photographic data downloaded from Flickr. In the planted partition model, there are numbers of partitions, or subgraphs, which are connected densely within each partition but sparser between partitions. The objective is to recover these hidden partitions. We study the case of the planted partition model based on the random graph G(n,p) with additional edge probability q within the partitions. Theoretical and experimental results show that the 2-MXT algorithm can recover the partitions for any q/p > 0 constant provided the density of triangles is high enough. We apply the k-MXT algorithm experimentally to the problem of clustering geographical data, using London as an example. Given a dataset consisting of geographical coordinates extracted from photographs, we construct a disk graph by connecting every point to other points if and only if theirs distance is at most d. Our experimental results show that the k-MXT algorithm is able to produce clusters which are of comparable to popular clustering algorithms such as DBSCAN (see e.g. Fig. 5).
引用
收藏
页码:145 / 169
页数:25
相关论文
共 50 条
  • [41] Cognitive Visualization of Popular Regions Discovered From Geo-Tagged Social Media Data
    Wang, Yunzhe
    Baciu, George
    Li, Chenhui
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2018, 12 (01) : 14 - 28
  • [42] Recommending Prime Spots of a Destination and Time to Visit from Geo-tagged Social Data
    Sharma, Vishal
    Lee, Kyumin
    Chung, Jinwook
    2014 INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING (COLLABORATECOM), 2014, : 495 - 500
  • [43] Active Collection of Land Cover Sample Data from Geo-Tagged Web Texts
    Hou, Dongyang
    Chen, Jun
    Wu, Hao
    Li, Songnian
    Chen, Fei
    Zhang, Weiwei
    REMOTE SENSING, 2015, 7 (05): : 5805 - 5827
  • [44] Modeling Flu Trends with Real-Time Geo-tagged Twitter Data Streams
    Chon, Jaime
    Raymond, Ross
    Wang, Haiyan
    Wang, Feng
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, 2015, 9204 : 60 - 69
  • [45] A Semantic Geo-Tagged Multimedia-Based Routing in a Crowdsourced Big Data Environment
    Rehman, Faizan Ur
    Lbath, Ahmed
    Murad, Abdullah
    Rahman, Md. Abdur
    Sadiq, Bilal
    Ahmad, Akhlaq
    Qamar, Ahmad
    Basalamah, Saleh
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 759 - 760
  • [46] CONSTRUCTING A LANDMARK IDENTIFICATION SYSTEM FOR GEO-TAGGED PHOTOGRAPHS BASED ON WEB DATA ANALYSIS
    Hoashi, Keiichiro
    Uemukai, Toshiaki
    Matsumoto, Kazunori
    Takishima, Yasuhiro
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 606 - 609
  • [47] Adaptive landmark recommendations for travel planning: Personalizing and clustering landmarks using geo-tagged social media
    Han, Jonghyun
    Lee, Hyunju
    PERVASIVE AND MOBILE COMPUTING, 2015, 18 : 4 - 17
  • [48] PGMS: A Case Study of Collecting PDA-Based Geo-Tagged Malaria-Related Survey Data
    Zhou, Ying
    Lobo, Neil F.
    Wolkon, Adam
    Gimnig, John E.
    Malishee, Alpha
    Stevenson, Jennifer
    Sulistyawati
    Collins, Frank H.
    Madey, Greg
    AMERICAN JOURNAL OF TROPICAL MEDICINE AND HYGIENE, 2014, 91 (03): : 496 - 508
  • [49] Using GPS Geo-tagged Social Media Data and Geodemographics to Investigate Social Differences: A Twitter Pilot Study
    Chappell, Paul
    Tse, Mike
    Zhang, Minhao
    Moore, Susan
    SOCIOLOGICAL RESEARCH ONLINE, 2017, 22 (03): : 38 - 56
  • [50] Efficient Indexing of Top-k Entities in Systems of Engagement with Extensions for Geo-tagged Entities
    Mondal, Anirban
    Kakkar, Ayaan
    Padhariya, Nilesh
    Mohania, Mukesh
    DATA SCIENCE AND ENGINEERING, 2021, 6 (04) : 411 - 433