An Experimental Study of the k-MXT Algorithm with Applications to Clustering Geo-Tagged Data

被引:2
|
作者
Cooper, Colin [1 ]
Ngoc Vu [1 ]
机构
[1] Kings Coll London, Dept Informat., London, England
关键词
D O I
10.1007/978-3-319-92871-5_10
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We consider a graph fragmentation process which can be described as follows. Each vertex v selects the k adjacent vertices which have the largest number of common of neighbours. For each selected neighbour u, we retain the edge (v, u) to form a the subgraph graph S of the input graph. The object of interest are the components of S, the k-Max-Triangle-Neighbour (k-MXT) subgraph, and the vertex clusters they produce in the original graph. We study the application of this process to clustering in the planted partition model, and on the geometric disk graph formed from geo-tagged photographic data downloaded from Flickr. In the planted partition model, there are numbers of partitions, or subgraphs, which are connected densely within each partition but sparser between partitions. The objective is to recover these hidden partitions. We study the case of the planted partition model based on the random graph G(n,p) with additional edge probability q within the partitions. Theoretical and experimental results show that the 2-MXT algorithm can recover the partitions for any q/p > 0 constant provided the density of triangles is high enough. We apply the k-MXT algorithm experimentally to the problem of clustering geographical data, using London as an example. Given a dataset consisting of geographical coordinates extracted from photographs, we construct a disk graph by connecting every point to other points if and only if theirs distance is at most d. Our experimental results show that the k-MXT algorithm is able to produce clusters which are of comparable to popular clustering algorithms such as DBSCAN (see e.g. Fig. 5).
引用
收藏
页码:145 / 169
页数:25
相关论文
共 50 条
  • [1] The Clusterization of Geo-Tagged Data for Finding City Sights with Use of a Modification of k-MXT Algorithm
    Stepanova, Anastasia
    Mironov, Sergei V.
    Korobov, Eugene
    Sidorov, Sergei
    PROCEEDINGS OF THE THIRD WORKSHOP ON COMPUTER MODELLING IN DECISION MAKING (CMDM 2018), 2018, 85 : 20 - 25
  • [2] Modification of the k-MXT Algorithm and Its Application to the Geotagged Data Clustering
    Stepanova, Anastasia
    Mironov, Sergei, V
    Sidorov, Sergei
    Faizliev, Alexey
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, 2019, 11943 : 296 - 307
  • [3] Clustering Geo-Tagged Tweets for Advanced Big Data Analytics
    Bordogna, Gloria
    Frigerio, Luca
    Cuzzocrea, Alfredo
    Psaila, Giuseppe
    2016 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2016, 2016, : 42 - 51
  • [4] Regional Level Influenza Study with Geo-Tagged Twitter Data
    Wang, Feng
    Wang, Haiyan
    Xu, Kuai
    Raymond, Ross
    Chon, Jaime
    Fuller, Shaun
    Debruyn, Anton
    JOURNAL OF MEDICAL SYSTEMS, 2016, 40 (08)
  • [5] Regional Level Influenza Study with Geo-Tagged Twitter Data
    Feng Wang
    Haiyan Wang
    Kuai Xu
    Ross Raymond
    Jaime Chon
    Shaun Fuller
    Anton Debruyn
    Journal of Medical Systems, 2016, 40
  • [6] Sensing urban vibrancy using geo-tagged data
    Zhu T.
    Tu W.
    Yue Y.
    Zhong C.
    Zhao T.
    Li Q.
    Li Q.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2020, 49 (03): : 365 - 374
  • [7] Efficient interactive search for geo-tagged multimedia data
    Jun Long
    Lei Zhu
    Chengyuan Zhang
    Zhan Yang
    Yunwu Lin
    Ruipeng Chen
    Multimedia Tools and Applications, 2019, 78 : 30677 - 30706
  • [8] Distributed Sentiment Analysis for Geo-Tagged Twitter Data
    Zengin, Muhammed Said
    Arslan, Rabia
    Akgun, Mehmet Burak
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [9] Efficient interactive search for geo-tagged multimedia data
    Long, Jun
    Zhu, Lei
    Zhang, Chengyuan
    Yang, Zhan
    Lin, Yunwu
    Chen, Ruipeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (21) : 30677 - 30706
  • [10] DeepDBSCAN: Deep Density-Based Clustering for Geo-Tagged Photos
    Park, Jang You
    Ryu, Dong June
    Nam, Kwang Woo
    Jang, Insung
    Jang, Minseok
    Lee, Yonsik
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (08)