Scalable Distributed Subtrajectory Clustering

被引:0
|
作者
Tampakis, Panagiotis [1 ]
Pelekis, Nikos [2 ]
Doulkeridis, Christos [3 ]
Theodoridis, Yannis [1 ]
机构
[1] Univ Piraeus, Dept Informat, Piraeus, Greece
[2] Univ Piraeus, Dept Stat & Insurance Sci, Piraeus, Greece
[3] Univ Piraeus, Dept Digital Syst, Piraeus, Greece
来源
2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2019年
基金
欧盟地平线“2020”;
关键词
Mobility data; trajectories; subtrajectory clustering; big mobility data mining; distributed clustering; mapreduce; PATTERNS; DISCOVERY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trajectory clustering is an important operation of knowledge discovery from mobility data. Especially nowadays, the need for performing advanced analytic operations over massively produced data, such as mobility traces, in efficient and scalable ways is imperative. However, discovering clusters of complete trajectories can overlook significant patterns that exist only for a small portion of their lifespan. In this paper, we address the problem of Distributed Subtrajectory Clustering in an efficient and highly scalable way. The problem is challenging because the subtrajectories to be clustered are not known in advance, but they need to be discovered dynamically based on adjacent subtrajectories in space and time. Towards this objective, we split the original problem to three sub-problems, namely Subtrajectory Join, Trajectory Segmentation and Clustering and Outlier Detection, and deal with each one in a distributed fashion by utilizing the MapReduce programming model. The efficiency and the effectiveness of our solution is demonstrated experimentally over a synthetic and two large real datasets from the maritime and urban domains and through comparison with two state of the art subtrajectory clustering algorithms.
引用
收藏
页码:950 / 959
页数:10
相关论文
共 50 条
  • [1] Subtrajectory Clustering: Models and Algorithms
    Agarwal, Pankaj K.
    Fox, Kyle
    Munagala, Kamesh
    Nath, Abhinandan
    Pan, Jiangwei
    Taylor, Erin
    PODS'18: PROCEEDINGS OF THE 37TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2018, : 75 - 87
  • [2] Scalable clustering: A distributed approach
    Hore, P
    Hall, LO
    2004 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, PROCEEDINGS, 2004, : 143 - 148
  • [3] Distributed Subtrajectory Join on Massive Datasets
    Tampakis, Panagiotis
    Doulkeridis, Christos
    Pelekis, Nikos
    Theodoridis, Yannis
    ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2020, 6 (02)
  • [4] Scalable density-based distributed clustering
    Januzaj, E
    Kriegel, HP
    Pfeifle, M
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 231 - 244
  • [5] A GPU Approach to Subtrajectory Clustering Using the Frechet Distance
    Gudmundsson, Joachim
    Valladares, Nacho
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2015, 26 (04) : 924 - 937
  • [6] ROADSTER: Improved algorithms for subtrajectory clustering and map construction
    Buchin, Kevin
    Buchin, Maike
    Gudmundsson, Joachim
    Hendriks, Jorren
    Sereshgi, Erfan Hosseini
    Sleijster, Jorrick
    Silveira, Rodrigo I.
    Staals, Frank
    Wenk, Carola
    COMPUTERS & GEOSCIENCES, 2025, 196
  • [7] Scalable local density-based distributed clustering
    Liu Yan-bing
    Liu Zhang-xiong
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 9491 - 9498
  • [8] Enabling scalable and accurate clustering of distributed ligand geometries on supercomputers
    Zhang, Boyu
    Estrada, Trilce
    Cicotti, Pietro
    Balaji, Pavan
    Taufer, Michela
    PARALLEL COMPUTING, 2017, 63 : 38 - 60
  • [9] Scalable Uniformly-Distributed Clustering Mechanism for Sensor Networks
    Yun, Phil-Jung
    Kim, Changhwa
    Kim, Sangkyung
    Kim, JaeNam
    EUC 2008: PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING, VOL 2, WORKSHOPS, 2008, : 676 - 681
  • [10] KeyBin2: Distributed Clustering for Scalable and In-Situ Analysis
    Chen, Xinyu
    Benson, Jeremy
    Peterson, Matt
    Taufer, Michela
    Estrada, Trilce
    PROCEEDINGS OF THE 47TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, 2018,