An Efficient and Distributed Framework for Real-Time Trajectory Stream Clustering

被引:5
|
作者
Gao, Yunjun [1 ]
Fang, Ziquan [1 ]
Xu, Jiachen [1 ]
Gong, Shenghao [1 ]
Shen, Chunhui [2 ,3 ]
Chen, Lu [1 ]
机构
[1] Zhejiang Univ, Hangzhou 310027, Zhejiang, Peoples R China
[2] Alibaba Grp, Hangzhou 310052, Zhejiang, Peoples R China
[3] AZFT Lab, Hangzhou 310007, Zhejiang, Peoples R China
关键词
Trajectory; Real-time systems; Clustering algorithms; Market research; Scalability; Behavioral sciences; Measurement; DBSCAN; distributed online processing; grid partitioning; trajectory stream clustering; SIMPLIFICATION;
D O I
10.1109/TKDE.2023.3312319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the explosive ubiquity of GPS-equipped devices, e.g., mobile phones, vehicles, and vessels, a massive amount of real-time, unbounded, and varying-sampling trajectory streams are being generated continuously. Clustering trajectory streams is useful in real-life applications, such as traffic congestion prediction, crowd flow detection, and moving behavior study. Although several sliding-window based algorithms (that adopt the classic two-phases online-offline processing framework) are proposed for trajectory stream clustering, three challenges exist to meet ever-increasing application demands for effective, efficient, and scalable online clustering: i) How to effectively model unbounded trajectory streams in the online settings for effective clustering? ii) How to achieve truly real-time online processing? iii) How to improve the scalable capability of the clustering algorithm to support large-scale moving trajectory streams? In this paper, we propose an efficient and distributed trajectory stream clustering framework that can: i) model trajectory streams dynamically and effectively in a self-adaptive manner, i.e., k-Segment, which considers both spatial and temporal aspects of trajectory streams, ii) support distributed indexing, processing, and workload balance, and iii) incrementally cluster trajectory streams in an efficient manner. Experiments on a wide range of real-world trajectory datasets show that our framework outperforms state-of-the-art baselines in terms of clustering quality, efficiency, and scalability.
引用
收藏
页码:1857 / 1873
页数:17
相关论文
共 50 条
  • [1] Lunatory: A Real-Time Distributed Trajectory Clustering Framework for Web Big Data
    Wu, Yang
    Pan, Zhicheng
    Chao, Pingfu
    Fang, Junhua
    Chen, Wei
    Zhao, Lei
    WEB ENGINEERING (ICWE 2022), 2022, 13362 : 219 - 234
  • [2] Disatra: A Real-Time Distributed Abstract Trajectory Clustering
    Chen, Liang
    Chao, Pingfu
    Fang, Junhua
    Chen, Wei
    Xu, Jiajie
    Zhao, Lei
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2021, PT I, 2021, 13080 : 619 - 635
  • [3] TStream: A Framework for Real-time and Scalable Trajectory Stream Processing and Analysis
    Shaikh, Salman Ahmed
    Kitagawa, Hiroyuki
    Matono, Akiyoshi
    Kim, Kyoung-sook
    30TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS, ACM SIGSPATIAL GIS 2022, 2022, : 215 - 218
  • [4] MicroGRID: An Accurate and Efficient Real-Time Stream Data Clustering with Noise
    Tari, Z.
    Thompson, A.
    Almusalam, N.
    Bertok, P.
    Mahmood, A.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT II, 2018, 10938 : 481 - 492
  • [5] MDDRSPF: A Model Driven Distributed Real-time Stream Processing Framework
    Wen, Yijun
    Zhang, Li
    Wang, Cheng
    2019 IEEE INTL CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, BIG DATA & CLOUD COMPUTING, SUSTAINABLE COMPUTING & COMMUNICATIONS, SOCIAL COMPUTING & NETWORKING (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2019), 2019, : 1352 - 1358
  • [6] CDRT: An Efficient Clustering Algorithm for Distributed Real-Time Database sites
    Abdel-kader, H. M.
    Salem, Rashed
    Saleh, Safa'a Said
    2014 9th International Conference on Informatics and Systems (INFOS), 2014,
  • [7] Efficient real-time trajectory tracking
    Ralph Lange
    Frank Dürr
    Kurt Rothermel
    The VLDB Journal , 2011, 20 : 671 - 694
  • [8] Efficient real-time trajectory tracking
    Lange, Ralph
    Duerr, Frank
    Rothermel, Kurt
    VLDB JOURNAL, 2011, 20 (05): : 671 - 694
  • [9] Patterns for Distributed Real-Time Stream Processing
    Basanta-Val, Pablo
    Fernandez-Garcia, Norberto
    Sanchez-Fernandez, Luis
    Arias-Fisteus, Jesus
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (11) : 3243 - 3257
  • [10] Framework for analyzing the real-time data stream
    Li, Qinghua
    Chen, Qiuxia
    Jiang, Shengyi
    Jisuanji Gongcheng/Computer Engineering, 2005, 31 (16): : 59 - 60