SbMBR Tree-A Spatiotemporal Data Indexing and Compression Algorithm for Data Analysis and Mining

被引:1
|
作者
Guan, Runda [1 ]
Wang, Ziyu [1 ]
Pan, Xiaokang [1 ]
Zhu, Rongjie [2 ]
Song, Biao [3 ]
Zhang, Xinchang [4 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp Sci, Nanjing 210044, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Sch Teacher Educ, Nanjing 210044, Peoples R China
[3] Nanjing Univ Informat Sci & Technol, Sch Software, Nanjing 210044, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Dept Sci & Technol, Nanjing 210044, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 19期
基金
美国国家科学基金会;
关键词
spatiotemporal data; lossy compression; data indexing; clustering; IMAGE;
D O I
10.3390/app131910562
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In the field of data analysis and mining, adopting efficient data indexing and compression techniques to spatiotemporal data can significantly reduce computational and storage overhead for the abilities to control the volume of data and exploit the spatiotemporal characteristics. However, traditional lossy compression techniques are hardly suitable due to their inherently random nature. They often impose unpredictable damage to scientific data, which affects the results of data mining and analysis tasks that require certain precision. In this paper, we propose a similarity-based minimum bounding rectangle (SbMBR) tree, a tree-based indexing and compression method, to address the aforementioned problem. Our method can hierarchically select appropriate minimum bounding rectangles according to the given maximum acceptable errors and use the average value contained in each selected MBR to replace the original data to achieve data compression with multi-layer loss control. This paper also provides the corresponding tree construction algorithm and range query processing algorithm for the indexing structure mentioned above. To evaluate the data quality preservation in cross-domain data analysis and mining scenarios, we use mutual information as the estimation metric. Experimental results emphasize the superiority of our method over some of the typical indexing and compression algorithms.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Compression, Indexing, and Retrieval for Massive String Data
    Hon, Wing-Kai
    Shah, Rahul
    Vitter, Jeffrey Scott
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2010, 6129 : 260 - +
  • [22] Lightweight Data Indexing and Compression in External Memory
    Ferragina, Paolo
    Gagie, Travis
    Manzini, Giovanni
    ALGORITHMICA, 2012, 63 (03) : 707 - 730
  • [23] THE DATA MINING ALGORITHM ANALYSIS FOR PERSONALIZED SERVICE
    Zou, Liwu
    Ren, Guangwei
    2012 FOURTH INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION NETWORKING AND SECURITY (MINES 2012), 2012, : 332 - 335
  • [24] Data Mining Technology And The Research And Analysis Of The Algorithm
    Han, Qiyu
    Wang, Panqing
    Wang, Shuo
    PROCEEDINGS OF THE 2016 6TH INTERNATIONAL CONFERENCE ON MACHINERY, MATERIALS, ENVIRONMENT, BIOTECHNOLOGY AND COMPUTER (MMEBC), 2016, 88 : 814 - 818
  • [25] Analysis on algorithm and application of cluster in data mining
    Information Engineering School, Nanchang University, Nanchang 330031, Jiang xi, China
    J. Theor. Appl. Inf. Technol., 1 (416-419):
  • [26] Lightweight Data Indexing and Compression in External Memory
    Paolo Ferragina
    Travis Gagie
    Giovanni Manzini
    Algorithmica, 2012, 63 : 707 - 730
  • [27] Lightweight Data Indexing and Compression in External Memory
    Ferragina, Paolo
    Gagie, Travis
    Manzini, Giovanni
    LATIN 2010: THEORETICAL INFORMATICS, 2010, 6034 : 697 - +
  • [28] Spatiotemporal data mining with cellular automata
    Fu, Karl
    Cai, Yang
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 1, PROCEEDINGS, 2006, 3991 : 1001 - 1004
  • [29] A Survey on Spatiotemporal and Semantic Data Mining
    Yuan, Quan
    Zhang, Chao
    Han, Jiawei
    TRENDS IN SPATIAL ANALYSIS AND MODELLING: DECISION-SUPPORT AND PLANNING STRATEGIES, 2018, 19 : 43 - 57
  • [30] Spatiotemporal Data Mining: A Computational Perspective
    Shekhar, Shashi
    Jiang, Zhe
    Ali, Reem Y.
    Eftelioglu, Emre
    Tang, Xun
    Gunturi, Venkata M. V.
    Zhou, Xun
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2015, 4 (04) : 2306 - 2338