Efficient Algorithms for Range Mode Queries in the Big Data Era

Cited by: 3
Authors
Karras, Christos [1]
Theodorakopoulos, Leonidas [2]
Karras, Aristeidis [1]
Krimpas, George A. [1]
Affiliations
[1] Univ Patras, Comp Engn & Informat Dept, Rion 26504, Greece
[2] Univ Patras, Dept Management Sci & Technol, Patras 26334, Greece
Keywords
data structures; algorithms; RAM; range mode queries; big data; internal audit; data analytics
DOI
10.3390/info15080450
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The mode is a fundamental descriptive statistic in data analysis, signifying the most frequent element within a dataset. The range mode query (RMQ) problem expands upon this concept by preprocessing an array A containing n natural numbers. This allows for the swift determination of the mode within any subarray A[a..b], thus optimizing the computation of the mode for a multitude of range queries. The efficacy of this process bears considerable importance in data analytics and retrieval across diverse platforms, including but not limited to online shopping experiences and financial auditing systems. This study is dedicated to exploring and benchmarking different algorithms and data structures designed to tackle the RMQ problem. The goal is to not only address the theoretical aspects of RMQ but also to provide practical solutions that can be applied in real-world scenarios, such as the optimization of an online shopping platform's understanding of customer preferences, enhancing the efficiency and effectiveness of data retrieval in large datasets.
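To make the problem statement in the abstract concrete, here is a minimal Python sketch of two ways to answer a range mode query on A[a..b] (0-based, inclusive): a direct count, and a square-root decomposition that preprocesses the array once. It illustrates the general technique under stated assumptions and is not a reproduction of the algorithms benchmarked in the study. The names naive_range_mode and BlockRangeMode are hypothetical, and breaking ties toward the smallest value is an assumption made here so that both versions return the same answer.

```python
from bisect import bisect_left, bisect_right
from collections import Counter, defaultdict
from math import isqrt


def naive_range_mode(A, a, b):
    """Return (mode, frequency) of A[a..b], scanning the whole range."""
    counts = Counter(A[a:b + 1])
    best = max(counts.values())
    # Assumption: ties are broken toward the smallest value.
    return min(v for v, c in counts.items() if c == best), best


class BlockRangeMode:
    """Hypothetical sqrt-decomposition structure for a non-empty array."""

    def __init__(self, A):
        self.A = list(A)
        n = len(self.A)
        self.s = max(1, isqrt(n))          # block size, about sqrt(n)
        self.pos = defaultdict(list)       # value -> sorted positions
        for i, v in enumerate(self.A):
            self.pos[v].append(i)
        nb = (n + self.s - 1) // self.s    # number of blocks
        # span[i][j - i] = (mode, frequency) of blocks i..j taken together
        self.span = []
        for i in range(nb):
            counts = Counter()
            best_val, best_freq = None, 0
            row = []
            for k in range(i * self.s, n):
                v = self.A[k]
                counts[v] += 1
                c = counts[v]
                if c > best_freq or (c == best_freq and v < best_val):
                    best_val, best_freq = v, c
                if (k + 1) % self.s == 0 or k == n - 1:
                    row.append((best_val, best_freq))  # end of a block
            self.span.append(row)

    def _freq(self, v, a, b):
        """Occurrences of v in A[a..b], by binary search on its positions."""
        p = self.pos[v]
        return bisect_right(p, b) - bisect_left(p, a)

    def query(self, a, b):
        s = self.s
        bi, bj = a // s, b // s
        if bj - bi <= 1:
            # Range covers fewer than 2s elements: scan it directly.
            return naive_range_mode(self.A, a, b)
        # The answer is either the mode of the fully covered blocks
        # bi+1..bj-1 or a value that occurs in one of the two partial ends.
        candidates = {self.span[bi + 1][bj - bi - 2][0]}
        candidates.update(self.A[a:(bi + 1) * s])
        candidates.update(self.A[bj * s:b + 1])
        best = min(candidates, key=lambda v: (-self._freq(v, a, b), v))
        return best, self._freq(best, a, b)


A = [1, 2, 2, 3, 1, 1, 2, 3, 3, 3]
rm = BlockRangeMode(A)
print(rm.query(0, 9))   # (3, 4): the value 3 occurs four times in A[0..9]
print(rm.query(0, 4))   # (1, 2): 1 and 2 both occur twice, tie goes to 1
```

In this sketch the direct count costs time proportional to the range length per query, while the block version inspects roughly 2*sqrt(n) candidates with one binary search each, at the price of a preprocessing pass that fills a table with about n entries. That trade-off between preprocessing space and query time is the kind of design choice the abstract refers to.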
Pages: 37