Filter Large-scale Engine Data using Apache Spark

被引:0
|
作者
Pirozzi, Donato [1 ]
Scarano, Vittorio [1 ]
Begg, Steven [2 ]
De Sercey, Guillaume [2 ]
Fish, Andrew [2 ]
Harvey, Andrew [2 ]
机构
[1] Univ Salerno, Dipartimento Informat, Fisciano, Italy
[2] Univ Brighton, Sch Comp Engn & Math, Brighton BN2 4AT, E Sussex, England
关键词
Data Visualisation; Data Exploration; Combustion Engine Experimental Data; Big-data Technologies;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper introduces a minimum viable software product to filter large datasets of engine data recorded during laboratory experiments of combustion engines. The aim is to support analysts in the identification and analysis of specific physical phenomenon within hours of recorded engine experimental data. Specifically, the tool has been designed considering the use case of identifying Low Speed Pre-Ignition events. This work describes the tool's graphical user interface and its scalable architecture based on mainstream web and big-data technologies as well as the practical application to pre-ignition events identification. The paper provides details on the architecture's performance, providing evidence of its scalability by increasing the number of available computing workers.
引用
收藏
页码:1300 / 1305
页数:6
相关论文
共 50 条
  • [1] Large-Scale Data Pollution with Apache Spark
    Hildebrandt, Kai
    Panse, Fabian
    Wilcke, Niklas
    Ritter, Norbert
    IEEE TRANSACTIONS ON BIG DATA, 2020, 6 (02) : 396 - 411
  • [2] Processing large-scale data with Apache Spark
    Ko, Seyoon
    Won, Joong-Ho
    KOREAN JOURNAL OF APPLIED STATISTICS, 2016, 29 (06) : 1077 - 1094
  • [3] Large-Scale Network Embedding in Apache Spark
    Lin, Wenqing
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 3271 - 3279
  • [4] Large Scale Distributed Data Science using Apache Spark
    Shanahan, James G.
    Dai, Liang
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 2323 - 2324
  • [5] A Large-Scale Sentiment Data Classification for Online Reviews Under Apache Spark
    Al-Saqqa, Samar
    Al-Naymat, Ghazi
    Awajan, Arafat
    9TH INTERNATIONAL CONFERENCE ON EMERGING UBIQUITOUS SYSTEMS AND PERVASIVE NETWORKS (EUSPN-2018) / 8TH INTERNATIONAL CONFERENCE ON CURRENT AND FUTURE TRENDS OF INFORMATION AND COMMUNICATION TECHNOLOGIES IN HEALTHCARE (ICTH-2018), 2018, 141 : 183 - 189
  • [6] Large-scale text processing pipeline with Apache Spark
    Svyatkovskiy, A.
    Imai, K.
    Kroeger, M.
    Shiraito, Y.
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3928 - 3935
  • [7] A Parallel Fast Fourier Transform Algorithm for Large-Scale Signal Data Using Apache Spark in Cloud
    Yang, Cheng
    Bao, Weidong
    Zhu, Xiaomin
    Wang, Ji
    Xiao, Wenhua
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2018, PT III, 2018, 11336 : 293 - 310
  • [8] Supervised Papers Classification on Large-Scale High-Dimensional Data with Apache Spark
    Akritidis, Leonidas
    Bozanis, Panayiotis
    Fevgas, Athanasios
    2018 16TH IEEE INT CONF ON DEPENDABLE, AUTONOM AND SECURE COMP, 16TH IEEE INT CONF ON PERVAS INTELLIGENCE AND COMP, 4TH IEEE INT CONF ON BIG DATA INTELLIGENCE AND COMP, 3RD IEEE CYBER SCI AND TECHNOL CONGRESS (DASC/PICOM/DATACOM/CYBERSCITECH), 2018, : 987 - 994
  • [9] GeoMatch: Efficient Large-Scale Map Matching on Apache Spark
    Zeidan, Ayman
    Lagerspetz, Eemil
    Zhao, Kai
    Nurmi, Petteri
    Tarkoma, Sasu
    Vo, Huy T.
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 384 - 391
  • [10] Particle Swarm Optimization for Large-Scale Clustering on Apache Spark
    Sherar, Matthew
    Zulkernine, Farhana
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 801 - 808