A MACHINE LEARNING BASED DECISION SUPPORT FRAMEWORK FOR BIG DATA PIPELINE MODELING AND DESIGN

被引:0
|
作者
Dhaouadi, Asma [1 ,2 ]
Bousselmi, Khadija [3 ]
Monnet, Sebastien [1 ]
Gammoud, Mohamed Mohsen [4 ]
Hammoudi, Slimane [5 ]
机构
[1] Savoie Mont Blanc Univ, LISTIC, Chambery, France
[2] Univ Tunis El Manar, RIADI, Tunis, Tunisia
[3] Savoie Mont Blanc Univ, LISTIC, IUT, Chambery, France
[4] ISAM La Mannouba, RIADI Lab, Tunis, Tunisia
[5] ESEO TECH Angers, ERIS, Angers, France
关键词
Big data; Data-warehousing modeling; Modeling assistance; Tools and technologies; ML methods;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The data warehousing process requires an architectural revolution to settle big-data challenges and address new data sources, such as social networks, recommendation systems, smart cities and the web to extract value from shared data. In this respect, the pipeline-modeling community for the acquisition, storage and processing of data for analysis purposes is enacting a wide range of technological solutions that present significant challenges and difficulties. More specifically, the choice of the most appropriate tool for the user's specific business needs and the interoperability between the different tools have become primary challenges. From this perspective, we propose in this paper a new interactive framework based on machine learning (ML) techniques to assist experts in the process of modeling a customized pipeline for data warehousing. More precisely, we elaborate first (i) an analysis of the experts' requirements and the characteristics of the data to be processed, then (ii) we propose se the most appropriate architecture to their requirements from a multitude of specific architectures instantiated from a generic one, by using (iii) several ML methods to predict the most suitable tool for each phase and task within the architecture. Additionally, our framework is validated through two real-world use cases and user feedback.
引用
收藏
页码:306 / 318
页数:13
相关论文
共 50 条
  • [21] Open Data Lake to Support Machine Learning on Arctic Big Data
    Olawoyin, Anifat M.
    Leung, Carson K.
    Cuzzocrea, Alfredo
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5215 - 5224
  • [22] Evolvability of Machine Learning-based Systems: An Architectural Design Decision Framework
    Leest, Joran
    Gerostathopoulos, Ilias
    Raibulet, Claudia
    2023 IEEE 20TH INTERNATIONAL CONFERENCE ON SOFTWARE ARCHITECTURE COMPANION, ICSA-C, 2023, : 106 - 110
  • [23] Machine learning-based clinical decision support using laboratory data
    Cubukcu, Hikmet Can
    Topcu, Deniz Ilhan
    Yenice, Sedef
    CLINICAL CHEMISTRY AND LABORATORY MEDICINE, 2024, 62 (05) : 793 - 823
  • [24] Machine learning-based design features decision support tool via customers purchasing data analysis
    Zhang, Jian
    Chu, Xingpeng
    Simeone, Alessandro
    Gu, Peihua
    CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS, 2021, 29 (02): : 124 - 141
  • [25] Machine-learning-based optimization framework to support recovery-based design
    Issa, Omar
    Silva-Lopez, Rodrigo
    Baker, Jackw.
    Burton, Henry V.
    EARTHQUAKE ENGINEERING & STRUCTURAL DYNAMICS, 2023, 52 (11): : 3256 - 3280
  • [26] A Data Mining Framework for Glaucoma Decision Support Based on Optic Nerve Image Analysis Using Machine Learning Methods
    Abidi S.S.R.
    Roy P.C.
    Shah M.S.
    Yu J.
    Yan S.
    Journal of Healthcare Informatics Research, 2018, 2 (4) : 370 - 401
  • [27] Virtual design of urban planning based on GIS big data and machine learning
    Zhu, Bin
    Zhou, Jie
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 6263 - 6273
  • [28] CNN-driven Art Design Decision Support System Based on Big Data
    Lin Y.
    Wang B.
    Fan Z.
    Computer-Aided Design and Applications, 2024, 21 (S21): : 37 - 52
  • [29] A Smart Social Insurance Big Data Analytics Framework Based on Machine Learning Algorithms
    Senousy, Youssef
    Shehab, Abdulaziz
    Hanna, Wael K.
    Riad, Alaa M.
    El-bakry, Hazem A.
    Elkhamisy, Nashaat
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2020, 20 (01) : 95 - 111
  • [30] Design Principles for Machine Learning Based Clinical Decision Support Systems: A Design Science Study
    Sjostrom, Jonas
    Dryselius, Petra
    Nygren, Jens
    Nair, Monika
    Soliman, Amira
    Lundgren, Lina E.
    DESIGN SCIENCE RESEARCH FOR A RESILIENT FUTURE, DESRIST 2024, 2024, 14621 : 109 - 122