A MACHINE LEARNING BASED DECISION SUPPORT FRAMEWORK FOR BIG DATA PIPELINE MODELING AND DESIGN

被引:0
|
作者
Dhaouadi, Asma [1 ,2 ]
Bousselmi, Khadija [3 ]
Monnet, Sebastien [1 ]
Gammoud, Mohamed Mohsen [4 ]
Hammoudi, Slimane [5 ]
机构
[1] Savoie Mont Blanc Univ, LISTIC, Chambery, France
[2] Univ Tunis El Manar, RIADI, Tunis, Tunisia
[3] Savoie Mont Blanc Univ, LISTIC, IUT, Chambery, France
[4] ISAM La Mannouba, RIADI Lab, Tunis, Tunisia
[5] ESEO TECH Angers, ERIS, Angers, France
关键词
Big data; Data-warehousing modeling; Modeling assistance; Tools and technologies; ML methods;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The data warehousing process requires an architectural revolution to settle big-data challenges and address new data sources, such as social networks, recommendation systems, smart cities and the web to extract value from shared data. In this respect, the pipeline-modeling community for the acquisition, storage and processing of data for analysis purposes is enacting a wide range of technological solutions that present significant challenges and difficulties. More specifically, the choice of the most appropriate tool for the user's specific business needs and the interoperability between the different tools have become primary challenges. From this perspective, we propose in this paper a new interactive framework based on machine learning (ML) techniques to assist experts in the process of modeling a customized pipeline for data warehousing. More precisely, we elaborate first (i) an analysis of the experts' requirements and the characteristics of the data to be processed, then (ii) we propose se the most appropriate architecture to their requirements from a multitude of specific architectures instantiated from a generic one, by using (iii) several ML methods to predict the most suitable tool for each phase and task within the architecture. Additionally, our framework is validated through two real-world use cases and user feedback.
引用
收藏
页码:306 / 318
页数:13
相关论文
共 50 条
  • [41] Design of a Smart Teaching English Translation System Based on Big Data Machine Learning
    Zhang C.
    Yu T.
    Gao Y.
    Tham M.L.
    International Journal of Web-Based Learning and Teaching Technologies, 2023, 18 (02)
  • [42] Framework for Mobile Internet of Things Security Monitoring Based on Big Data Processing and Machine Learning
    Kotenko, Igor
    Saenko, Igor
    Branitskiy, Alexander
    IEEE ACCESS, 2018, 6 : 72714 - 72723
  • [43] A system of systems framework for autonomy with big data analytic and machine learning
    Jamshidi, Mo M.
    9TH INTERNATIONAL CONFERENCE ON THEORY AND APPLICATION OF SOFT COMPUTING, COMPUTING WITH WORDS AND PERCEPTION, ICSCCW 2017, 2017, 120 : 6 - 6
  • [44] Intelligent financial decision support system based on big data
    Tong, Danna
    Tian, Guixian
    JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [45] A machine learning pipeline for extracting decision-support features from traffic scenes
    Fraga, Vitor A.
    Schreiber, Lincoln V.
    da Silva, Marco Antonio C.
    Kunst, Rafael
    Barbosa, Jorge L. V.
    Ramos, Gabriel de O.
    AI COMMUNICATIONS, 2024, 37 (02) : 189 - 201
  • [46] A Hybrid Support Vector Machine Algorithm for Big Data Heterogeneity Using Machine Learning
    Ul Ahsaan, Shafqat
    Kaur, Harleen
    Mourya, Ashish Kumar
    Naaz, Sameena
    SYMMETRY-BASEL, 2022, 14 (11):
  • [47] Big Data Decision Support System
    Ma, Tian J.
    ProQuest Dissertations and Theses Global, 2022,
  • [48] Machine learning based integrity decision management of pipeline corrosion clusters
    Mensah, Abraham
    Sriramula, Srinivas
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 795 - 799
  • [49] Machine Learning in Big Data
    Wang, Lidong
    Alexander, Cheryl Ann
    INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2016, 1 (02) : 52 - 61
  • [50] Machine Learning on Big Data
    Condie, Tyson
    Mineiro, Paul
    Polyzotis, Neoklis
    Weimer, Markus
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2013, : 1242 - 1244