A MACHINE LEARNING BASED DECISION SUPPORT FRAMEWORK FOR BIG DATA PIPELINE MODELING AND DESIGN

Authors
Dhaouadi, Asma [1, 2]
Bousselmi, Khadija [3]
Monnet, Sebastien [1]
Gammoud, Mohamed Mohsen [4]
Hammoudi, Slimane [5]
Affiliations
[1] Savoie Mont Blanc Univ, LISTIC, Chambery, France
[2] Univ Tunis El Manar, RIADI, Tunis, Tunisia
[3] Savoie Mont Blanc Univ, LISTIC, IUT, Chambery, France
[4] ISAM La Mannouba, RIADI Lab, Tunis, Tunisia
[5] ESEO TECH Angers, ERIS, Angers, France
Keywords
Big data; Data-warehousing modeling; Modeling assistance; Tools and technologies; ML methods
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The data warehousing process requires an architectural revolution to meet big-data challenges and to accommodate new data sources, such as social networks, recommendation systems, smart cities, and the web, in order to extract value from shared data. In this respect, the pipeline-modeling community concerned with the acquisition, storage, and processing of data for analysis has produced a wide range of technological solutions, which themselves present significant challenges and difficulties. More specifically, choosing the most appropriate tool for the user's specific business needs and ensuring interoperability between the different tools have become primary challenges. From this perspective, we propose in this paper a new interactive framework based on machine learning (ML) techniques to assist experts in modeling a customized pipeline for data warehousing. More precisely, we (i) analyze the experts' requirements and the characteristics of the data to be processed, then (ii) propose the architecture best suited to their requirements from a multitude of specific architectures instantiated from a generic one, (iii) using several ML methods to predict the most suitable tool for each phase and task within the architecture. Additionally, our framework is validated through two real-world use cases and user feedback.
Pages: 306-318
Page count: 13