Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

被引:82
|
作者
Lwakatare, Lucy Ellen [1 ]
Raj, Aiswarya [1 ]
Crnkovic, Ivica [1 ]
Bosch, Jan [1 ]
Olsson, Helena Holmstrom [2 ]
机构
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Horselgagen 11, S-41296 Gothenburg, Sweden
[2] Malmo Univ, Dept Comp Sci & Media Technol, Nordenskioldsgatan 1, S-21119 Malmo, Sweden
关键词
Machine learning systems; Software engineering; Industrial settings; Challenges; Solutions; SLR;
D O I
10.1016/j.infsof.2020.106368
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background : Developing and maintaining large scale machine learning (ML) based software systems in an in-dustrial setting is challenging. There are no well-established development guidelines, but the literature contains reports on how companies develop and maintain deployed ML-based software systems. Objective : This study aims to survey the literature related to development and maintenance of large scale ML -based systems in industrial settings in order to provide a synthesis of the challenges that practitioners face. In addition, we identify solutions used to address some of these challenges. Method : A systematic literature review was conducted and we identified 72 papers related to development and maintenance of large scale ML-based software systems in industrial settings. The selected articles were qualita-tively analyzed by extracting challenges and solutions. The challenges and solutions were thematically synthe-sized into four quality attributes: adaptability, scalability, safety and privacy. The analysis was done in relation to ML workflow, i.e. data acquisition, training, evaluation, and deployment. Results : We identified a total of 23 challenges and 8 solutions related to development and maintenance of large scale ML-based software systems in industrial settings including six different domains. Challenges were most often reported in relation to adaptability and scalability. Safety and privacy challenges had the least reported solutions. Conclusion : The development and maintenance on large-scale ML-based systems in industrial settings introduce new challenges specific for ML, and for the known challenges characteristic for these types of systems, require new methods in overcoming the challenges. The identified challenges highlight important concerns in ML system development practice and the lack of solutions point to directions for future research.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Continual Learning for Real-World Autonomous Systems: Algorithms, Challenges and Frameworks
    Khadija Shaheen
    Muhammad Abdullah Hanif
    Osman Hasan
    Muhammad Shafique
    Journal of Intelligent & Robotic Systems, 2022, 105
  • [42] machine learning applications using real-world data: A literature review
    Adair, Nicholas
    Icten, Zeynep
    Friedman, Mark
    Menzin, Joseph
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 339 - 339
  • [43] Evaluating the Cybersecurity Risk of Real-world, Machine Learning Production Systems
    Bitton, Ron
    Maman, Nadav
    Singh, Inderjeet
    Momiyama, Satoru
    Elovici, Yuval
    Shabtai, Asaf
    ACM COMPUTING SURVEYS, 2023, 55 (09)
  • [44] Improved tomographic reconstruction of large-scale real-world data by filter optimization
    Pelt, Daniel M.
    De Andrade, Vincent
    ADVANCED STRUCTURAL AND CHEMICAL IMAGING, 2016, 2
  • [45] Comparing Large-Scale Global Optimization Competition winners in a real-world problem
    Molina, Daniel
    Nesterenko, Arthur R.
    LaTorre, Antonio
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 359 - 365
  • [46] Large-Scale Analysis on Anti-Analysis Techniques in Real-World Malware
    Kim, Minho
    Cho, Haehyun
    Yi, Jeong Hyun
    IEEE ACCESS, 2022, 10 : 75802 - 75815
  • [47] Large-Scale Parallelization of Partial Evaluations in Evolutionary Algorithms for Real-World Problems
    Bouter, Anton
    Alderliesten, Tanja
    Bel, Arjan
    Witteveen, Cees
    Bosman, Peter A. N.
    GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2018, : 1199 - 1206
  • [48] An Efficient Greedy Algorithm for Real-World Large-Scale Electric Vehicle Charging
    Hegele, Marius
    Metzler, Philipp
    Beichter, Sebastian
    Wiegel, Friedrich
    Hagenmeyer, Veit
    PROCEEDINGS OF THE 2023 THE 14TH ACM INTERNATIONAL CONFERENCE ON FUTURE ENERGY SYSTEMS, E-ENERGY 2023, 2023, : 415 - 426
  • [49] Efficient large-scale terrain rendering method for real-world game simulation
    Kang, Dong-Soo
    Kim, Yun-Jin
    Shin, Byeong-Seok
    TECHNOLOGIES FOR E-LEARNING AND DIGITAL ENTERTAINMENT, PROCEEDINGS, 2006, 3942 : 597 - 605
  • [50] A Heuristic Algorithm for solving a large-scale real-world territory design problem
    Zhou, Lin
    Zhen, Lu
    Baldacci, Roberto
    Boschetti, Marco
    Dai, Ying
    Lim, Andrew
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2021, 103