Large-scale machine learning systems in real-world industrial settings: A review of challenges and solutions

Cited by: 82
Authors
Lwakatare, Lucy Ellen [1 ]
Raj, Aiswarya [1 ]
Crnkovic, Ivica [1 ]
Bosch, Jan [1 ]
Olsson, Helena Holmstrom [2 ]
Affiliations
[1] Chalmers Univ Technol, Dept Comp Sci & Engn, Horselgagen 11, S-41296 Gothenburg, Sweden
[2] Malmo Univ, Dept Comp Sci & Media Technol, Nordenskioldsgatan 1, S-21119 Malmo, Sweden
Keywords
Machine learning systems; Software engineering; Industrial settings; Challenges; Solutions; SLR
DOI
10.1016/j.infsof.2020.106368
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Background: Developing and maintaining large-scale machine learning (ML) based software systems in an industrial setting is challenging. There are no well-established development guidelines, but the literature contains reports on how companies develop and maintain deployed ML-based software systems. Objective: This study surveys the literature on the development and maintenance of large-scale ML-based systems in industrial settings in order to synthesize the challenges that practitioners face. In addition, we identify solutions used to address some of these challenges. Method: A systematic literature review was conducted, and we identified 72 papers related to the development and maintenance of large-scale ML-based software systems in industrial settings. The selected articles were qualitatively analyzed by extracting challenges and solutions. The challenges and solutions were thematically synthesized into four quality attributes: adaptability, scalability, safety, and privacy. The analysis was done in relation to the ML workflow, i.e. data acquisition, training, evaluation, and deployment. Results: We identified a total of 23 challenges and 8 solutions related to the development and maintenance of large-scale ML-based software systems in industrial settings across six different domains. Challenges were most often reported in relation to adaptability and scalability. Safety and privacy challenges had the fewest reported solutions. Conclusion: The development and maintenance of large-scale ML-based systems in industrial settings introduce new challenges specific to ML, and the known challenges characteristic of these types of systems require new methods to overcome them. The identified challenges highlight important concerns in ML system development practice, and the lack of solutions points to directions for future research.
Pages: 17
Related papers
50 in total
  • [21] Large-scale classification of traffic signs under real-world conditions
    Hazelhoff, Lykele
    Creusen, Ivo
    van de Wouwa, Dennis
    de With, Peter H. N.
    MULTIMEDIA ON MOBILE DEVICES 2012 AND MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS VI, 2012, 8304
  • [22] Supervised temporal link prediction in large-scale real-world networks
    Gerrit Jan de Bruin
    Cor J. Veenman
    H. Jaap van den Herik
    Frank W. Takes
    Social Network Analysis and Mining, 2021, 11
  • [23] Individual differences in performance on a large-scale, real-world wayfinding task
    Malinowski, JC
    Gillespie, WT
    JOURNAL OF ENVIRONMENTAL PSYCHOLOGY, 2001, 21 (01) : 73 - 82
  • [24] LEARNINGS FROM THE LARGE-SCALE REAL-WORLD DEPLOYMENT OF WEARABLES IN CHILDREN
    Kim, Ethan H.
    Jacobs, Grace R.
    Pratap, Abhishek
    ANNALS OF BEHAVIORAL MEDICINE, 2022, 56 (SUPP 1) : S315 - S315
  • [25] Towards Quantum Large-Scale Password Guessing on Real-World Distributions
    Duermuth, Markus
    Golla, Maximilian
    Markert, Philipp
    May, Alexander
    Schlieper, Lars
    CRYPTOLOGY AND NETWORK SECURITY, CANS 2021, 2021, 13099 : 412 - 431
  • [26] Large-scale deployment of electric taxis in Beijing: A real-world analysis
    Zou, Yuan
    Wei, Shouyang
    Sun, Fengchun
    Hu, Xiaosong
    Shiao, Yaojung
    ENERGY, 2016, 100 : 25 - 39
  • [27] Supervised temporal link prediction in large-scale real-world networks
    de Bruin, Gerrit Jan
    Veenman, Cor J.
    van den Herik, H. Jaap
    Takes, Frank W.
    SOCIAL NETWORK ANALYSIS AND MINING, 2021, 11 (01)
  • [28] Measurement of Malware Family Classification on a Large-Scale Real-World Dataset
    Wang, Qinqin
    Yan, Hanbing
    Zhao, Chang
    Mei, Rui
    Han, Zhihui
    Zhou, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1390 - 1397
  • [29] Reverse engineering: the challenge of large-scale real-world educational projects
    Bothe, K
    14TH CONFERENCE ON SOFTWARE ENGINEERING EDUCATION AND TRAINING, PROCEEDINGS, 2001, : 115 - 126
  • [30] The psychosis analysis in real-world on a cohort of large-scale patients with schizophrenia
    Tan, Wenyan
    Lin, Haicheng
    Lei, Baoxin
    Ou, Aihua
    He, Zehui
    Yang, Ning
    Jia, Fujun
    Weng, Heng
    Hao, Tianyong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)