Network Support for High-Performance Distributed Machine Learning

被引:6
|
作者
Malandrino, Francesco [1 ,2 ]
Chiasserini, Carla Fabiana [1 ,3 ]
Molner, Nuria [4 ,5 ]
de la Oliva, Antonio [6 ]
机构
[1] CNR, IEIIT, I-10129 Turin, Italy
[2] CNIT, I-43124 Parma, Italy
[3] Politecn Torino, Dept Elect & Telecommun, I-10129 Turin, Italy
[4] Univ Carlos III Madrid, IMDEA Networks Inst, Madrid 28903, Spain
[5] Univ Politecn Valencia iTEAM UPV, Inst Univ Telecomunicac & Aplicac Multimedia, Valencia 46022, Spain
[6] Univ Carlos III Madrid, Dept Telemat Engn, Madrid 28903, Spain
关键词
Task analysis; Topology; Network topology; Data models; Costs; Machine learning; Training; Network orchestration; machine learning; edge computing; EDGE;
D O I
10.1109/TNET.2022.3189077
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The traditional approach to distributed machine learning is to adapt learning algorithms to the network, e.g., reducing updates to curb overhead. Networks based on intelligent edge, instead, make it possible to follow the opposite approach, i.e., to define the logical network topology around the learning task to perform, so as to meet the desired learning performance. In this paper, we propose a system model that captures such aspects in the context of supervised machine learning, accounting for both learning nodes (that perform computations) and information nodes (that provide data). We then formulate the problem of selecting (i) which learning and information nodes should cooperate to complete the learning task, and (ii) the number of epochs to run, in order to minimize the learning cost while meeting the target prediction error and execution time. After proving important properties of the above problem, we devise an algorithm, named DoubleClimb, that can find a 1 + 1/vertical bar I vertical bar-competitive solution (with I being the set of information nodes), with cubic worst-case complexity. Our performance evaluation, leveraging a real-world network topology and considering both classification and regression tasks, also shows that DoubleClimb closely matches the optimum, outperforming state-of-the-art alternatives.
引用
收藏
页码:264 / 278
页数:15
相关论文
共 50 条
  • [31] Diagnosis of epilepsy by machine learning of high-performance plasma metabolic fingerprinting
    Chen, Xiaonan
    Yu, Wendi
    Zhao, Yinbing
    Ji, Yuxi
    Qi, Ziheng
    Guan, Yangtai
    Wan, Jingjing
    Hao, Yong
    TALANTA, 2024, 277
  • [32] Smart-MLlib: A High-Performance Machine-Learning Library
    Siegal, David
    Guo, Jia
    Agrawal, Cagan
    2016 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2016, : 336 - 345
  • [33] Applications of machine learning method in high-performance materials design: a review
    Yuan, Junhao
    Li, Zhen
    Yang, Yujia
    Yin, Anyi
    Li, Wenjie
    Sun, Dan
    Wang, Qing
    JOURNAL OF MATERIALS INFORMATICS, 2024, 4 (03):
  • [34] MACHINE LEARNING AND SIMULATION BASED TEMPERATURE PREDICTION ON HIGH-PERFORMANCE PROCESSORS
    Knox, Carlton
    Yuan, Zihao
    Coskun, Ayse K.
    PROCEEDINGS OF ASME 2022 INTERNATIONAL TECHNICAL CONFERENCE AND EXHIBITION ON PACKAGING AND INTEGRATION OF ELECTRONIC AND PHOTONIC MICROSYSTEMS, INTERPACK2022, 2022,
  • [35] Silas: A high-performance machine learning foundation for logical reasoning and verification
    Bride, Hadrien
    Cai, Cheng-Hao
    Dong, Jie
    Dong, Jin Song
    Hou, Zhe
    Mirjalili, Seyedali
    Sun, Jing
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [36] Understanding and Designing a High-Performance Ultrafiltration Membrane Using Machine Learning
    Gao, Haiping
    Zhong, Shifa
    Dangayach, Raghav
    Chen, Yongsheng
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2023, 57 (46) : 17831 - 17840
  • [37] High-performance commercial data mining: A multistrategy machine learning application
    Hsu, WH
    Welge, M
    Redman, T
    Clutter, D
    DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (04) : 361 - 391
  • [38] Special-purpose parallel architectures for high-performance machine learning
    Battiti, R
    Lee, P
    Sartori, A
    Tecchiolli, G
    HIGH-PERFORMANCE COMPUTING AND NETWORKING, 1995, 919 : 944 - 944
  • [39] High-Performance Commercial Data Mining: A Multistrategy Machine Learning Application
    William H. Hsu
    Michael Welge
    Tom Redman
    David Clutter
    Data Mining and Knowledge Discovery, 2002, 6 : 361 - 391
  • [40] Interpretable machine learning for developing high-performance organic solar cells
    Abadi, Elyas Abbasi Jannat
    Sahu, Harikrishna
    Javadpour, Seyed Morteza
    Goharimanesh, Masoud
    MATERIALS TODAY ENERGY, 2022, 25