Hierarchical memory-constrained operator scheduling of neural architecture search networks

被引:6
|
作者
Wang, Zihan [1 ]
Wan, Chengcheng [2 ]
Chen, Yuting [1 ]
Lin, Ziyi [3 ]
Jiang, He [4 ]
Qiao, Lei [5 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Univ Chicago, Chicago, IL 60637 USA
[3] Alibaba Grp Inc, Shanghai, Peoples R China
[4] Dalian Univ Technol, Dalian, Peoples R China
[5] Beijing Inst Control Engn, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
10.1145/3489517.3530472
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Architecture Search (NAS) is widely used in industry, searching for neural networks meeting task requirements. Meanwhile, it faces a challenge in scheduling networks satisfying memory constraints. This paper proposes HMCOS that performs hierarchical memory-constrained operator scheduling of NAS networks: given a network, HMCOS constructs a hierarchical computation graph and employs an iterative scheduling algorithm to progressively reduce peak memory footprints. We evaluate HMCOS against RPO and Serenity (two popular scheduling techniques). The results show that HMCOS outperforms existing techniques in supporting more NAS networks, reducing 8.7 similar to 42.4% of peak memory footprints, and achieving 137 similar to 283x of speedups in scheduling.
引用
收藏
页码:493 / 498
页数:6
相关论文
共 50 条
  • [31] MicroNAS for memory and latency constrained hardware aware neural architecture search in time series classification on microcontrollers
    King, Tobias
    Zhou, Yexu
    Roeddiger, Tobias
    Beigl, Michael
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [32] Neural Architecture Search of SPD Manifold Networks
    Sukthanker, Rhea Sanjay
    Huang, Zhiwu
    Kumar, Suryansh
    Endsjo, Erik Goron
    Wu, Yan
    Van Gool, Luc
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 3002 - 3009
  • [33] Evolutionary Neural Architecture Search for Transferable Networks
    Zhou, Xun
    Liu, Songbai
    Qin, A. K.
    Tan, Kay Chen
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024,
  • [34] Genetic Architecture Search for Binarized Neural Networks
    Chang, Yangyang
    Sobelman, Gerald E.
    Zhou, Xiaofang
    2019 IEEE 13TH INTERNATIONAL CONFERENCE ON ASIC (ASICON), 2019,
  • [35] Efficient Architecture Search for Deep Neural Networks
    Gottapu, Ram Deepak
    Dagli, Cihan H.
    COMPLEX ADAPTIVE SYSTEMS, 2020, 168 : 19 - 25
  • [36] Neural architecture search for resource constrained hardware devices: A survey
    Yang, Yongjia
    Zhan, Jinyu
    Jiang, Wei
    Jiang, Yucheng
    Yu, Antai
    IET CYBER-PHYSICAL SYSTEMS: THEORY & APPLICATIONS, 2023, 8 (03) : 149 - 159
  • [37] Toward Less Constrained Macro-Neural Architecture Search
    Lopes, Vasco
    Alexandre, Luis A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 15
  • [38] Resource-Constrained Neural Architecture Search on Edge Devices
    Lyu, Bo
    Yuan, Hang
    Lu, Longfei
    Zhang, Yunye
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2022, 9 (01): : 134 - 142
  • [39] Neural Architecture Search for Low-Precision Neural Networks
    Wu, Binyi
    Waschneck, Bernd
    Mayr, Christian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 743 - 755
  • [40] Adversarially Robust Neural Architecture Search for Graph Neural Networks
    Xie, Beini
    Chang, Heng
    Zhang, Ziwei
    Wang, Xin
    Wang, Daxin
    Zhang, Zhiqiang
    Ying, Rex
    Zhu, Wenwu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8143 - 8152