NAS-OoD: Neural Architecture Search for Out-of-Distribution Generalization

Citations: 15
Authors
Bai, Haoyue [1]
Zhou, Fengwei [2]
Hong, Lanqing [2]
Ye, Nanyang [3]
Chan, S.-H. Gary [1]
Li, Zhenguo [2]
Affiliations
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Huawei Noahs Ark Lab, Hong Kong, Peoples R China
[3] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
DOI
10.1109/ICCV48922.2021.00821
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Recent advances on Out-of-Distribution (OoD) generalization reveal the robustness of deep learning models against distribution shifts. However, existing works focus on OoD algorithms, such as invariant risk minimization, domain generalization, or stable learning, without considering the influence of deep model architectures on OoD generalization, which may lead to sub-optimal performance. Neural Architecture Search (NAS) methods search for an architecture based on its performance on the training data, which may result in poor generalization for OoD tasks. In this work, we propose robust Neural Architecture Search for OoD generalization (NAS-OoD), which optimizes the architecture with respect to its performance on generated OoD data by gradient descent. Specifically, a data generator is learned to synthesize OoD data by maximizing losses computed by different neural architectures, while the goal for architecture search is to find the optimal architecture parameters that minimize the synthetic OoD data losses. The data generator and the neural architecture are jointly optimized in an end-to-end manner, and the minimax training process effectively discovers robust architectures that generalize well for different distribution shifts. Extensive experimental results show that NAS-OoD achieves superior performance on various OoD generalization benchmarks with deep models having far fewer parameters. In addition, on a real industry dataset, the proposed NAS-OoD method reduces the error rate by more than 70% compared with the state-of-the-art method, demonstrating the proposed method's practicality for real applications.
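The minimax idea in the abstract (a generator that maximizes the loss while the architecture parameters minimize it) can be illustrated with a deliberately tiny sketch. Everything below is a hypothetical toy, not the paper's implementation: the "generator" is a single scalar input shift, the "architecture" is one mixing parameter `alpha` between two hand-written candidate operations, and the names `loss_and_grads` and `minimax_search` are made up for this illustration.

```python
import math


def sigmoid(a):
    return 1.0 / (1.0 + math.exp(-a))


def loss_and_grads(alpha, shift, xs):
    """Toy setup (assumption): the true function is y = 2x.

    Candidate op A predicts 2x (correct everywhere); candidate op B
    predicts 2x + x^2 (only accurate near x = 0). The architecture mixes
    them with weight w = sigmoid(alpha), so the residual on a shifted
    input x' = x + shift is (1 - w) * x'^2.
    """
    w = sigmoid(alpha)
    shifted = [x + shift for x in xs]
    m4 = sum(v ** 4 for v in shifted) / len(shifted)
    m3 = sum(v ** 3 for v in shifted) / len(shifted)
    loss = (1.0 - w) ** 2 * m4                     # mean squared residual
    grad_alpha = -2.0 * w * (1.0 - w) ** 2 * m4    # d loss / d alpha
    grad_shift = 4.0 * (1.0 - w) ** 2 * m3         # d loss / d shift
    return loss, grad_alpha, grad_shift


def minimax_search(xs, steps=500, lr_arch=0.5, lr_gen=0.1, max_shift=1.0):
    alpha, shift = 0.0, 0.1   # start with equal mixing and a small shift
    history = []
    for _ in range(steps):
        loss, _, g_shift = loss_and_grads(alpha, shift, xs)
        history.append(loss)
        # Generator step: gradient ascent on the loss, with a bounded shift.
        shift = max(-max_shift, min(max_shift, shift + lr_gen * g_shift))
        # Architecture step: gradient descent on the loss over generated data.
        _, g_alpha, _ = loss_and_grads(alpha, shift, xs)
        alpha -= lr_arch * g_alpha
    return sigmoid(alpha), shift, history


if __name__ == "__main__":
    w, shift, history = minimax_search([-1.0, -0.5, 0.0, 0.5, 1.0])
    print(f"weight on op A: {w:.3f}, final shift: {shift:.3f}")
    print(f"loss: {history[0]:.4f} -> {history[-1]:.4f}")
```

In this toy, the generator pushes the inputs away from the region where op B looks acceptable, and the architecture step responds by moving the mixing weight toward op A, the operation that stays accurate under the shift. The abstract describes a learned data generator and a neural architecture search space; the scalar shift and two-operation mixture here only stand in for those components.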
Pages: 8300-8309
Number of pages: 10
Related papers
50 in total
  • [1] Out-of-Distribution (OOD) Detection and Generalization Improved by Augmenting Adversarial Mixup Samples
    Gwon, Kyungpil
    Yoo, Joonhyuk
    ELECTRONICS, 2023, 12 (06)
  • [2] OoD-Bench: Quantifying and Understanding Two Dimensions of Out-of-Distribution Generalization
    Ye, Nanyang
    Li, Kaican
    Bai, Haoyue
    Yu, Runpeng
    Hong, Lanqing
    Zhou, Fengwei
    Li, Zhenguo
    Zhu, Jun
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7937 - 7948
  • [3] OOD-GNN: Out-of-Distribution Generalized Graph Neural Network
    Li, Haoyang
    Wang, Xin
    Zhang, Ziwei
    Zhu, Wenwu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 7328 - 7340
  • [4] Certifiable Out-of-Distribution Generalization
    Ye, Nanyang
    Zhu, Lin
    Wang, Jia
    Zeng, Zhaoyu
    Shao, Jiayao
    Peng, Chensheng
    Pan, Bikang
    Li, Kaican
    Zhu, Jun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10927 - 10935
  • [5] OOD ATTACK: GENERATING OVERCONFIDENT OUT-OF-DISTRIBUTION EXAMPLES TO FOOL DEEP NEURAL CLASSIFIERS
    Tang, Keke
    Cai, Xujian
    Peng, Weilong
    Li, Shudong
    Wang, Wenping
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1260 - 1264
  • [6] Out-of-Distribution Generalization by Neural-Symbolic Joint Training
    Liu, Anji
    Xu, Hongming
    Van den Broeck, Guy
    Liang, Yitao
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 12252 - 12259
  • [7] Functional Indirection Neural Estimator for Better Out-of-distribution Generalization
    Pham, Kha
    Le, Hung
    Ngo, Man
    Tran, Truyen
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [8] Textual out-of-distribution (OOD) detection for LLM quality assurance
    Ouyang, Tinghui
    Seo, Yoshiki
    Echizen, Isao
    KNOWLEDGE-BASED SYSTEMS, 2025, 310
  • [9] Out-of-Distribution (OOD) Detection Based on Deep Learning: A Review
    Cui, Peng
    Wang, Jinjia
    ELECTRONICS, 2022, 11 (21)
  • [10] Out-of-Distribution Generalization in Kernel Regression
    Canatar, Abdulkadir
    Bordelon, Blake
    Pehlevan, Cengiz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34