Secure Out-of-Distribution Task Generalization with Energy-Based Models

被引:0
|
作者
Chen, Shengzhuang [1 ]
Huang, Long-Kai [2 ]
Schwarz, Jonathan Richard [3 ]
Du, Yilun [4 ]
Wei, Ying [1 ,5 ]
机构
[1] City Univ Hong Kong, Hong Kong, Peoples R China
[2] Tencent AI Lab, Shenzhen, Peoples R China
[3] UCL, London, England
[4] MIT, Cambridge, MA USA
[5] Nanyang Technol Univ, Singapore, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The success of meta-learning on out-of-distribution (OOD) tasks in the wild has proved to be hit-and-miss. To safeguard the generalization capability of the meta-learned prior knowledge to OOD tasks, in particularly safety-critical applications, necessitates detection of an OOD task followed by adaptation of the task towards the prior. Nonetheless, the reliability of estimated uncertainty on OOD tasks by existing Bayesian meta-learning methods is restricted by incomplete coverage of the feature distribution shift and insufficient expressiveness of the meta-learned prior. Besides, they struggle to adapt an OOD task, running parallel to the line of cross-domain task adaptation solutions which are vulnerable to overfitting. To this end, we build a single coherent framework that supports both detection and adaptation of OOD tasks, while remaining compatible with off-the-shelf meta-learning backbones. The proposed Energy-Based Meta-Learning (EBML) framework learns to characterize any arbitrary meta-training task distribution with the composition of two expressive neural-network-based energy functions. We deploy the sum of the two energy functions, being proportional to the joint distribution of a task, as a reliable score for detecting OOD tasks; during meta-testing, we adapt the OOD task to in-distribution tasks by energy minimization. Experiments on four regression and classification datasets demonstrate the effectiveness of our proposal.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Exploring Energy-Based Models for Out-of-Distribution Detection in Dialect Identification
    Hao, Yaqian
    Hu, Chenguang
    Gao, Yingying
    Zhang, Shilei
    Feng, Junlan
    INTERSPEECH 2024, 2024, : 1640 - 1644
  • [2] Classification of incunable glyphs and out-of-distribution detection with joint energy-based models
    Florian Kordon
    Nikolaus Weichselbaumer
    Randall Herz
    Stephen Mossman
    Edward Potten
    Mathias Seuret
    Martin Mayr
    Vincent Christlein
    International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 223 - 240
  • [3] Classification of incunable glyphs and out-of-distribution detection with joint energy-based models
    Kordon, Florian
    Weichselbaumer, Nikolaus
    Herz, Randall
    Mossman, Stephen
    Potten, Edward
    Seuret, Mathias
    Mayr, Martin
    Christlein, Vincent
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2023, 26 (03) : 223 - 240
  • [4] Improving Energy-Based Out-of-Distribution Detection by Sparsity Regularization
    Chen, Qichao
    Jiang, Wenjie
    Li, Kuan
    Wang, Yi
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 539 - 551
  • [5] On the Adversarial Robustness of Out-of-distribution Generalization Models
    Zou, Xin
    Liu, Weiwei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [6] Certifiable Out-of-Distribution Generalization
    Ye, Nanyang
    Zhu, Lin
    Wang, Jia
    Zeng, Zhaoyu
    Shao, Jiayao
    Peng, Chensheng
    Pan, Bikang
    Li, Kaican
    Zhu, Jun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10927 - 10935
  • [7] Understanding the Generalization of Pretrained Diffusion Models on Out-of-Distribution Data
    Ramachandran, Sai Niranjan
    Mukhopadhyay, Rudrabha
    Agarwal, Madhav
    Jawahar, C. V.
    Namboodiri, Vinay
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 13, 2024, : 14767 - 14775
  • [8] Amodal Segmentation through Out-of-Task and Out-of-Distribution Generalization with a Bayesian Model
    Sun, Yihong
    Kortylewski, Adam
    Yuille, Alan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1205 - 1214
  • [9] Out-of-Distribution Generalization in Kernel Regression
    Canatar, Abdulkadir
    Bordelon, Blake
    Pehlevan, Cengiz
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] Causal softmax for out-of-distribution generalization
    Luo, Jing
    Zhao, Wanqing
    Peng, Jinye
    DIGITAL SIGNAL PROCESSING, 2025, 156