Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions

被引:0
|
作者
Liu, Kai [1 ,2 ]
Fu, Zhihang [2 ]
Chen, Chao [2 ]
Jin, Sheng [2 ]
Chen, Ze [2 ]
Tao, Mingyuan [2 ]
Jiang, Rongxin [1 ]
Ye, Jieping [2 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] Alibaba Cloud, Hangzhou, Peoples R China
基金
国家重点研发计划;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The key to OOD detection has two aspects: generalized feature representation and precise category description. Recently, vision-language models such as CLIP provide significant advances in both two issues, but constructing precise category descriptions is still in its infancy due to the absence of unseen categories. This work introduces two hierarchical contexts, namely perceptual context and spurious context, to carefully describe the precise category boundary through automatic prompt tuning. Specifically, perceptual contexts perceive the inter-category difference (e.g., cats vs apples) for current classification tasks, while spurious contexts further identify spurious (similar but exactly not) OOD samples for every single category (e.g., cats vs panthers, apples vs peaches). The two contexts hierarchically construct the precise description for a certain category, which is, first roughly classifying a sample to the predicted category and then delicately identifying whether it is truly an ID sample or actually OOD. Moreover, the precise descriptions for those categories within the vision-language framework present a novel application: CATegory-EXtensible OOD detection (CATEX). One can efficiently extend the set of recognizable categories by simply merging the hierarchical contexts learned under different sub-task settings. And extensive experiments are conducted to demonstrate CATEX's effectiveness, robustness, and category-extensibility. For instance, CATEX consistently surpasses the rivals by a large margin with several protocols on the challenging ImageNet-1K dataset. In addition, we offer new insights on how to efficiently scale up the prompt engineering in vision-language models to recognize thousands of object categories, as well as how to incorporate large language models (like GPT-3) to boost zero-shot applications.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Boosting Out-of-distribution Detection with Typical Features
    Zhu, Yao
    Chen, Yuefeng
    Xie, Chuanlong
    Li, Xiaodan
    Zhang, Rong
    Xue, Hui
    Tian, Xiang
    Zheng, Bolun
    Chen, Yaowu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [42] Out-of-distribution detection by regaining lost clues
    Zhao, Zhilin
    Cao, Longbing
    Yu, Philip S.
    ARTIFICIAL INTELLIGENCE, 2025, 339
  • [43] Ensemble-Based Out-of-Distribution Detection
    Yang, Donghun
    Mai Ngoc, Kien
    Shin, Iksoo
    Lee, Kyong-Ha
    Hwang, Myunggwon
    ELECTRONICS, 2021, 10 (05) : 1 - 12
  • [44] SELFOOD: Self-Supervised Out-Of-Distribution Detection via Learning to Rank
    Mekalas, Dheeraj
    Samavedhi, Adithya
    Dong, Chengyu
    Shang, Jingbo
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EMNLP 2023), 2023, : 10721 - 10734
  • [45] Full-Spectrum Out-of-Distribution Detection
    Jingkang Yang
    Kaiyang Zhou
    Ziwei Liu
    International Journal of Computer Vision, 2023, 131 : 2607 - 2622
  • [46] Leveraging Visual Attention for out-of-distribution Detection
    Cultrera, Luca
    Seidenari, Lorenzo
    Del Bimbo, Alberto
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 4449 - 4458
  • [47] Heatmap-based Out-of-Distribution Detection
    Hornauer, Julia
    Belagiannis, Vasileios
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2602 - 2611
  • [48] A Simple Framework for Robust Out-of-Distribution Detection
    Hur, Youngbum
    Yang, Eunho
    Hwang, Sung Ju
    IEEE ACCESS, 2022, 10 : 23086 - 23097
  • [49] A Critical Analysis of Document Out-of-Distribution Detection
    Gu, Jiuxiang
    Ming, Yifei
    Zhou, Yi
    Kuen, Jason
    Morariu, Vlad I.
    Zhao, Handong
    Zhang, Ruiyi
    Barmpalios, Nikolaos
    Liu, Anqi
    Li, Yixuan
    Sun, Tong
    Nenkova, Ani
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 4973 - 4999
  • [50] Weighted Mutual Information for Out-Of-Distribution Detection
    De Bernardi, Giacomo
    Narteni, Sara
    Cambiaso, Enrico
    Muselli, Marco
    Mongelli, Maurizio
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT III, 2023, 1903 : 318 - 331