Deep Multiple Instance Learning for Zero-Shot Image Tagging

被引:2
|
作者
Rahman, Shafin [1 ,2 ]
Khan, Salman [1 ,2 ,3 ]
机构
[1] Australian Natl Univ, Canberra, ACT 2601, Australia
[2] CSIRO, Data61, Canberra, ACT 2601, Australia
[3] Incept Inst AI, Abu Dhabi, U Arab Emirates
来源
关键词
Zero-shot learning; Zero-shot tagging; Object detection;
D O I
10.1007/978-3-030-20887-5_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In-line with the success of deep learning on traditional recognition problem, several end-to-end deep models for zero-shot recognition have been proposed in the literature. These models are successful to predict a single unseen label given an input image, but does not scale to cases where multiple unseen objects are present. In this paper, we model this problem within the framework of Multiple Instance Learning (MIL). To the best of our knowledge, we propose the first end-to-end trainable deep MIL framework for the multi-label zero-shot tagging problem. Due to its novel design, the proposed framework has several interesting features: (1) Unlike previous deep MIL models, it does not use any off-line procedure (e.g., Selective Search or EdgeBoxes) for bag generation. (2) During test time, it can process any number of unseen labels given their semantic embedding vectors. (3) Using only seen labels per image as weak annotation, it can produce a bounding box for each predicted label. We experiment with large-scale NUS-WIDE dataset and achieve superior performance across conventional, zero-shot and generalized zero-shot tagging tasks.
引用
收藏
页码:530 / 546
页数:17
相关论文
共 50 条
  • [21] Zero-Shot Image Dehazing
    Li, Boyun
    Gou, Yuanbiao
    Liu, Jerry Zitao
    Zhu, Hongyuan
    Zhou, Joey Tianyi
    Peng, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8457 - 8466
  • [22] Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
    Lu, Ming Y.
    Chen, Bowen
    Zhang, Andrew
    Williamson, Drew F. K.
    Chen, Richard J.
    Ding, Tong
    Le, Long Phi
    Chuang, Yung-Sung
    Mahmood, Faisal
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 19764 - 19775
  • [23] Zero-Shot Image Classification Based on Deep Feature Extraction
    Wang, Xuesong
    Chen, Chen
    Cheng, Yuhu
    Wang, Z. Jane
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2018, 10 (02) : 432 - 444
  • [24] Zero-Shot Image Feature Consensus with Deep Functional Maps
    Cheng, Xinle
    Deng, Congyue
    Harley, Adam W.
    Zhu, Yixin
    Guibas, Leonidas
    COMPUTER VISION - ECCV 2024, PT XLVII, 2025, 15105 : 277 - 293
  • [25] Zero-Shot Image Classification Based on a Learnable Deep Metric
    Liu, Jingyi
    Shi, Caijuan
    Tu, Dongjing
    Shi, Ze
    Liu, Yazhi
    SENSORS, 2021, 21 (09)
  • [26] Attribute relaxation from class level to instance level for zero-shot learning
    Zhang, Haofeng
    Long, Yang
    Zhao, Chunxia
    ELECTRONICS LETTERS, 2018, 54 (20) : 1170 - 1171
  • [27] Estimation of Near-Instance-Level Attribute Bottleneck for Zero-Shot Learning
    Jiang, Chenyi
    Shen, Yuming
    Chen, Dubing
    Zhang, Haofeng
    Shao, Ling
    Torr, Philip H. S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (08) : 2962 - 2988
  • [28] Instance-Specific Model Perturbation Improves Generalized Zero-Shot Learning
    Yang, Guanyu
    Huang, Kaizhu
    Zhang, Rui
    Yang, Xi
    NEURAL COMPUTATION, 2024, 36 (05) : 936 - 962
  • [29] Ordinal Zero-Shot Learning
    Huo, Zengwei
    Geng, Xin
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1916 - 1922
  • [30] Zero-Shot Kernel Learning
    Zhang, Hongguang
    Koniusz, Piotr
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7670 - 7679