Deep Multiple Instance Learning for Zero-Shot Image Tagging

被引：2

作者：

Rahman, Shafin ^{[1
,2
]}

Khan, Salman ^{[1
,2
,3
]}

机构：

[1] Australian Natl Univ, Canberra, ACT 2601, Australia

[2] CSIRO, Data61, Canberra, ACT 2601, Australia

[3] Incept Inst AI, Abu Dhabi, U Arab Emirates

来源：

COMPUTER VISION - ACCV 2018, PT I | 2019年 / 11361卷

关键词：

Zero-shot learning; Zero-shot tagging; Object detection;

D O I：

10.1007/978-3-030-20887-5_33

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In-line with the success of deep learning on traditional recognition problem, several end-to-end deep models for zero-shot recognition have been proposed in the literature. These models are successful to predict a single unseen label given an input image, but does not scale to cases where multiple unseen objects are present. In this paper, we model this problem within the framework of Multiple Instance Learning (MIL). To the best of our knowledge, we propose the first end-to-end trainable deep MIL framework for the multi-label zero-shot tagging problem. Due to its novel design, the proposed framework has several interesting features: (1) Unlike previous deep MIL models, it does not use any off-line procedure (e.g., Selective Search or EdgeBoxes) for bag generation. (2) During test time, it can process any number of unseen labels given their semantic embedding vectors. (3) Using only seen labels per image as weak annotation, it can produce a bounding box for each predicted label. We experiment with large-scale NUS-WIDE dataset and achieve superior performance across conventional, zero-shot and generalized zero-shot tagging tasks.

引用

页码：530 / 546

页数：17

共 50 条

[1] Deep0Tag: Deep Multiple Instance Learning for Zero-Shot Image Tagging
Rahman, Shafin
Khan, Salman
Barnes, Nick
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) : 242 - 255
[2] Fast Zero-Shot Image Tagging
Zhang, Yang
Gong, Boqing
Shah, Mubarak
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5985 - 5994
[3] Zero-shot Image Tagging by Hierarchical Semantic Embedding
Li, Xirong
Liao, Shuai
Lan, Weiyu
Du, Xiaoyong
Yang, Gang
SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 879 - 882
[4] Zero-Shot Instance Segmentation
Zheng, Ye
Wu, Jiahong
Qin, Yongqiang
Zhang, Faen
Cui, Li
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2593 - 2602
[5] Prioritized Semantic Learning for Zero-Shot Instance Navigation
Sun, Xinyu
Liu, Lizhao
Zhi, Hongyan
Qiu, Ronghe
Liang, Junwei
COMPUTER VISION - ECCV 2024, PT XII, 2025, 15070 : 161 - 178
[6] Learning Discriminative Instance Attribute for Zero-Shot Classification
Wang, Lu
Wu, Songsong
Yu, Jun
Jing, Xiao-Yuan
PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1, 2016, : 210 - 213
[7] ZSDECNet: A zero-shot deep learning framework for image exposure correction
Li, Wenchao
Wen, Shuyuan
Zhu, Jinhao
Ou, Qiaofeng
Guo, Yanchun
Chen, Jiabao
Xiong, Bangshu
NEUROCOMPUTING, 2025, 627
[8] DEEP ZERO-SHOT LEARNING FOR SCENE SKETCH
Xie, Yao
Xu, Peng
Ma, Zhanyu
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 3661 - 3665
[9] Learning a Deep Embedding Model for Zero-Shot Learning
Zhang, Li
Xiang, Tao
Gong, Shaogang
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 3010 - 3019
[10] Webly-supervised zero-shot learning for artwork instance recognition
Del Chiaro, Riccardo
Bagdanov, Andrew D.
Del Bimbo, Alberto
PATTERN RECOGNITION LETTERS, 2019, 128 : 420 - 426

← 1 2 3 4 5 →