Weakly-supervised learning of visual relations

Cited by: 106

Authors:

Peyre, Julia [1,2]

Laptev, Ivan [1,2]

Schmid, Cordelia [2,4]

Sivic, Josef [1,2,3]

Affiliations:

[1] PSL Res Univ, ENS, CNRS, Dept Informat, F-75005 Paris, France

[2] INRIA, Paris, France

[3] Czech Tech Univ, Czech Inst Informat Robot & Cybernet, Prague, Czech Republic

[4] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France

Source:

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017

Funding:

European Research Council

DOI:

10.1109/ICCV.2017.554

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

This paper introduces a novel approach for modeling visual relations between pairs of objects. We define a relation as a triplet of the form (subject, predicate, object), where the predicate is typically a preposition (e.g., 'under', 'in front of') or a verb (e.g., 'hold', 'ride') that links a pair of objects (subject, object). Learning such relations is challenging because the objects have different spatial configurations and appearances depending on the relation in which they occur. Another major challenge is the difficulty of obtaining annotations, especially at box level, for all possible triplets, which makes both learning and evaluation difficult. The contributions of this paper are threefold. First, we design strong yet flexible visual features that encode the appearance and spatial configuration of pairs of objects. Second, we propose a weakly-supervised discriminative clustering model to learn relations from image-level labels only. Third, we introduce a new challenging dataset of unusual relations (UnRel), together with an exhaustive annotation, that enables accurate evaluation of visual relation retrieval. We show experimentally that our model achieves state-of-the-art results on the visual relationship dataset [32], significantly improving performance on previously unseen relations (zero-shot learning), and confirm this observation on our newly introduced UnRel dataset.
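As a rough illustration of what "spatial configuration for pairs of objects" can mean in practice, the sketch below computes a small geometric descriptor from two bounding boxes. It is not the feature set used in the paper, which the abstract does not specify. The `Box` type, the `spatial_feature` function, and the four chosen quantities (normalized center offsets, log area ratio, intersection over union) are all hypothetical choices made for this example.

```python
import math
from typing import List, NamedTuple


class Box(NamedTuple):
    """Axis-aligned box in pixel coordinates (hypothetical helper type)."""
    x1: float
    y1: float
    x2: float
    y2: float


def spatial_feature(subject: Box, obj: Box) -> List[float]:
    """Illustrative pairwise geometry; assumes both boxes have positive area."""
    sw, sh = subject.x2 - subject.x1, subject.y2 - subject.y1
    ow, oh = obj.x2 - obj.x1, obj.y2 - obj.y1

    # Box centers.
    scx, scy = subject.x1 + sw / 2, subject.y1 + sh / 2
    ocx, ocy = obj.x1 + ow / 2, obj.y1 + oh / 2

    # Offset of the object center from the subject center,
    # normalized by the subject's size so the value is scale-invariant.
    norm = math.sqrt(sw * sh)
    dx = (ocx - scx) / norm
    dy = (ocy - scy) / norm

    # Relative size of the two boxes on a log scale.
    log_area_ratio = math.log((ow * oh) / (sw * sh))

    # Intersection over union of the two boxes.
    ix = max(0.0, min(subject.x2, obj.x2) - max(subject.x1, obj.x1))
    iy = max(0.0, min(subject.y2, obj.y2) - max(subject.y1, obj.y1))
    inter = ix * iy
    iou = inter / (sw * sh + ow * oh - inter)

    return [dx, dy, log_area_ratio, iou]


if __name__ == "__main__":
    # The object sits directly below the subject (image y grows downward).
    print(spatial_feature(Box(0, 0, 10, 10), Box(0, 10, 10, 20)))
```

A descriptor of this kind captures only geometry; an appearance term would have to come from a separate visual encoder, which is outside the scope of this sketch.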

Pages: 5189-5198

Page count: 10

