Weakly-supervised learning of visual relations

被引:106
|
作者
Peyre, Julia [1 ,2 ]
Laptev, Ivan [1 ,2 ]
Schmid, Cordelia [2 ,4 ]
Sivic, Josef [1 ,2 ,3 ]
机构
[1] PSL Res Univ, ENS, CNRS, Dept Informat, F-75005 Paris, France
[2] INRIA, Paris, France
[3] Czech Tech Univ, Czech Inst Informat Robot & Cybernet, Prague, Czech Republic
[4] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
基金
欧洲研究理事会;
关键词
D O I
10.1109/ICCV.2017.554
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a novel approach for modeling visual relations between pairs of objects. We call relation a triplet of the form (subject, predicate, object) where the predicate is typically a preposition (eg. 'under', 'in front of') or a verb ('hold', 'ride') that links a pair of objects (subject, object). Learning such relations is challenging as the objects have different spatial configurations and appearances depending on the relation in which they occur. Another major challenge comes from the difficulty to get annotations, especially at box-level, for all possible triplets, which makes both learning and evaluation difficult. The contributions of this paper are threefold. First, we design strong yet flexible visual features that encode the appearance and spatial configuration for pairs of objects. Second, we propose a weakly-supervised discriminative clustering model to learn relations from image-level labels only. Third we introduce a new challenging dataset of unusual relations (UnRel) together with an exhaustive annotation, that enables accurate evaluation of visual relation retrieval. We show experimentally that our model results in state-of-the-art results on the visual relationship dataset [32] significantly improving performance on previously unseen relations (zero-shot learning), and confirm this observation on our newly introduced UnRel dataset.
引用
收藏
页码:5189 / 5198
页数:10
相关论文
共 50 条
  • [21] Deep Learning Frameworks for Weakly-Supervised Indoor Localization
    Zanjani, Farhad G.
    Karmanov, Ilia
    Ackermann, Hanno
    Dijkman, Daniel
    Merlin, Simone
    Kadampot, Ishaque
    Buesker, Brian
    Vegunta, Vamsi
    Porikli, Fatih
    NEURIPS 2021 COMPETITIONS AND DEMONSTRATIONS TRACK, VOL 176, 2021, 176 : 349 - 354
  • [22] Weakly-Supervised Contrastive Learning for Unsupervised Object Discovery
    Lv, Yunqiu
    Zhang, Jing
    Barnes, Nick
    Dai, Yuchao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 2689 - 2702
  • [23] Semantic-Aware Registration with Weakly-Supervised Learning
    Jin, Zhan
    Xue, Peng
    Zhang, Yuyao
    Cao, Xiaohuan
    Shen, Dinggang
    CANCER PREVENTION THROUGH EARLY DETECTION, CAPTION 2022, 2022, 13581 : 159 - 168
  • [24] Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning
    Wang, Xiang
    Liu, Sifei
    Ma, Huimin
    Yang, Ming-Hsuan
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (06) : 1736 - 1749
  • [25] Weakly-Supervised Semantic Segmentation by Iterative Affinity Learning
    Xiang Wang
    Sifei Liu
    Huimin Ma
    Ming-Hsuan Yang
    International Journal of Computer Vision, 2020, 128 : 1736 - 1749
  • [26] Weakly-supervised learning approach for potato defects segmentation
    Marino, Sofia
    Beauseroy, Pierre
    Smolarz, Andre
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 337 - 346
  • [27] Multimodal Generative Models for Scalable Weakly-Supervised Learning
    Wu, Mike
    Goodman, Noah
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [28] Adversarial Learning for Weakly-Supervised Social Network Alignment
    Li, Chaozhuo
    Wang, Senzhang
    Wang, Yukun
    Yu, Philip
    Liang, Yanbo
    Liu, Yun
    Li, Zhoujun
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 996 - 1003
  • [29] Weakly-Supervised Semantic Segmentation by Learning Label Uncertainty
    Neven, Robby
    Neven, Davy
    De Brabandere, Bert
    Proesmans, Marc
    Goedeme, Toon
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1678 - 1686
  • [30] ALWOD: Active Learning for Weakly-Supervised Object Detection
    Wang, Yuting
    Ilic, Velibor
    Li, Jiatong
    Kisacanin, Branislav
    Pavlovic, Vladimir
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 6436 - 6446