Weakly-supervised learning of visual relations

Cited by: 106
Authors
Peyre, Julia [1, 2]
Laptev, Ivan [1, 2]
Schmid, Cordelia [2, 4]
Sivic, Josef [1, 2, 3]
Affiliations
[1] PSL Res Univ, ENS, CNRS, Dept Informat, F-75005 Paris, France
[2] INRIA, Paris, France
[3] Czech Tech Univ, Czech Inst Informat Robot & Cybernet, Prague, Czech Republic
[4] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP, LJK, F-38000 Grenoble, France
Funding
European Research Council
DOI
10.1109/ICCV.2017.554
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper introduces a novel approach for modeling visual relations between pairs of objects. We define a relation as a triplet of the form (subject, predicate, object), where the predicate is typically a preposition (e.g. 'under', 'in front of') or a verb ('hold', 'ride') that links a pair of objects (subject, object). Learning such relations is challenging because the objects have different spatial configurations and appearances depending on the relation in which they occur. Another major challenge is the difficulty of obtaining annotations, especially at box level, for all possible triplets, which makes both learning and evaluation difficult. The contributions of this paper are threefold. First, we design strong yet flexible visual features that encode the appearance and spatial configuration of pairs of objects. Second, we propose a weakly-supervised discriminative clustering model to learn relations from image-level labels only. Third, we introduce a new, challenging dataset of unusual relations (UnRel), together with an exhaustive annotation, that enables accurate evaluation of visual relation retrieval. We show experimentally that our model achieves state-of-the-art results on the visual relationship dataset [32], significantly improving performance on previously unseen relations (zero-shot learning), and confirm this observation on our newly introduced UnRel dataset.
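As an illustration of what "spatial configuration for pairs of objects" can mean in practice, the following is a minimal Python sketch of one plausible encoding of a (subject, object) box pair. It is not taken from this record and should not be read as the authors' feature definition: the `Box` class, the `iou` and `spatial_features` functions, and the particular six quantities chosen are all assumptions made for this example.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass(frozen=True)
class Box:
    """Axis-aligned box given by its center (x, y), width w and height h.

    Hypothetical helper type for this sketch only.
    """
    x: float
    y: float
    w: float
    h: float


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes (0.0 when they do not overlap)."""
    inter_w = max(0.0, min(a.x + a.w / 2, b.x + b.w / 2) - max(a.x - a.w / 2, b.x - b.w / 2))
    inter_h = max(0.0, min(a.y + a.h / 2, b.y + b.h / 2) - max(a.y - a.h / 2, b.y - b.h / 2))
    inter = inter_w * inter_h
    union = a.w * a.h + b.w * b.h - inter
    return inter / union if union > 0 else 0.0


def spatial_features(subj: Box, obj: Box) -> list[float]:
    """Describe the object box relative to the subject box.

    Assumed layout (an illustrative choice, not the paper's definition):
    offset in x and y normalized by the subject's scale, relative scale,
    overlap, and the aspect ratio of each box.
    """
    subj_scale = sqrt(subj.w * subj.h)
    return [
        (obj.x - subj.x) / subj_scale,           # horizontal offset
        (obj.y - subj.y) / subj_scale,           # vertical offset
        sqrt((obj.w * obj.h) / (subj.w * subj.h)),  # relative scale
        iou(subj, obj),                          # overlap
        subj.w / subj.h,                         # subject aspect ratio
        obj.w / obj.h,                           # object aspect ratio
    ]


# A relation triplet in the abstract's (subject, predicate, object) form.
triplet = ("person", "ride", "horse")
features = spatial_features(Box(0, 0, 2, 2), Box(4, 0, 2, 2))
```

Normalizing the offsets by the subject's scale keeps the description unchanged when the whole image is resized, which is one reason a relative encoding of this kind is a natural choice for predicates such as 'under' or 'in front of'.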
Pages: 5189-5198
Number of pages: 10