Visual relation of interest detection based on part detection

被引:0
|
作者
Zhou, You [1 ]
Yu, Fan [2 ]
机构
[1] Nanjing Univ, Jiangsu Vocat Inst Commerce, Nanjing, Peoples R China
[2] Nanjing Univ, Shenzhen Res Inst, Nanjing, Peoples R China
来源
INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021 | 2021年 / 11884卷
关键词
Visual relation of interest detection; interest propagation network; interest propagation from part;
D O I
10.1117/12.2605443
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual relation detection (VRD) aims to describe images with relation triplets like <subject, predicate,=object=>, paying attention to the interaction between every two instances. To detect the visual relations that express the main content of a given image, visual relation of interest detection (VROID) is proposed as an extension of the traditional VRD task. The existing methods related to the general VRD task are mostly based on instance-level features and the methods that adopt detailed information only use part-level attention or human body parts. None of the existing methods take advantage of general semantic parts. Therefore, on the basis of the IPNet for VROID, we further propose an interest propagation form part (IPFP) method which propagates interest along "part-instance-pair-triplet" to detect visual relations of interest. The IPFP method consists of four modules. Panoptic Object-Part Detection module, which extracts instances with instance features and instance parts with part features, Part Interest Prediction module. which predicts interest for every single part, Instance Interest Prediction module, which predicts interest for every single instance; the PairiP module predicts interest for each pair of instances; and the PredIP module predicts possible predicates for each instance pairs, Pair Interest Prediction module. which predicts interest for each pair of instances, and Predicate Interest Prediction module. which predicts possible predicates for each instance pairs. The interest scores of visual relations are the product of pair interest scores and predicate possibilities for pairs. We evaluate the performance of the IPFP method and the effectiveness of important components using the ViROI dataset for VROID.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Modelling relations with prototypes for visual relation detection
    Plesse, Francois
    Ginsca, Alexandru
    Delezoide, Bertrand
    Preteux, Francoise
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22465 - 22486
  • [22] Mouth detection based on interest point
    Wang Li
    Ye Hua
    Xia Liangzheng
    PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, 2007, : 610 - +
  • [23] Efficient Detection of Points of Interest from Georeferenced Visual Content
    Lu, Ying
    Colmenares, Juan A.
    BIGSPATIAL 2017: PROCEEDINGS OF THE 6TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ANALYTICS FOR BIG GEOSPATIAL DATA (BIGSPATIAL-2017), 2017, : 27 - 36
  • [24] Points of Interest and Visual Dictionaries for Automatic Retinal Lesion Detection
    Rocha, Anderson
    Carvalho, Tiago
    Jelinek, Herbert F.
    Goldenstein, Siome
    Wainer, Jacques
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (08) : 2244 - 2253
  • [25] Improving Blast Furnace Raceway Blockage Detection. Part 3: Visual Detection Based on Tuyere Camera Images
    Puttinger, Stefan
    Stocker, Hugo
    ISIJ INTERNATIONAL, 2019, 59 (03) : 481 - 488
  • [26] Detection of change in shape and its relation to part structure
    Bertamini, M
    Farrant, T
    ACTA PSYCHOLOGICA, 2005, 120 (01) : 35 - 54
  • [27] Improving Visual Relation Detection using Depth Maps
    Sharifzadeh, Sahand
    Baharlou, Sina Moayed
    Berrendorf, Max
    Koner, Rajat
    Tresp, Volker
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3597 - 3604
  • [28] VISUAL BACKWARD MASKING AND AREA-DETECTION RELATION
    SHERRICK, MF
    DEMBER, WN
    PSYCHONOMIC SCIENCE, 1970, 19 (02): : 127 - 128
  • [29] Visual Relation Detection with Multi-Level Attention
    Zheng, Sipeng
    Chen, Shizhe
    Jin, Qin
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 121 - 129
  • [30] Visual Relation Detection Using Hybrid Analogical Learning
    Chen, Kezhen
    Forbus, Ken
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 801 - 808