Visual relation of interest detection based on part detection

被引：0

作者：

Zhou, You ^{[1
]}

Yu, Fan ^{[2
]}

机构：

[1] Nanjing Univ, Jiangsu Vocat Inst Commerce, Nanjing, Peoples R China

[2] Nanjing Univ, Shenzhen Res Inst, Nanjing, Peoples R China

来源：

INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021 | 2021年 / 11884卷

关键词：

Visual relation of interest detection; interest propagation network; interest propagation from part;

D O I：

10.1117/12.2605443

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Visual relation detection (VRD) aims to describe images with relation triplets like <subject, predicate,=object=>, paying attention to the interaction between every two instances. To detect the visual relations that express the main content of a given image, visual relation of interest detection (VROID) is proposed as an extension of the traditional VRD task. The existing methods related to the general VRD task are mostly based on instance-level features and the methods that adopt detailed information only use part-level attention or human body parts. None of the existing methods take advantage of general semantic parts. Therefore, on the basis of the IPNet for VROID, we further propose an interest propagation form part (IPFP) method which propagates interest along "part-instance-pair-triplet" to detect visual relations of interest. The IPFP method consists of four modules. Panoptic Object-Part Detection module, which extracts instances with instance features and instance parts with part features, Part Interest Prediction module. which predicts interest for every single part, Instance Interest Prediction module, which predicts interest for every single instance; the PairiP module predicts interest for each pair of instances; and the PredIP module predicts possible predicates for each instance pairs, Pair Interest Prediction module. which predicts interest for each pair of instances, and Predicate Interest Prediction module. which predicts possible predicates for each instance pairs. The interest scores of visual relations are the product of pair interest scores and predicate possibilities for pairs. We evaluate the performance of the IPFP method and the effectiveness of important components using the ViROI dataset for VROID.

引用

页数：8

共 50 条

[21] Modelling relations with prototypes for visual relation detection
Plesse, Francois
Ginsca, Alexandru
Delezoide, Bertrand
Preteux, Francoise
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22465 - 22486
[22] Mouth detection based on interest point
Wang Li
Ye Hua
Xia Liangzheng
PROCEEDINGS OF THE 26TH CHINESE CONTROL CONFERENCE, VOL 4, 2007, : 610 - +
[23] Efficient Detection of Points of Interest from Georeferenced Visual Content
Lu, Ying
Colmenares, Juan A.
BIGSPATIAL 2017: PROCEEDINGS OF THE 6TH ACM SIGSPATIAL INTERNATIONAL WORKSHOP ON ANALYTICS FOR BIG GEOSPATIAL DATA (BIGSPATIAL-2017), 2017, : 27 - 36
[24] Points of Interest and Visual Dictionaries for Automatic Retinal Lesion Detection
Rocha, Anderson
Carvalho, Tiago
Jelinek, Herbert F.
Goldenstein, Siome
Wainer, Jacques
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2012, 59 (08) : 2244 - 2253
[25] Improving Blast Furnace Raceway Blockage Detection. Part 3: Visual Detection Based on Tuyere Camera Images
Puttinger, Stefan
Stocker, Hugo
ISIJ INTERNATIONAL, 2019, 59 (03) : 481 - 488
[26] Detection of change in shape and its relation to part structure
Bertamini, M
Farrant, T
ACTA PSYCHOLOGICA, 2005, 120 (01) : 35 - 54
[27] Improving Visual Relation Detection using Depth Maps
Sharifzadeh, Sahand
Baharlou, Sina Moayed
Berrendorf, Max
Koner, Rajat
Tresp, Volker
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3597 - 3604
[28] VISUAL BACKWARD MASKING AND AREA-DETECTION RELATION
SHERRICK, MF
DEMBER, WN
PSYCHONOMIC SCIENCE, 1970, 19 (02): : 127 - 128
[29] Visual Relation Detection with Multi-Level Attention
Zheng, Sipeng
Chen, Shizhe
Jin, Qin
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 121 - 129
[30] Visual Relation Detection Using Hybrid Analogical Learning
Chen, Kezhen
Forbus, Ken
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 801 - 808

← 1 2 3 4 5 →