Graph-based social relation inference with multi-level conditional attention

被引：0

作者：

Yu, Xiaotian ^{[1
]}

Yi, Hanling ^{[1
]}

Tang, Qie ^{[1
]}

Huang, Kun ^{[1
]}

Hu, Wenze ^{[1
]}

Zhang, Shiliang ^{[2
]}

Wang, Xiaoyu ^{[1
,3
]}

机构：

[1] Shenzhen Intellifus Inc, Dept AI Technol Ctr, Shenzhen, Peoples R China

[2] Peking Univ, Dept Comp Sci, Beijing, Peoples R China

[3] Chinese Univ Hong Kong, Shenzhen, Peoples R China

来源：

NEURAL NETWORKS | 2024年 / 173卷

关键词：

Social relation inference; Multi-level conditional attention; Transformer; NEURAL-NETWORKS;

D O I：

10.1016/j.neunet.2024.106216

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Social relation inference intrinsically requires high-level semantic understanding. In order to accurately infer relations of persons in images, one needs not only to understand scenes and objects in images, but also to adaptively attend to important clues. Unlike prior works of classifying social relations using attention on detected objects, we propose a MUlti-level Conditional Attention (MUCA) mechanism for social relation inference, which attends to scenes, objects and human interactions based on each person pair. Then, we develop a transformer -style network to achieve the MUCA mechanism. The novel network named as Graphbased Relation Inference Transformer (i.e., GRIT) consists of two modules, i.e., a Conditional Query Module (CQM) and a Relation Attention Module (RAM). Specifically, we design a graph -based CQM to generate informative relation queries for all person pairs, which fuses local features and global context for each person pair. Moreover, we fully take advantage of transformer -style networks in RAM for multi -level attentions in classifying social relations. To our best knowledge, GRIT is the first for inferring social relations with multilevel conditional attention. GRIT is end -to -end trainable and significantly outperforms existing methods on two benchmark datasets, e.g., with performance improvement of 7.8% on PIPA and 9.6% on PISC.

引用

页数：15

共 50 条

[41] Multi-level Attention-based Domain Disentanglement for BCDR
Zhang, Xinyue
Li, Jingjing
Su, Hongzu
Zhu, Lei
Shen, Heng Tao
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (04)
[42] Image inpainting network based on multi-level attention mechanism
Xiang, Hongyue
Min, Weidong
Wei, Zitai
Zhu, Meng
Liu, Mengxue
Deng, Ziyang
IET IMAGE PROCESSING, 2024, 18 (02) : 428 - 438
[43] Multi-level conditional spectrum-based record selection for IDA
Kohrangi, Mohsen
Vamvatsikos, Dimitrios
Bazzurro, Paolo
EARTHQUAKE SPECTRA, 2020, 36 (04) : 1976 - 1994
[44] Multi-scale and Multi-level Attention Based on External Knowledge in EHRs
Le, Duc
Le, Bac
RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2024, PT I, 2024, 2144 : 113 - 125
[45] Inferring Users' Social Roles with a Multi-Level Graph Neural Network Model
Zhang, Chunrui
Wang, Shen
Zhan, Dechen
Yin, Mingyong
Lou, Fang
ENTROPY, 2021, 23 (11)
[46] Graph-based multi-level feature fusion network for diabetic retinopathy grading using ultra-wide-field images
Zhang, Dan
Liu, Mengting
Chen, Fangsheng
Lu, Qinkang
Zhao, Yitian
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 93
[47] Optimized Graph Search Using Multi-Level Graph Clustering
Kala, Rahul
Shukla, Anupam
Tiwari, Ritu
CONTEMPORARY COMPUTING, PROCEEDINGS, 2009, 40 : 103 - 114
[48] Graph convolutional networks with multi-level coarsening for graph classification
Xie, Yu
Yao, Chuanyu
Gong, Maoguo
Chen, Cheng
Qin, A. K.
KNOWLEDGE-BASED SYSTEMS, 2020, 194
[49] A knowledge graph embedding model based on multi-level analogical reasoning
Zhao, Xiaofei
Yang, Mengqian
Yang, Hongji
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 10553 - 10567
[50] An effective multi-level algorithm based on simulated annealing for bisecting graph
Sun, Lingyu
Leng, Ming
ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 2007, 4679 : 1 - +

← 1 2 3 4 5 →