Graph-based social relation inference with multi-level conditional attention

被引:0
|
作者
Yu, Xiaotian [1 ]
Yi, Hanling [1 ]
Tang, Qie [1 ]
Huang, Kun [1 ]
Hu, Wenze [1 ]
Zhang, Shiliang [2 ]
Wang, Xiaoyu [1 ,3 ]
机构
[1] Shenzhen Intellifus Inc, Dept AI Technol Ctr, Shenzhen, Peoples R China
[2] Peking Univ, Dept Comp Sci, Beijing, Peoples R China
[3] Chinese Univ Hong Kong, Shenzhen, Peoples R China
关键词
Social relation inference; Multi-level conditional attention; Transformer; NEURAL-NETWORKS;
D O I
10.1016/j.neunet.2024.106216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social relation inference intrinsically requires high-level semantic understanding. In order to accurately infer relations of persons in images, one needs not only to understand scenes and objects in images, but also to adaptively attend to important clues. Unlike prior works of classifying social relations using attention on detected objects, we propose a MUlti-level Conditional Attention (MUCA) mechanism for social relation inference, which attends to scenes, objects and human interactions based on each person pair. Then, we develop a transformer -style network to achieve the MUCA mechanism. The novel network named as Graphbased Relation Inference Transformer (i.e., GRIT) consists of two modules, i.e., a Conditional Query Module (CQM) and a Relation Attention Module (RAM). Specifically, we design a graph -based CQM to generate informative relation queries for all person pairs, which fuses local features and global context for each person pair. Moreover, we fully take advantage of transformer -style networks in RAM for multi -level attentions in classifying social relations. To our best knowledge, GRIT is the first for inferring social relations with multilevel conditional attention. GRIT is end -to -end trainable and significantly outperforms existing methods on two benchmark datasets, e.g., with performance improvement of 7.8% on PIPA and 9.6% on PISC.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Multi-level Attention-based Domain Disentanglement for BCDR
    Zhang, Xinyue
    Li, Jingjing
    Su, Hongzu
    Zhu, Lei
    Shen, Heng Tao
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2023, 41 (04)
  • [42] Image inpainting network based on multi-level attention mechanism
    Xiang, Hongyue
    Min, Weidong
    Wei, Zitai
    Zhu, Meng
    Liu, Mengxue
    Deng, Ziyang
    IET IMAGE PROCESSING, 2024, 18 (02) : 428 - 438
  • [43] Multi-level conditional spectrum-based record selection for IDA
    Kohrangi, Mohsen
    Vamvatsikos, Dimitrios
    Bazzurro, Paolo
    EARTHQUAKE SPECTRA, 2020, 36 (04) : 1976 - 1994
  • [44] Multi-scale and Multi-level Attention Based on External Knowledge in EHRs
    Le, Duc
    Le, Bac
    RECENT CHALLENGES IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2024, PT I, 2024, 2144 : 113 - 125
  • [45] Inferring Users' Social Roles with a Multi-Level Graph Neural Network Model
    Zhang, Chunrui
    Wang, Shen
    Zhan, Dechen
    Yin, Mingyong
    Lou, Fang
    ENTROPY, 2021, 23 (11)
  • [46] Graph-based multi-level feature fusion network for diabetic retinopathy grading using ultra-wide-field images
    Zhang, Dan
    Liu, Mengting
    Chen, Fangsheng
    Lu, Qinkang
    Zhao, Yitian
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 93
  • [47] Optimized Graph Search Using Multi-Level Graph Clustering
    Kala, Rahul
    Shukla, Anupam
    Tiwari, Ritu
    CONTEMPORARY COMPUTING, PROCEEDINGS, 2009, 40 : 103 - 114
  • [48] Graph convolutional networks with multi-level coarsening for graph classification
    Xie, Yu
    Yao, Chuanyu
    Gong, Maoguo
    Chen, Cheng
    Qin, A. K.
    KNOWLEDGE-BASED SYSTEMS, 2020, 194
  • [49] A knowledge graph embedding model based on multi-level analogical reasoning
    Zhao, Xiaofei
    Yang, Mengqian
    Yang, Hongji
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (08): : 10553 - 10567
  • [50] An effective multi-level algorithm based on simulated annealing for bisecting graph
    Sun, Lingyu
    Leng, Ming
    ENERGY MINIMIZATION METHODS IN COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 2007, 4679 : 1 - +