Graph-based social relation inference with multi-level conditional attention

被引:0
|
作者
Yu, Xiaotian [1 ]
Yi, Hanling [1 ]
Tang, Qie [1 ]
Huang, Kun [1 ]
Hu, Wenze [1 ]
Zhang, Shiliang [2 ]
Wang, Xiaoyu [1 ,3 ]
机构
[1] Shenzhen Intellifus Inc, Dept AI Technol Ctr, Shenzhen, Peoples R China
[2] Peking Univ, Dept Comp Sci, Beijing, Peoples R China
[3] Chinese Univ Hong Kong, Shenzhen, Peoples R China
关键词
Social relation inference; Multi-level conditional attention; Transformer; NEURAL-NETWORKS;
D O I
10.1016/j.neunet.2024.106216
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social relation inference intrinsically requires high-level semantic understanding. In order to accurately infer relations of persons in images, one needs not only to understand scenes and objects in images, but also to adaptively attend to important clues. Unlike prior works of classifying social relations using attention on detected objects, we propose a MUlti-level Conditional Attention (MUCA) mechanism for social relation inference, which attends to scenes, objects and human interactions based on each person pair. Then, we develop a transformer -style network to achieve the MUCA mechanism. The novel network named as Graphbased Relation Inference Transformer (i.e., GRIT) consists of two modules, i.e., a Conditional Query Module (CQM) and a Relation Attention Module (RAM). Specifically, we design a graph -based CQM to generate informative relation queries for all person pairs, which fuses local features and global context for each person pair. Moreover, we fully take advantage of transformer -style networks in RAM for multi -level attentions in classifying social relations. To our best knowledge, GRIT is the first for inferring social relations with multilevel conditional attention. GRIT is end -to -end trainable and significantly outperforms existing methods on two benchmark datasets, e.g., with performance improvement of 7.8% on PIPA and 9.6% on PISC.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] A graph-based multi-level linguistic representation for document understanding
    Pinto, David
    Gomez-Adorno, Helena
    Vilarino, Darnes
    Singh, Vivek Kumar
    PATTERN RECOGNITION LETTERS, 2014, 41 : 93 - 102
  • [2] Graph-based modeling of ETL activities with multi-level transformations and updates
    Simitsis, A
    Vassiliadis, P
    Terrovitis, M
    Skiadopoulos, S
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2005, 3589 : 43 - 52
  • [3] A Graph-Based Multi-level Framework to Support the Designing of Collaborative Workplaces
    Di Marino, Castrese
    Rega, Andrea
    Fruggiero, Fabio
    Pasquariello, Agnese
    Vitolo, Ferdinando
    Patalano, Stanislao
    DESIGN TOOLS AND METHODS IN INDUSTRIAL ENGINEERING II, ADM 2021, 2022, : 641 - 649
  • [4] A Graph-Based Multi-level Framework to Support the Designing of Collaborative Workplaces
    Di Marino, Castrese
    Rega, Andrea
    Fruggiero, Fabio
    Pasquariello, Agnese
    Vitolo, Ferdinando
    Patalano, Stanislao
    Lecture Notes in Mechanical Engineering, 2022, : 641 - 649
  • [5] On improved graph-based alternative wiring scheme for multi-level logic optimization
    Wu, YL
    Sze, CN
    Cheung, CC
    Fan, HB
    ICECS 2000: 7TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS & SYSTEMS, VOLS I AND II, 2000, : 654 - +
  • [6] Analyzing multi-level spatial association rules through a graph-based visualization
    Appice, A
    Buono, P
    INNOVATIONS IN APPLIED ARTIFICIAL INTELLIGENCE, 2005, 3533 : 448 - 458
  • [7] Visual Relation Detection with Multi-Level Attention
    Zheng, Sipeng
    Chen, Shizhe
    Jin, Qin
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 121 - 129
  • [8] Multi-Level Transformer-Based Social Relation Recognition
    Wang, Yuchen
    Qing, Linbo
    Wang, Zhengyong
    Cheng, Yongqiang
    Peng, Yonghong
    SENSORS, 2022, 22 (15)
  • [9] Saliency Detection: Multi-Level Combination Approach via Graph-Based Manifold Ranking
    Li, Cuiping
    Chen, Zhenxue
    Liu, Chengyun
    Zhao, Di
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017, : 604 - 609
  • [10] Relation Classification via Multi-Level Attention CNNs
    Wang, Linlin
    Cao, Zhu
    de Melo, Gerard
    Liu, Zhiyuan
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1298 - 1307