Local context attention learning for fine-grained scene graph generation

Cited by: 2
Authors
Zhu, Xuhan [1 ,2 ]
Wang, Ruiping [1 ,3 ]
Lan, Xiangyuan [2 ]
Wang, Yaowei
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
Keywords
Fine-grained scene graph generation; Local context; Location attention network; Local context-consistent label transfer
DOI
10.1016/j.patcog.2024.110708
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained scene graph generation aims to parse the objects in a scene and the fine-grained relationships between them. Despite significant progress in recent years, the performance of existing methods is still limited by two major issues: (1) ambiguous perception under a global view; (2) the lack of reliable, fine-grained annotations. We argue that understanding the local context is important for addressing both issues. However, previous works often overlook it, which limits their effectiveness in fine-grained scene graph generation. To tackle this challenge, we introduce a Local-context Attention Learning method that concentrates on local context and can generate highly reliable, fine-grained annotations. It comprises two components: (1) The Fine-grained Location Attention Network (FLAN), a multi-branch network with global and local branches, attends to informative local context and perceives the granularity levels of different regions, thereby adaptively enhancing the learning of fine-grained locations. (2) The Fine-grained Location Label Transfer (FLLT) method identifies coarse-grained labels that are inconsistent with the local context, determines which labels should be transferred through a global confidence thresholding strategy, and finally transfers them to reliable fine-grained labels that are consistent with the local context. Experiments conducted on the Visual Genome, OpenImage, and GQA200 datasets show that the proposed methods achieve significant improvements on the fine-grained scene graph generation task. By addressing the challenge mentioned above, our method also achieves state-of-the-art performance on the three datasets.
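The abstract does not say how FLLT detects inconsistent labels or how its threshold is chosen, so the following is only an illustrative sketch of the general idea of confidence-thresholded label transfer, not the authors' implementation. The function name `transfer_labels`, the predicate names, the score values, and the single fixed `threshold` are all assumptions made for this example.

```python
from typing import Dict, List, Set


def transfer_labels(
    labels: List[str],
    fine_scores: List[Dict[str, float]],
    coarse_labels: Set[str],
    threshold: float,
) -> List[str]:
    """Replace a coarse label with the top-scoring fine-grained candidate
    when that candidate's confidence reaches one shared (global) threshold.

    All inputs are hypothetical: `fine_scores[i]` stands in for a model's
    confidence over fine-grained predicates for sample i.
    """
    new_labels = []
    for label, scores in zip(labels, fine_scores):
        chosen = label
        # Only coarse-grained annotations are candidates for transfer.
        if label in coarse_labels and scores:
            candidate = max(scores, key=scores.get)
            # Transfer only if the candidate disagrees with the annotation
            # and is confident enough under the global threshold.
            if candidate != label and scores[candidate] >= threshold:
                chosen = candidate
        new_labels.append(chosen)
    return new_labels


# Toy data (invented for illustration).
labels = ["on", "on", "near", "riding"]
fine_scores = [
    {"standing on": 0.91, "sitting on": 0.05},
    {"standing on": 0.40, "sitting on": 0.35},
    {"parked next to": 0.80},
    {"standing on": 0.99},
]
result = transfer_labels(labels, fine_scores, {"on", "near"}, threshold=0.75)
print(result)  # ['standing on', 'on', 'parked next to', 'riding']
```

In this toy run, the second sample keeps its coarse label because no candidate reaches the threshold, and the fourth is left alone because "riding" is not treated as coarse.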
Pages: 13