Local context attention learning for fine-grained scene graph generation

被引:2
|
作者
Zhu, Xuhan [1 ,2 ]
Wang, Ruiping [1 ,3 ]
Lan, Xiangyuan [2 ]
Wang, Yaowei
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
关键词
Fine-grained scene graph generation; Local context; Location attention network; Local context-consistent label transfer;
D O I
10.1016/j.patcog.2024.110708
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained scene graph generation aims to parse the objects and their fine-grained relationships within scenes. Despite the significant progress in recent years, their performance is still limited by two major issues: (1) ambiguous perception under a global view; (2) the lack of reliable, fine-grained annotations. We argue that understanding the local context is important in addressing the two issues. However, previous works often overlook it, which limits their effectiveness in fine-grained scene graph generation. To tackle this challenge, we introduce a Local-context Attention Learning method that concentrates on local context and can generate high-reliability, fine-grained annotations. It comprises two components: (1) The Fine-grained Location Attention Network (FLAN), a multi-branch network that encompasses global and local branches, can attend to local informative context and perceive granularity levels in different regions, thereby adaptively enhancing the learning of fine-grained locations. (2) The Fine-grained Location Label Transfer (FLLT) method identifies coarse-grained labels inconsistent with the local context and determines which labels should be transferred through the global confidence thresholding strategy, finally transferring them to reliable local context-consistent fine-grained ones. Experiments conducted on the Visual Genome, OpenImage, and GQA200 datasets show that the proposed methods achieve significant improvements on the fine-grained scene graph generation task. By addressing the challenge mentioned above, our method also achieves state-of-the-art performances on the three datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Fine-Grained Predicates Learning for Scene Graph Generation
    Lyu, Xinyu
    Gao, Lianli
    Guo, Yuyu
    Zhao, Zhou
    Huang, Hao
    Shen, Heng Tao
    Song, Jingkuan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19445 - 19453
  • [2] Adaptive Fine-Grained Predicates Learning for Scene Graph Generation
    Lyu, Xinyu
    Gao, Lianli
    Zeng, Pengpeng
    Shen, Heng Tao
    Song, Jingkuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (11) : 13921 - 13940
  • [3] Hierarchical Memory Learning for Fine-Grained Scene Graph Generation
    Deng, Youming
    Li, Yansheng
    Zhang, Yongjun
    Xiang, Xiang
    Wang, Jian
    Chen, Jingdong
    Ma, Jiayi
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 266 - 283
  • [4] Fine-Grained Scene Graph Generation with Data Transfer
    Zhang, Ao
    Yao, Yuan
    Chen, Qianyu
    Ji, Wei
    Liu, Zhiyuan
    Sun, Maosong
    Chua, Tat-Seng
    COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 409 - 424
  • [5] Environment-Invariant Curriculum Relation Learning for Fine-Grained Scene Graph Generation
    Min, Yukuan
    Wu, Aming
    Deng, Cheng
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 13250 - 13261
  • [6] Fine-Grained Scene Graph Generation with Overlap Region and Geometrical Center
    Zhao, Y. Q.
    Jin, Z.
    Zhao, H. Y.
    Zhang, F.
    Tao, Z. W.
    Dou, C. F.
    Xu, X. H.
    Liu, D. H.
    COMPUTER GRAPHICS FORUM, 2022, 41 (07) : 359 - 370
  • [7] Fine-Grained Scene Graph Generation via Sample-Level Bias Prediction
    Li, Yansheng
    Wang, Tingzhu
    Wu, Kang
    Wang, Linlin
    Guo, Xin
    Wang, Wenbin
    COMPUTER VISION - ECCV 2024, PT XXVI, 2025, 15084 : 18 - 35
  • [8] Fine-grained attention for image caption generation
    Chang, Yan-Shuo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (03) : 2959 - 2971
  • [9] Fine-grained attention for image caption generation
    Yan-Shuo Chang
    Multimedia Tools and Applications, 2018, 77 : 2959 - 2971
  • [10] GRAPH FINE-GRAINED CONTRASTIVE REPRESENTATION LEARNING
    Tang, Hui
    Liang, Xun
    Guo, Yuhui
    Zheng, Xiangping
    Wu, Bo
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3478 - 3482