Local context attention learning for fine-grained scene graph generation

被引:2
|
作者
Zhu, Xuhan [1 ,2 ]
Wang, Ruiping [1 ,3 ]
Lan, Xiangyuan [2 ]
Wang, Yaowei
机构
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
关键词
Fine-grained scene graph generation; Local context; Location attention network; Local context-consistent label transfer;
D O I
10.1016/j.patcog.2024.110708
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained scene graph generation aims to parse the objects and their fine-grained relationships within scenes. Despite the significant progress in recent years, their performance is still limited by two major issues: (1) ambiguous perception under a global view; (2) the lack of reliable, fine-grained annotations. We argue that understanding the local context is important in addressing the two issues. However, previous works often overlook it, which limits their effectiveness in fine-grained scene graph generation. To tackle this challenge, we introduce a Local-context Attention Learning method that concentrates on local context and can generate high-reliability, fine-grained annotations. It comprises two components: (1) The Fine-grained Location Attention Network (FLAN), a multi-branch network that encompasses global and local branches, can attend to local informative context and perceive granularity levels in different regions, thereby adaptively enhancing the learning of fine-grained locations. (2) The Fine-grained Location Label Transfer (FLLT) method identifies coarse-grained labels inconsistent with the local context and determines which labels should be transferred through the global confidence thresholding strategy, finally transferring them to reliable local context-consistent fine-grained ones. Experiments conducted on the Visual Genome, OpenImage, and GQA200 datasets show that the proposed methods achieve significant improvements on the fine-grained scene graph generation task. By addressing the challenge mentioned above, our method also achieves state-of-the-art performances on the three datasets.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Towards Fine-grained Flow Forecasting: A Graph Attention Approach for Bike Sharing Systems
    He, Suining
    Shin, Kang G.
    WEB CONFERENCE 2020: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2020), 2020, : 88 - 98
  • [42] Graph Analytics Through Fine-Grained Parallelism
    Shang, Zechao
    Li, Feifei
    Yu, Jeffrey Xu
    Zhang, Zhiwei
    Cheng, Hong
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 463 - 478
  • [43] Construct Fine-Grained Geospatial Knowledge Graph
    Wei, Bo
    Guo, Xi
    Wu, Ziyan
    Zhao, Jing
    Zou, Qiping
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2023 INTERNATIONAL WORKSHOPS, BDMS 2023, BDQM 2023, GDMA 2023, BUNDLERS 2023, 2023, 13922 : 267 - 282
  • [44] Towards Fine-Grained Concept Generation
    Li, Chenguang
    Liang, Jiaqing
    Xiao, Yanghua
    Jiang, Haiyun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 986 - 997
  • [45] Fine-grained Expressivity of Graph Neural Networks
    Boeker, Jan
    Levie, Ron
    Huang, Ningyuan
    Villar, Soledad
    Morris, Christopher
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [46] Fine-grained Pseudo Labels for Scene Text Recognition
    Li, Xiaoyu
    Chen, Xiaoxue
    Huang, Zuming
    Xie, Lele
    Chen, Jingdong
    Yang, Ming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5786 - 5795
  • [47] Fine-Grained Language Identification in Scene Text Images
    Li, Yongrui
    Wu, Shilian
    Yu, Jun
    Wang, Zengfu
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4573 - 4581
  • [48] Knowledge Mining with Scene Text for Fine-Grained Recognition
    Wang, Hao
    Liao, Junchao
    Cheng, Tianheng
    Gao, Zewen
    Liu, Hao
    Ren, Bo
    Bai, Xiang
    Liu, Wenyu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4614 - 4623
  • [49] Attention Bilinear Pooling for Fine-Grained Classification
    Wang, Wenqian
    Zhang, Jun
    Wang, Fenglei
    SYMMETRY-BASEL, 2019, 11 (08):
  • [50] Fine-Grained Machine Teaching with Attention Modeling
    Liu, Jiacheng
    Hou, Xiaofeng
    Tang, Feilong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 2585 - 2592