Local context attention learning for fine-grained scene graph generation

Cited by: 2
Authors
Zhu, Xuhan [1 ,2 ]
Wang, Ruiping [1 ,3 ]
Lan, Xiangyuan [2 ]
Wang, Yaowei
Affiliations
[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
Keywords
Fine-grained scene graph generation; Local context; Location attention network; Local context-consistent label transfer
DOI
10.1016/j.patcog.2024.110708
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained scene graph generation aims to parse the objects and their fine-grained relationships within scenes. Despite significant progress in recent years, the performance of existing methods is still limited by two major issues: (1) ambiguous perception under a global view; (2) the lack of reliable, fine-grained annotations. We argue that understanding the local context is important in addressing both issues. However, previous works often overlook it, which limits their effectiveness in fine-grained scene graph generation. To tackle this challenge, we introduce a Local-context Attention Learning method that concentrates on local context and can generate high-reliability, fine-grained annotations. It comprises two components: (1) The Fine-grained Location Attention Network (FLAN), a multi-branch network that encompasses global and local branches, can attend to local informative context and perceive granularity levels in different regions, thereby adaptively enhancing the learning of fine-grained locations. (2) The Fine-grained Location Label Transfer (FLLT) method identifies coarse-grained labels that are inconsistent with the local context, determines which of them should be transferred through a global confidence thresholding strategy, and finally transfers them to reliable fine-grained labels that are consistent with the local context. Experiments conducted on the Visual Genome, OpenImage, and GQA200 datasets show that the proposed methods achieve significant improvements on the fine-grained scene graph generation task. By addressing the challenge mentioned above, our method also achieves state-of-the-art performance on the three datasets.
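The label-transfer idea in the abstract can be illustrated with a small sketch. This is not the authors' FLLT implementation: the coarse-to-fine mapping, the function name `transfer_label`, the `local_probs` input (standing in for predicate confidences from some local-context model), the inconsistency test, and the threshold value are all assumptions made for illustration. It only shows the general pattern of replacing a coarse label with a fine-grained one when the local evidence disagrees with the coarse label and clears one threshold shared by all samples.

```python
from typing import Dict, List, Optional

# Hypothetical mapping from a coarse predicate to fine-grained candidates.
# The abstract does not give this mapping; it is assumed for illustration.
COARSE_TO_FINE: Dict[str, List[str]] = {
    "on": ["standing on", "sitting on", "parked on"],
}


def transfer_label(
    label: str,
    local_probs: Dict[str, float],
    threshold: float = 0.6,
    coarse_to_fine: Optional[Dict[str, List[str]]] = None,
) -> str:
    """Return a fine-grained label if the local evidence supports it.

    `local_probs` holds assumed predicate confidences from a local-context
    model. `threshold` is a single value applied to every sample, which is
    one possible reading of "global confidence thresholding".
    """
    mapping = COARSE_TO_FINE if coarse_to_fine is None else coarse_to_fine
    candidates = mapping.get(label)
    if not candidates:
        # The label is not coarse (or has no known refinement): keep it.
        return label

    # Pick the fine-grained candidate with the highest local confidence.
    best = max(candidates, key=lambda p: local_probs.get(p, 0.0))
    best_conf = local_probs.get(best, 0.0)

    # Assumed inconsistency test: the local context prefers the fine-grained
    # candidate over the annotated coarse label.
    inconsistent = best_conf > local_probs.get(label, 0.0)

    # Transfer only when the candidate also clears the shared threshold.
    if inconsistent and best_conf >= threshold:
        return best
    return label
```

For example, under these assumptions `transfer_label("on", {"on": 0.2, "sitting on": 0.7})` yields `"sitting on"`, while a sample whose best fine-grained candidate scores below the threshold keeps its original coarse label.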
Pages: 13
Related papers
50 in total
  • [31] Two-Branch Attention Learning for Fine-Grained Class Incremental Learning
    Guo, Jiaqi
    Qi, Guanqiu
    Xie, Shuiqing
    Li, Xiangyuan
    ELECTRONICS, 2021, 10 (23)
  • [32] Learning to Control the Fine-grained Sentiment for Story Ending Generation
    Luo, Fuli
    Dai, Damai
    Yang, Pengcheng
    Liu, Tianyu
    Chang, Baobao
    Sui, Zhifang
    Sun, Xu
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019: 6020 - 6026
  • [33] Fine-Grained Graph Learning for Multi-View Subspace Clustering
    Wang, Yidi
    Pei, Xiaobing
    Zhan, Haoxi
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (04): 2804 - 2815
  • [34] FTAFace: Context-enhanced Face Detector with Fine-grained Task Attention
    Wang, Deyu
    Wen, Dongchao
    Tao, Wei
    Yin, Lingxiao
    Chen, Tse-Wei
    Ito, Tadayuki
    Osa, Kinya
    Kato, Masami
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021: 3427 - 3436
  • [35] Fine-Grained Early Frequency Attention for Deep Speaker Representation Learning
    Hajavi, A.
    Etemad, A.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (06): 1413 - 1425
  • [36] Attribute-Aware Attention Model for Fine-grained Representation Learning
    Han, Kai
    Guo, Jianyuan
    Zhang, Chao
    Zhu, Mingjian
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018: 2040 - 2048
  • [37] Local Alignments for Fine-Grained Categorization
    Gavves, Efstratios
    Fernando, Basura
    Snoek, Cees G. M.
    Smeulders, Arnold W. M.
    Tuytelaars, Tinne
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (02) : 191 - 212
  • [38] Local Alignments for Fine-Grained Categorization
    Gavves, Efstratios
    Fernando, Basura
    Snoek, Cees G. M.
    Smeulders, Arnold W. M.
    Tuytelaars, Tinne
    International Journal of Computer Vision, 2015, 111 : 191 - 212
  • [39] Fine-grained imbalanced leukocyte classification with global-local attention transformer
    Chen, Ben
    Qin, Feiwei
    Shao, Yanli
    Cao, Jin
    Peng, Yong
    Ge, Ruiquan
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [40] Fine-grained Question-Answer sentiment classification with hierarchical graph attention network
    Zeng, Jiandian
    Liu, Tianyi
    Jia, Weijia
    Zhou, Jiantao
    NEUROCOMPUTING, 2021, 457 : 214 - 224