Local context attention learning for fine-grained scene graph generation

Cited by: 2

Authors:

Zhu, Xuhan [1,2]

Wang, Ruiping [1,3]

Lan, Xiangyuan [2]

Wang, Yaowei

Affiliations:

[1] Univ Chinese Acad Sci, Beijing 100049, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518000, Peoples R China

[3] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China

Source:

PATTERN RECOGNITION | 2024 / Vol. 156

Keywords:

Fine-grained scene graph generation; Local context; Location attention network; Local context-consistent label transfer;

DOI:

10.1016/j.patcog.2024.110708

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Fine-grained scene graph generation aims to parse the objects and their fine-grained relationships within scenes. Despite significant progress in recent years, the performance of existing methods is still limited by two major issues: (1) ambiguous perception under a global view; (2) the lack of reliable, fine-grained annotations. We argue that understanding the local context is important in addressing both issues. However, previous works often overlook it, which limits their effectiveness in fine-grained scene graph generation. To tackle this challenge, we introduce a Local-context Attention Learning method that concentrates on local context and can generate high-reliability, fine-grained annotations. It comprises two components: (1) The Fine-grained Location Attention Network (FLAN), a multi-branch network that encompasses global and local branches, can attend to local informative context and perceive granularity levels in different regions, thereby adaptively enhancing the learning of fine-grained locations. (2) The Fine-grained Location Label Transfer (FLLT) method identifies coarse-grained labels inconsistent with the local context, determines which labels should be transferred through a global confidence thresholding strategy, and finally transfers them to reliable, local context-consistent fine-grained ones. Experiments conducted on the Visual Genome, OpenImage, and GQA200 datasets show that the proposed methods achieve significant improvements on the fine-grained scene graph generation task. By addressing the challenge mentioned above, our method also achieves state-of-the-art performance on the three datasets.


Pages: 13

