Adaptive Feature Learning for Unbiased Scene Graph Generation

被引:1
|
作者
Yang, Jiarui [1 ,2 ]
Wang, Chuan [1 ,3 ,4 ]
Yang, Liang [5 ]
Jiang, Yuchen [6 ]
Cao, Angelina [7 ]
机构
[1] Chinese Acad Sci, Inst Informat Engn, Beijing 100085, Peoples R China
[2] Univ Chinese Acad Sci, Sch Cyber Secur, Beijing 100049, Peoples R China
[3] Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China
[4] Shenzhen Key Lab Media Secur, Shenzhen 518060, Peoples R China
[5] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
[6] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen 518107, Peoples R China
[7] Montgomery Blair High Sch, Magnet STEM Program, Silver Spring, MD 20901 USA
关键词
Unbiased scene graph generation; adaptive message passing; bi-level unbiased training; feature enhancement;
D O I
10.1109/TIP.2024.3374644
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Scene Graph Generation (SGG) aims to detect all objects and identify their pairwise relationships in the scene. Recently, tremendous progress has been made in exploring better context relationship representations. Previous work mainly focuses on contextual information aggregation and uses de-biasing strategies on samples to eliminate the preference for head predicates. However, there remain challenges caused by indeterminate feature training. Overlooking the label confusion problem in feature training easily results in a messy feature distribution among the confused categories, thereby affecting the prediction of predicates. To alleviate the aforementioned problem, in this paper, we focus on enhancing predicate representation learning. Firstly, we propose a novel Adaptive Message Passing (AMP) network to dynamically conduct information propagation among neighbors. AMP provides discriminating representations for neighbor nodes under the view of de-noising and adaptive aggregation. Furthermore, we construct a feature-assisted training paradigm alongside the predicate classification branch, guiding predicate feature learning to the corresponding feature space. Moreover, to alleviate biased prediction caused by the long-tailed class distribution and the interference of confused labels, we design a Bi-level Curriculum learning scheme (BiC). The BiC separately considers the training from the feature learning and de-biasing levels, preserving discriminating representations of different predicates while resisting biased predictions. Results on multiple SGG datasets show that our proposed method AMP-BiC has superior comprehensive performance, demonstrating its effectiveness.
引用
收藏
页码:2252 / 2265
页数:14
相关论文
共 50 条
  • [31] Unbiased scene graph generation via head-tail cooperative network with self-supervised learning
    Wang, Lei
    Yuan, Zejian
    Lu, Yao
    Chen, Badong
    IMAGE AND VISION COMPUTING, 2024, 151
  • [32] Predicate Correlation Learning for Scene Graph Generation
    Tao, Leitian
    Mi, Li
    Li, Nannan
    Cheng, Xianhang
    Hu, Yaosi
    Chen, Zhenzhong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 4173 - 4185
  • [33] Unbiased Scene Graph Generation via Two-Stage Causal Modeling
    Sun, Shuzhou
    Zhi, Shuaifeng
    Liao, Qing
    Heikkila, Janne
    Liu, Li
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 12562 - 12580
  • [34] Resistance Training Using Prior Bias: Toward Unbiased Scene Graph Generation
    Chen, Chao
    Zhan, Yibing
    Yu, Baosheng
    Liu, Liu
    Luo, Yong
    Du, Bo
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 212 - 220
  • [35] Constrained Structure Learning for Scene Graph Generation
    Liu, Daqi
    Bober, Miroslaw
    Kittler, Josef
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (10) : 11588 - 11599
  • [36] PPDL: Predicate Probability Distribution based Loss for Unbiased Scene Graph Generation
    Li, Wei
    Zhang, Haiwei
    Bai, Qijie
    Zhao, Guoqing
    Jiang, Ning
    Yuan, Xiaojie
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19425 - 19434
  • [37] Bridging Visual and Textual Semantics: Towards Consistency for Unbiased Scene Graph Generation
    Zhang, Ruonan
    An, Gaoyun
    Hao, Yiqing
    Wu, Dapeng Oliver
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (11) : 7102 - 7119
  • [38] Transformer networks with adaptive inference for scene graph generation
    Yini Wang
    Yongbin Gao
    Wenjun Yu
    Ruyan Guo
    Weibing Wan
    Shuqun Yang
    Bo Huang
    Applied Intelligence, 2023, 53 : 9621 - 9633
  • [39] Transformer networks with adaptive inference for scene graph generation
    Wang, Yini
    Gao, Yongbin
    Yu, Wenjun
    Guo, Ruyan
    Wan, Weibing
    Yang, Shuqun
    Huang, Bo
    APPLIED INTELLIGENCE, 2023, 53 (08) : 9621 - 9633
  • [40] Scene Adaptive Context Modeling and Balanced Relation Prediction for Scene Graph Generation
    Xu, Kai
    Wang, Lichun
    Li, Shuang
    Gao, Tong
    Yin, Baocai
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, 21 (03)