CIGAR: Cross-Modality Graph Reasoning for Domain Adaptive Object Detection

被引:17
|
作者
Liu, Yabo [1 ,2 ]
Wang, Jinghua [1 ]
Huang, Chao [3 ]
Wang, Yaowei [2 ]
Xu, Yong [1 ,2 ]
机构
[1] Harbin Inst Technol, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen Campus, Shenzhen, Peoples R China
关键词
D O I
10.1109/CVPR52729.2023.02277
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unsupervised domain adaptive object detection (UDA-OD) aims to learn a detector by generalizing knowledge from a labeled source domain to an unlabeled target domain. Though the existing graph-based methods for UDA-OD perform well in some cases, they cannot learn a proper node set for the graph. In addition, these methods build the graph solely based on the visual features and do not consider the linguistic knowledge carried by the semantic prototypes, e.g., dataset labels. To overcome these problems, we propose a cross-modality graph reasoning adaptation (CIGAR) method to take advantage of both visual and linguistic knowledge. Specifically, our method performs cross-modality graph reasoning between the linguistic modality graph and visual modality graphs to enhance their representations. We also propose a discriminative feature selector to find the most discriminative features and take them as the nodes of the visual graph for both efficiency and effectiveness. In addition, we employ the linguistic graph matching loss to regulate the update of linguistic graphs and maintain their semantic representation during the training process. Comprehensive experiments validate the effectiveness of our proposed CIGAR.
引用
收藏
页码:23776 / 23786
页数:11
相关论文
共 50 条
  • [1] Cross-Modality Object Detection Based on DETR
    Huang, Xinyi
    Ma, Guochun
    IEEE ACCESS, 2025, 13 : 51220 - 51230
  • [2] MCAFNet: Multiscale cross-modality adaptive fusion network for multispectral object detection
    Zheng, Shangpo
    Liu, Junfeng
    Jun, Zeng
    DIGITAL SIGNAL PROCESSING, 2025, 159
  • [3] Cross-Modality 3D Object Detection
    Zhu, Ming
    Ma, Chao
    Ji, Pan
    Yang, Xiaokang
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 3771 - 3780
  • [4] Adaptive graph reasoning network for object detection
    Zhong, Xinfang
    Kuang, Wenlan
    Li, Zhixin
    IMAGE AND VISION COMPUTING, 2024, 151
  • [5] Cross-Modality Learning by Exploring Modality Interactions for Emotion Reasoning
    Tran, Thi-Dung
    Ho, Ngoc-Huynh
    Pant, Sudarshan
    Yang, Hyung-Jeong
    Kim, Soo-Hyung
    Lee, Gueesang
    IEEE ACCESS, 2023, 11 : 56634 - 56648
  • [6] Cross-Modality Data Augmentation for Aerial Object Detection with Representation Learning
    Wei, Chiheng
    Bai, Lianfa
    Chen, Xiaoyu
    Han, Jing
    REMOTE SENSING, 2024, 16 (24)
  • [7] Task-Decoupled Knowledge Transfer for Cross-Modality Object Detection
    Wei, Chiheng
    Bai, Lianfa
    Chen, Xiaoyu
    Han, Jing
    ENTROPY, 2023, 25 (08)
  • [8] Cross-Domain and Cross-Modality Transfer Learning for Multi-domain and Multi-modality Event Detection
    Yang, Zhenguo
    Cheng, Min
    Li, Qing
    Li, Yukun
    Lin, Zehang
    Liu, Wenyin
    WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT I, 2017, 10569 : 516 - 523
  • [9] Ship detection and recognition in SAR images with cross-modality domain adaption
    Song Y.
    Li J.
    Tian T.
    Tian J.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2022, 50 (11): : 107 - 113
  • [10] Self-Attentive Spatial Adaptive Normalization for Cross-Modality Domain Adaptation
    Tomar, Devavrat
    Lortkipanidze, Manana
    Vray, Guillaume
    Bozorgtabar, Behzad
    Thiran, Jean-Philippe
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (10) : 2926 - 2938