One-stage detection for unsupervised domain adaptation with efficient multi-scale attention and confidence-augmented combination

被引:0
|
作者
Xiang, Nan [1 ,2 ,3 ]
Liu, Qianxi [4 ]
Jiang, Yaoyao [4 ]
机构
[1] Chongqing Univ Technol, Liangjiang Int Coll, Chongqing, Peoples R China
[2] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China
[3] Chongqing Jialing Special Equipment Co Ltd, Chongqing, Peoples R China
[4] Chongqing Univ Technol, Coll Comp Sci & Engn, Chongqing, Peoples R China
基金
中国博士后科学基金;
关键词
unsupervised domain adaptation; object detection; unsupervised domain adaptation for object detection;
D O I
10.1117/1.JEI.33.6.063025
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised domain adaptation for object detection leverages a labeled domain to learn an object detector generalizing to a different domain free of annotations. We propose efficient multi-scale attention, confidence mixing, augmentation, and combination (ECAC), an adaptive object detector learning method based on a region-level confidence sample mixing strategy. Compared with the current methods, our approach crops high-confidence detection regions from both the source and target domains, augments them, and combines them to generate composite samples. In addition, consistency loss is utilized to solve the domain adaptation problem. Furthermore, we introduce the efficient multi-scale attention (EMA) into the detector. To retain channel information and reduce computational overhead, EMA attention restructures part of the channels into the batch dimension and groups the channel dimension into multiple sub-features, ensuring spatial semantic features are evenly distributed within each feature group. EMA employs a shared 1 x 1 convolution branch from the CA attention module, along with a parallel 3 x 3 convolution kernel to aggregate multi-scale spatial structure information. This approach effectively enhances the model's focus on region-level features by integrating local and global information with multi-scale parallel sub-networks and cross-spatial learning. For pseudo-label filtering, we progressively transition from a loose to a stricter confidence threshold. Initially, this allows more pseudo-labels, facilitating the detector's learning of target domain representations. As training progresses, stricter thresholds are applied to select more reliable pseudo-labels, gradually filtering out inaccurate pseudo-detections. Our extensive experiments on three datasets demonstrate that ECAC achieves state-of-the-art performance on two of them. On the third dataset, our method improves the mean average precision by over 2% compared with the latest methods. (c) 2024 SPIE and IS&T
引用
收藏
页数:19
相关论文
共 50 条
  • [21] CADA: Multi-scale Collaborative Adversarial Domain Adaptation for unsupervised optic disc and cup segmentation
    Liu, Peng
    Tran, Charlie T.
    Kong, Bin
    Fang, Ruogu
    NEUROCOMPUTING, 2022, 469 : 209 - 220
  • [22] CADA: Multi-scale Collaborative Adversarial Domain Adaptation for unsupervised optic disc and cup segmentation
    Liu, Peng
    Tran, Charlie T.
    Kong, Bin
    Fang, Ruogu
    Neurocomputing, 2022, 469 : 209 - 220
  • [23] Unsupervised Domain Adaptation Fundus Image Segmentation via Multi-Scale Adaptive Adversarial Learning
    Zhou, Wei
    Ji, Jianhang
    Cui, Wei
    Wang, Yingyuan
    Yi, Yugen
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (10) : 5792 - 5803
  • [24] One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features
    Nguyen, Trung Thanh
    Kawanishi, Yasutomo
    Komamizu, Takahiro
    Ide, Ichiro
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, FG 2024, 2024,
  • [25] Self-Training and Adversarial Background Regularization for Unsupervised Domain Adaptive One-Stage Object Detection
    Kim, Seunghyeon
    Choi, Jaehoon
    Kim, Taekyung
    Kim, Changick
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 6091 - 6100
  • [26] UMS2-ODNet: Unified-scale domain adaptation mechanism driven object detection network with multi-scale attention
    Li, Yuze
    Zhang, Yan
    Yang, Chunling
    Chen, Yu
    NEURAL NETWORKS, 2025, 181
  • [27] Multi-scale feature learning and temporal probing strategy for one-stage temporal action localization
    Yao, Leiyue
    Yang, Wei
    Huang, Wei
    Jiang, Nan
    Zhou, Bingbing
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (07) : 4092 - 4112
  • [28] FAGD-Net: Feature-Augmented Grasp Detection Network Based on Efficient Multi-Scale Attention and Fusion Mechanisms
    Zhong, Xungao
    Liu, Xianghui
    Gong, Tao
    Sun, Yuan
    Hu, Huosheng
    Liu, Qiang
    APPLIED SCIENCES-BASEL, 2024, 14 (12):
  • [29] Prototype-guided multi-scale domain adaptation for Alzheimer's disease detection
    Cai, Hongshun
    Zhang, Qiongmin
    Long, Ying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 154
  • [30] An Efficient Implementation of FPGA-based Object Detection Using Multi-scale Attention
    Furuta, Masanori
    Ban, Koichiro
    Kobayashi, Daisuke
    Shibata, Tomoyuki
    2021 IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2021, : 321 - 325