One-stage detection for unsupervised domain adaptation with efficient multi-scale attention and confidence-augmented combination

Cited by: 0

Authors:

Xiang, Nan [1,2,3]

Liu, Qianxi [4]

Jiang, Yaoyao [4]

Affiliations:

[1] Chongqing Univ Technol, Liangjiang Int Coll, Chongqing, Peoples R China

[2] Chongqing Univ, Coll Comp Sci, Chongqing, Peoples R China

[3] Chongqing Jialing Special Equipment Co Ltd, Chongqing, Peoples R China

[4] Chongqing Univ Technol, Coll Comp Sci & Engn, Chongqing, Peoples R China

Source:

JOURNAL OF ELECTRONIC IMAGING | 2024, Vol. 33, No. 6

Funding:

China Postdoctoral Science Foundation

Keywords:

unsupervised domain adaptation; object detection; unsupervised domain adaptation for object detection;

DOI:

10.1117/1.JEI.33.6.063025

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

Unsupervised domain adaptation for object detection leverages a labeled source domain to learn an object detector that generalizes to a different, unannotated target domain. We propose efficient multi-scale attention, confidence mixing, augmentation, and combination (ECAC), an adaptive object detector learning method based on a region-level confidence sample mixing strategy. Unlike current methods, our approach crops high-confidence detection regions from both the source and target domains, augments them, and combines them to generate composite samples. In addition, a consistency loss is used to address the domain adaptation problem. Furthermore, we introduce efficient multi-scale attention (EMA) into the detector. To retain channel information and reduce computational overhead, EMA reshapes part of the channels into the batch dimension and groups the channel dimension into multiple sub-features, ensuring that spatial semantic features are evenly distributed within each feature group. EMA employs a shared 1 x 1 convolution branch from the CA attention module, along with a parallel 3 x 3 convolution kernel, to aggregate multi-scale spatial structure information. This design enhances the model's focus on region-level features by integrating local and global information through multi-scale parallel sub-networks and cross-spatial learning. For pseudo-label filtering, we progressively transition from a loose to a stricter confidence threshold. Initially, the loose threshold admits more pseudo-labels, which helps the detector learn target domain representations. As training progresses, stricter thresholds select more reliable pseudo-labels, gradually filtering out inaccurate pseudo-detections. Our extensive experiments on three datasets demonstrate that ECAC achieves state-of-the-art performance on two of them. On the third dataset, our method improves the mean average precision by over 2% compared with the latest methods.
(c) 2024 SPIE and IS&T
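
The loose-to-strict pseudo-label filtering described in the abstract can be illustrated with a minimal sketch. The abstract does not give the schedule or the threshold values, so the linear ramp, the 0.3 and 0.8 endpoints, and the names `Detection`, `confidence_threshold`, and `filter_pseudo_labels` are all assumptions made for illustration, not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass
class Detection:
    """Hypothetical pseudo-detection: a box plus the detector's confidence."""
    box: tuple  # (x1, y1, x2, y2)
    score: float


def confidence_threshold(step, total_steps, loose=0.3, strict=0.8):
    """Move linearly from a loose to a strict threshold over training.

    The linear shape and the 0.3 / 0.8 endpoints are illustrative
    assumptions; the abstract only states that the threshold tightens.
    """
    if total_steps <= 1:
        return strict
    # Fraction of training completed, clamped to [0, 1].
    progress = min(max(step / (total_steps - 1), 0.0), 1.0)
    return loose + (strict - loose) * progress


def filter_pseudo_labels(detections, step, total_steps):
    """Keep only detections whose score meets the current threshold."""
    threshold = confidence_threshold(step, total_steps)
    return [d for d in detections if d.score >= threshold]


detections = [
    Detection((0, 0, 10, 10), 0.35),
    Detection((5, 5, 20, 20), 0.60),
    Detection((8, 2, 30, 25), 0.90),
]

# Early in training the loose threshold admits every detection;
# at the end only the most confident one survives.
early = filter_pseudo_labels(detections, step=0, total_steps=100)
late = filter_pseudo_labels(detections, step=99, total_steps=100)
print(len(early), len(late))
```

With these assumed values, all three detections pass at the first step and only the highest-scoring one passes at the last step, which mirrors the progression from permissive to reliable pseudo-labels that the abstract describes.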


Pages: 19
