Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night

被引：53

作者：

Arruda, Vinicius F. ^{[1
]}

Paixao, Thiago M. ^{[1
,2
]}

Berriel, Rodrigo F. ^{[1
]}

De Souza, Alberto F. ^{[1
]}

Badue, Claudine ^{[1
]}

Sebe, Nicu ^{[3
]}

Oliveira-Santos, Thiago ^{[1
]}

机构：

[1] Univ Fed Espirito Santo, Vitoria, ES, Brazil

[2] Inst Fed Espirito Santo, Vitoria, ES, Brazil

[3] Univ Trento, Trento, Italy

来源：

2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2019年

关键词：

Object Detection; Generative Adversarial Networks; Unpaired Image-to-Image Translation; Unsupervised Domain Adaptation;

D O I：

10.1109/ijcnn.2019.8852008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning techniques have enabled the emergence of state-of-the-art models to address object detection tasks. However, these techniques are data-driven, delegating the accuracy to the training dataset which must resemble the images in the target task. The acquisition of a dataset involves annotating images, an arduous and expensive process, generally requiring time and manual effort. Thus, a challenging scenario arises when the target domain of application has no annotated dataset available, making tasks in such situation to lean on a training dataset of a different domain. Sharing this issue, object detection is a vital task for autonomous vehicles where the large amount of driving scenarios yields several domains of application requiring annotated data for the training process. In this work, a method for training a car detection system with annotated data from a source domain (day images) without requiring the image annotations of the target domain (night images) is presented. For that, a model based on Generative Adversarial Networks (GANs) is explored to enable the generation of an artificial dataset with its respective annotations. The artificial dataset (fake dataset) is created translating images from day-time domain to night-time domain. The fake dataset, which comprises annotated images of only the target domain (night images), is then used to train the car detector model. Experimental results showed that the proposed method achieved significant and consistent improvements, including the increasing by more than 10% of the detection performance when compared to the training with only the available annotated data (i.e., day images).

引用

页数：8

共 50 条

[31] Edge Sensitive Unsupervised Image-to-Image Translation
Akkaya, Ibrahim Batuhan
Halici, Ugur
2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
[32] Unsupervised Image-to-Image Translation with Style Consistency
Lai, Binxin
Wang, Yuan-Gen
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
[33] Rethinking the Truly Unsupervised Image-to-Image Translation
Baek, Kyungjune
Choi, Yunjey
Uh, Youngjung
Yoo, Jaejun
Shim, Hyunjung
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14134 - 14143
[34] Contrastive learning for unsupervised image-to-image translation
Lee, Hanbit
Seol, Jinseok
Lee, Sang-goo
Park, Jaehui
Shim, Junho
APPLIED SOFT COMPUTING, 2024, 151
[35] A night pavement crack detection method based on image-to-image translation
Liu, Chao
Xu, Boqiang
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (13) : 1737 - 1753
[36] MULTI-DOMAIN UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION WITH APPEARANCE ADAPTIVE CONVOLUTION
Jeong, Somi
Lee, Jiyoung
Sohn, Kwanghoon
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1750 - 1754
[37] Domain Adaptive Image-to-image Translation
Chen, Ying-Cong
Xu, Xiaogang
Jia, Jiaya
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5273 - 5282
[38] Single Cross-domain Semantic Guidance Network for Multimodal Unsupervised Image Translation
Lan, Jiaying
Cheng, Lianglun
Huang, Guoheng
Pun, Chi-Man
Yuan, Xiaochen
Lai, Shangyu
Liu, HongRui
Ling, Wing-Kuen
MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 165 - 177
[39] A framework for generalizing critical heat flux detection models using unsupervised image-to-image translation
Al-Hindawi, Firas
Soori, Tejaswi
Hu, Han
Siddiquee, Md. Mahfuzur Rahman
Yoon, Hyunsoo
Wu, Teresa
Sun, Ying
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
[40] Few-Shot Unsupervised Image-to-Image Translation
Liu, Ming-Yu
Huang, Xun
Mallya, Arun
Karras, Tero
Aila, Timo
Lehtinen, Jaakko
Kautz, Jan
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10550 - 10559

← 1 2 3 4 5 →