Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night

被引：53

作者：

Arruda, Vinicius F. ^{[1
]}

Paixao, Thiago M. ^{[1
,2
]}

Berriel, Rodrigo F. ^{[1
]}

De Souza, Alberto F. ^{[1
]}

Badue, Claudine ^{[1
]}

Sebe, Nicu ^{[3
]}

Oliveira-Santos, Thiago ^{[1
]}

机构：

[1] Univ Fed Espirito Santo, Vitoria, ES, Brazil

[2] Inst Fed Espirito Santo, Vitoria, ES, Brazil

[3] Univ Trento, Trento, Italy

来源：

2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2019年

关键词：

Object Detection; Generative Adversarial Networks; Unpaired Image-to-Image Translation; Unsupervised Domain Adaptation;

D O I：

10.1109/ijcnn.2019.8852008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning techniques have enabled the emergence of state-of-the-art models to address object detection tasks. However, these techniques are data-driven, delegating the accuracy to the training dataset which must resemble the images in the target task. The acquisition of a dataset involves annotating images, an arduous and expensive process, generally requiring time and manual effort. Thus, a challenging scenario arises when the target domain of application has no annotated dataset available, making tasks in such situation to lean on a training dataset of a different domain. Sharing this issue, object detection is a vital task for autonomous vehicles where the large amount of driving scenarios yields several domains of application requiring annotated data for the training process. In this work, a method for training a car detection system with annotated data from a source domain (day images) without requiring the image annotations of the target domain (night images) is presented. For that, a model based on Generative Adversarial Networks (GANs) is explored to enable the generation of an artificial dataset with its respective annotations. The artificial dataset (fake dataset) is created translating images from day-time domain to night-time domain. The fake dataset, which comprises annotated images of only the target domain (night images), is then used to train the car detector model. Experimental results showed that the proposed method achieved significant and consistent improvements, including the increasing by more than 10% of the detection performance when compared to the training with only the available annotated data (i.e., day images).

引用

页数：8

共 50 条

[21] Cross-domain object detection using minimized instance shift image–image translation
Gen Liu
Jin Han
The Visual Computer, 2023, 39 : 5013 - 5026
[22] DUNIT: Detection-based Unsupervised Image-to-Image Translation
Bhattacharjee, Deblina
Kim, Seungryong
Vizier, Guillaume
Salzmann, Mathieu
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4786 - 4795
[23] Unsupervised Image-to-Image Translation Using Domain-Specific Variational Information Bound
Kazemi, Hadi
Soleymani, Sobhan
Taherkhani, Fariborz
Iranmanesh, Seyed Mehdi
Nasrabadi, Nasser M.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[24] Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Gomez, Raul
Liu, Yahui
De Nadai, Marco
Karatzas, Dimosthenis
Lepri, Bruno
Sebe, Nicu
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3164 - 3172
[25] Improved Nighttime Vehicle Detection Using the Cross-Domain Image Translation
Guo, Feng
Deng, Yihao
Chang, Honglei
Yu, Huayang
JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (08)
[26] CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection
Zhiqiang Shen
Mingyang Huang
Jianping Shi
Zechun Liu
Harsh Maheshwari
Yutong Zheng
Xiangyang Xue
Marios Savvides
Thomas S. Huang
International Journal of Computer Vision, 2021, 129 : 761 - 780
[27] Unsupervised content and style learning for multimodal cross-domain image translation
Lin, Zhijie
Chen, Jingjing
Ma, Xiaolong
Li, Chao
Zhang, Huiming
Zhao, Lei
SCIENTIFIC REPORTS, 2024, 14 (01):
[28] CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection
Shen, Zhiqiang
Huang, Mingyang
Shi, Jianping
Liu, Zechun
Maheshwari, Harsh
Zheng, Yutong
Xue, Xiangyang
Savvides, Marios
Huang, Thomas S.
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (03) : 761 - 780
[29] Unsupervised Image-to-Image Translation with Generative Prior
Yang, Shuai
Jiang, Liming
Liu, Ziwei
Loy, Chen Change
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
[30] Cross-domain object detection using minimized instance shift image-image translation
Liu, Gen
Han, Jin
VISUAL COMPUTER, 2023, 39 (10): : 5013 - 5026

← 1 2 3 4 5 →