Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night

被引:53
|
作者
Arruda, Vinicius F. [1 ]
Paixao, Thiago M. [1 ,2 ]
Berriel, Rodrigo F. [1 ]
De Souza, Alberto F. [1 ]
Badue, Claudine [1 ]
Sebe, Nicu [3 ]
Oliveira-Santos, Thiago [1 ]
机构
[1] Univ Fed Espirito Santo, Vitoria, ES, Brazil
[2] Inst Fed Espirito Santo, Vitoria, ES, Brazil
[3] Univ Trento, Trento, Italy
关键词
Object Detection; Generative Adversarial Networks; Unpaired Image-to-Image Translation; Unsupervised Domain Adaptation;
D O I
10.1109/ijcnn.2019.8852008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning techniques have enabled the emergence of state-of-the-art models to address object detection tasks. However, these techniques are data-driven, delegating the accuracy to the training dataset which must resemble the images in the target task. The acquisition of a dataset involves annotating images, an arduous and expensive process, generally requiring time and manual effort. Thus, a challenging scenario arises when the target domain of application has no annotated dataset available, making tasks in such situation to lean on a training dataset of a different domain. Sharing this issue, object detection is a vital task for autonomous vehicles where the large amount of driving scenarios yields several domains of application requiring annotated data for the training process. In this work, a method for training a car detection system with annotated data from a source domain (day images) without requiring the image annotations of the target domain (night images) is presented. For that, a model based on Generative Adversarial Networks (GANs) is explored to enable the generation of an artificial dataset with its respective annotations. The artificial dataset (fake dataset) is created translating images from day-time domain to night-time domain. The fake dataset, which comprises annotated images of only the target domain (night images), is then used to train the car detector model. Experimental results showed that the proposed method achieved significant and consistent improvements, including the increasing by more than 10% of the detection performance when compared to the training with only the available annotated data (i.e., day images).
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Cross-domain object detection using minimized instance shift image–image translation
    Gen Liu
    Jin Han
    The Visual Computer, 2023, 39 : 5013 - 5026
  • [22] DUNIT: Detection-based Unsupervised Image-to-Image Translation
    Bhattacharjee, Deblina
    Kim, Seungryong
    Vizier, Guillaume
    Salzmann, Mathieu
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4786 - 4795
  • [23] Unsupervised Image-to-Image Translation Using Domain-Specific Variational Information Bound
    Kazemi, Hadi
    Soleymani, Sobhan
    Taherkhani, Fariborz
    Iranmanesh, Seyed Mehdi
    Nasrabadi, Nasser M.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
    Gomez, Raul
    Liu, Yahui
    De Nadai, Marco
    Karatzas, Dimosthenis
    Lepri, Bruno
    Sebe, Nicu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3164 - 3172
  • [25] Improved Nighttime Vehicle Detection Using the Cross-Domain Image Translation
    Guo, Feng
    Deng, Yihao
    Chang, Honglei
    Yu, Huayang
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (08)
  • [26] CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection
    Zhiqiang Shen
    Mingyang Huang
    Jianping Shi
    Zechun Liu
    Harsh Maheshwari
    Yutong Zheng
    Xiangyang Xue
    Marios Savvides
    Thomas S. Huang
    International Journal of Computer Vision, 2021, 129 : 761 - 780
  • [27] Unsupervised content and style learning for multimodal cross-domain image translation
    Lin, Zhijie
    Chen, Jingjing
    Ma, Xiaolong
    Li, Chao
    Zhang, Huiming
    Zhao, Lei
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [28] CDTD: A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection
    Shen, Zhiqiang
    Huang, Mingyang
    Shi, Jianping
    Liu, Zechun
    Maheshwari, Harsh
    Zheng, Yutong
    Xue, Xiangyang
    Savvides, Marios
    Huang, Thomas S.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (03) : 761 - 780
  • [29] Unsupervised Image-to-Image Translation with Generative Prior
    Yang, Shuai
    Jiang, Liming
    Liu, Ziwei
    Loy, Chen Change
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 18311 - 18320
  • [30] Cross-domain object detection using minimized instance shift image-image translation
    Liu, Gen
    Han, Jin
    VISUAL COMPUTER, 2023, 39 (10): : 5013 - 5026