Cross-Domain Car Detection Using Unsupervised Image-to-Image Translation: From Day to Night

被引:53
|
作者
Arruda, Vinicius F. [1 ]
Paixao, Thiago M. [1 ,2 ]
Berriel, Rodrigo F. [1 ]
De Souza, Alberto F. [1 ]
Badue, Claudine [1 ]
Sebe, Nicu [3 ]
Oliveira-Santos, Thiago [1 ]
机构
[1] Univ Fed Espirito Santo, Vitoria, ES, Brazil
[2] Inst Fed Espirito Santo, Vitoria, ES, Brazil
[3] Univ Trento, Trento, Italy
关键词
Object Detection; Generative Adversarial Networks; Unpaired Image-to-Image Translation; Unsupervised Domain Adaptation;
D O I
10.1109/ijcnn.2019.8852008
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning techniques have enabled the emergence of state-of-the-art models to address object detection tasks. However, these techniques are data-driven, delegating the accuracy to the training dataset which must resemble the images in the target task. The acquisition of a dataset involves annotating images, an arduous and expensive process, generally requiring time and manual effort. Thus, a challenging scenario arises when the target domain of application has no annotated dataset available, making tasks in such situation to lean on a training dataset of a different domain. Sharing this issue, object detection is a vital task for autonomous vehicles where the large amount of driving scenarios yields several domains of application requiring annotated data for the training process. In this work, a method for training a car detection system with annotated data from a source domain (day images) without requiring the image annotations of the target domain (night images) is presented. For that, a model based on Generative Adversarial Networks (GANs) is explored to enable the generation of an artificial dataset with its respective annotations. The artificial dataset (fake dataset) is created translating images from day-time domain to night-time domain. The fake dataset, which comprises annotated images of only the target domain (night images), is then used to train the car detector model. Experimental results showed that the proposed method achieved significant and consistent improvements, including the increasing by more than 10% of the detection performance when compared to the training with only the available annotated data (i.e., day images).
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Edge Sensitive Unsupervised Image-to-Image Translation
    Akkaya, Ibrahim Batuhan
    Halici, Ugur
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [32] Unsupervised Image-to-Image Translation with Style Consistency
    Lai, Binxin
    Wang, Yuan-Gen
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VI, 2024, 14430 : 322 - 334
  • [33] Rethinking the Truly Unsupervised Image-to-Image Translation
    Baek, Kyungjune
    Choi, Yunjey
    Uh, Youngjung
    Yoo, Jaejun
    Shim, Hyunjung
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14134 - 14143
  • [34] Contrastive learning for unsupervised image-to-image translation
    Lee, Hanbit
    Seol, Jinseok
    Lee, Sang-goo
    Park, Jaehui
    Shim, Junho
    APPLIED SOFT COMPUTING, 2024, 151
  • [35] A night pavement crack detection method based on image-to-image translation
    Liu, Chao
    Xu, Boqiang
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2022, 37 (13) : 1737 - 1753
  • [36] MULTI-DOMAIN UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION WITH APPEARANCE ADAPTIVE CONVOLUTION
    Jeong, Somi
    Lee, Jiyoung
    Sohn, Kwanghoon
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1750 - 1754
  • [37] Domain Adaptive Image-to-image Translation
    Chen, Ying-Cong
    Xu, Xiaogang
    Jia, Jiaya
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5273 - 5282
  • [38] Single Cross-domain Semantic Guidance Network for Multimodal Unsupervised Image Translation
    Lan, Jiaying
    Cheng, Lianglun
    Huang, Guoheng
    Pun, Chi-Man
    Yuan, Xiaochen
    Lai, Shangyu
    Liu, HongRui
    Ling, Wing-Kuen
    MULTIMEDIA MODELING, MMM 2023, PT I, 2023, 13833 : 165 - 177
  • [39] A framework for generalizing critical heat flux detection models using unsupervised image-to-image translation
    Al-Hindawi, Firas
    Soori, Tejaswi
    Hu, Han
    Siddiquee, Md. Mahfuzur Rahman
    Yoon, Hyunsoo
    Wu, Teresa
    Sun, Ying
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 227
  • [40] Few-Shot Unsupervised Image-to-Image Translation
    Liu, Ming-Yu
    Huang, Xun
    Mallya, Arun
    Karras, Tero
    Aila, Timo
    Lehtinen, Jaakko
    Kautz, Jan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 10550 - 10559