Heading Direction Estimation Using Deep Learning with Automatic Large-scale Data Acquisition

被引:0
|
作者
Berriel, Rodrigo E. [1 ]
Tones, Lucas Tabelini [1 ]
Cardoso, Vinicius B. [1 ]
Guidolini, Ranik [1 ]
Badue, Claudine [1 ]
De Souza, Alberto F. [1 ]
Oliveira-Santos, Thiago [1 ]
机构
[1] Univ Fed Espirito Santo, Dept Informat, Vitoria, ES, Brazil
关键词
Deep Learning; Heading Estimation; Convolutional Neural Networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Advanced Driver Assistance Systems (ADAS) have experienced major advances in the past few years. The main objective of ADAS includes keeping the vehicle in the correct road direction, and avoiding collision with other vehicles or obstacles around. In this paper, we address the problem of estimating the heading direction that keeps the vehicle aligned with the road direction. This information can be used in precise localization, road and lane keeping, lane departure warning, and others. To enable this approach, a large-scale database (+1 million images) was automatically acquired and annotated using publicly available platforms such as the Google Street View API and OpenStreetMap. After the acquisition of the database, a CNN model was trained to predict how much the heading direction of a car should change in order to align it to the road 4 meters ahead. To assess the performance of the model, experiments were performed using images from two different sources: a hidden test set from Google Street View (GSV) images and two datasets from our autonomous car (IARA). The model achieved a low mean average error of 2.359 degrees and 2.524 degrees for the GSV and IARA datasets, respectively; performing consistently across the different datasets. It is worth noting that the images from the IARA dataset are very different (camera, FOV, brightness, etc.) from the ones of the GSV dataset, which shows the robustness of the model. In conclusion, the model was trained effortlessly (using automatic processes) and showed promising results in real-world databases working in real-time (more than 75 frames per second).
引用
收藏
页数:8
相关论文
共 50 条
  • [31] A New Framework for Regional Traffic Volumes Estimation with Large-Scale Connected Vehicle Data and Deep Learning Method
    Khadka, Swastik
    Wang, Peirong Slade
    Li, Pengfei Taylor
    Torres, Francisco J.
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2023, 149 (04)
  • [32] CURE: A deep learning framework pre-trained on large-scale patient data for treatment effect estimation
    Liu, Ruoqi
    Chen, Pin-Yu
    Zhang, Ping
    PATTERNS, 2024, 5 (06):
  • [33] Automatic Detection of Four-Panel Cartoon in Large-Scale Korean Digitized Newspapers using Deep Learning
    Lee, Seojoon
    Kim, Byungjun
    Jun, Bong Gwan
    JOURNAL OF OPEN HUMANITIES DATA, 2024, 10 : 1 - 15
  • [34] A Standardized Method for Large-scale Distributed Data Acquisition
    Peng, Xin-yi
    Huang, Jing-bin
    Huang, Zhi-wei
    2011 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION AND INDUSTRIAL APPLICATION (ICIA2011), VOL II, 2011, : 216 - 221
  • [35] A Standardized Method for Large-scale Distributed Data Acquisition
    Peng, Xin-yi
    Huang, Jing-bin
    Huang, Zhi-wei
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL VII, 2010, : 217 - 222
  • [36] Large-scale transport simulation by deep learning
    Jie Pan
    Nature Computational Science, 2021, 1 : 306 - 306
  • [37] Tractable large-scale deep reinforcement learning
    Sarang, Nima
    Poullis, Charalambos
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 232
  • [38] Large-scale transport simulation by deep learning
    Pan, Jie
    NATURE COMPUTATIONAL SCIENCE, 2021, 1 (05): : 306 - 306
  • [39] The three pillars of large-scale deep learning
    Hoefler, Torsten
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 908 - 908
  • [40] Learning Deep Representation with Large-scale Attributes
    Ouyang, Wanli
    Li, Hongyang
    Zeng, Xingyu
    Wang, Xiaogang
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1895 - 1903