Deep Multi-Modal Vehicle Detection in Aerial ISR Imagery

被引:36
|
作者
Sakla, Wesam [1 ]
Konjevod, Goran [1 ]
Mundhenk, T. Nathan [1 ]
机构
[1] Lawrence Livermore Natl Lab, Computat Engn Div, Livermore, CA 94550 USA
来源
2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017) | 2017年
关键词
D O I
10.1109/WACV.2017.107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since the introduction of deep convolutional neural networks (CNNs), object detection in imagery has witnessed substantial breakthroughs in state-of-the-art performance. The defense community utilizes overhead image sensors that acquire large field-of-view aerial imagery in various bands of the electromagnetic spectrum, which is then exploited for various applications, including the detection and localization of man-made objects. In this work, we utilize a recent state-of-the art object detection algorithm, faster R-CNN, to train a deep CNN for vehicle detection in multimodal imagery. We utilize the vehicle detection in aerial imagery (VEDAI) dataset, which contains overhead imagery that is representative of an ISR setting. Our contribution includes modification of key parameters in the faster R-CNN algorithm for this setting where the objects of interest are spatially small, occupying less than 1.5 x 10(-3) of the total image pixels. Our experiments show that (1) an appropriately trained deep CNN leads to average precision rates above 93% on vehicle detection, and (2) transfer learning between imagery modalities is possible, yielding average precision rates above 90% in the absence of fine-tuning.
引用
收藏
页码:916 / 923
页数:8
相关论文
共 50 条
  • [1] DEEP SEMANTIC SEGMENTATION OF AERIAL IMAGERY BASED ON MULTI-MODAL DATA
    Chen, Kaiqiang
    Fu, Kun
    Sun, Xian
    Weinmann, Michael
    Hinz, Stefan
    Jutzi, Boris
    Weinmann, Martin
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 6219 - 6222
  • [2] Vehicle Detection from Multi-modal Aerial Imagery using YOLOv3 with Mid-level Fusion
    Dhanaraj, Mayur
    Sharma, Manish
    Sarkar, Tiyasa
    Karnam, Srivallabha
    Chachlakis, Dimitris
    Ptucha, Raymond
    Markopoulos, Panos P.
    Saber, Eli
    BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [3] Multi-Modal Deep Learning for Vehicle Sensor Data Abstraction and Attack Detection
    Rofail, Mark
    Alsafty, Aysha
    Matousek, Matthias
    Kargl, Frank
    2019 IEEE INTERNATIONAL CONFERENCE OF VEHICULAR ELECTRONICS AND SAFETY (ICVES 19), 2019,
  • [4] Multi-modal People Detection from Aerial Video
    Flynn, Helen
    Cameron, Stephen
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 815 - 824
  • [5] Multi-modal People Detection from Aerial Video Footage
    Flynn, Helen
    Cameron, Stephen
    TOWARDS AUTONOMOUS ROBOTIC SYSTEMS, 2014, 8069 : 190 - 191
  • [6] Deep Multi-modal Object Detection for Autonomous Driving
    Ennajar, Amal
    Khouja, Nadia
    Boutteau, Remi
    Tlili, Fethi
    2021 18TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2021, : 7 - 11
  • [7] Multi-Modal Song Mood Detection with Deep Learning
    Pyrovolakis, Konstantinos
    Tzouveli, Paraskevi
    Stamou, Giorgos
    SENSORS, 2022, 22 (03)
  • [8] A Multi-Modal Optimization Approach to Single Path Planning for Unmanned Aerial Vehicle
    Yang, Peng
    Lu, Guanzhou
    Tang, Ke
    Yao, Xin
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 1735 - 1742
  • [9] Imagery in multi-modal object learning
    Jüttner, M
    Rentschler, I
    BEHAVIORAL AND BRAIN SCIENCES, 2002, 25 (02) : 197 - +
  • [10] EFFECTIVE FUSION OF MULTI-MODAL DATA WITH GROUP CONVOLUTIONS FOR SEMANTIC SEGMENTATION OF AERIAL IMAGERY
    Chen, Kaiqiang
    Fu, Kun
    Gao, Xin
    Yan, Menglong
    Zhang, Wenkai
    Zhang, Yue
    Sun, Xian
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3911 - 3914