Automatic Mixed Precision and Distributed Data-Parallel Training for Satellite Image Classification using CNN

Cited by: 0
Authors
Nuwara, Yohanes [1]
Kitt, Wong W. [2]
Juwono, Filbert H. [3]
Ollivierre, Gregory [4]
Affiliations
[1] Asia Pulp & Paper, Sinarmas Land Plaza MH Thamrin, Jakarta, Indonesia
[2] Curtin Univ Malaysia, CDT 250, Miri 98009, Sarawak, Malaysia
[3] Univ Southampton Malaysia, Iskandar Puteri 79100, Johor, Malaysia
[4] OmegaCrop, 71-75 Shelton St, London, England
Keywords
Automatic Mixed Precision; Convolutional Neural Network; Distributed Data-Parallel; Graphics Processing Unit; Remote Sensing
DOI
10.1117/12.2679828
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Deep learning models for computer vision in remote sensing, such as Convolutional Neural Networks (CNNs), have benefited from acceleration through the use of multiple CPUs and GPUs. Training can be made more efficient in several ways, including Distributed Data Parallelization (DDP), which uses multiple cores at the same time by processing different image mini-batches on duplicated copies of the model, and Automatic Mixed Precision (AMP), which computes the parameters in a lower-precision floating-point format. We investigate the impact of the DDP and AMP training modes on the overall utilization and memory consumption of the CPU and GPU, as well as on the accuracy of a CNN model. The study is performed on the EuroSAT dataset, a Sentinel-2-based benchmark satellite image dataset for land cover classification. We compare training with a single CPU, with DDP, and with both DDP and AMP over 100 epochs using the ResNet-18 architecture. The hardware consists of an Intel Xeon Silver 4116 CPU with 24 cores and an NVIDIA V100 GPU. We find that although parallelization of CPUs, or DDP, takes less time to train on the images, it can use 50 MB more memory than a single CPU. The combination of DDP and AMP can free up to 160 MB of memory and reduce computation time by 20 seconds. The test accuracy is slightly higher for both DDP and DDP-AMP, at 90.61% and 90.77% respectively, than without DDP and AMP, at 89.84%. Hence, training with DDP and AMP offers lower GPU memory consumption, faster training execution, faster convergence towards solutions, and higher accuracy.
Pages: 9
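The record does not include the authors' code, so the following is only a minimal, framework-free sketch of the two ideas the abstract names. It assumes a toy 1-D linear model with a mean squared error loss, and every name in it (`grad_mse`, `ddp_grad`, `to_fp16`, `scaled_fp16_grad`) is hypothetical. The first part shows that averaging per-replica gradients over equal-sized shards of a mini-batch reproduces the full-batch gradient, which is the principle behind data-parallel training. The second part shows why mixed-precision training commonly scales the loss: a very small gradient rounds to zero in half precision unless it is scaled up before storage and scaled back down in full precision.

```python
import math
import struct


def grad_mse(w, batch):
    """Gradient of the mean squared error for the toy model y = w * x."""
    return sum(2.0 * (w * x - y) * x for x, y in batch) / len(batch)


def ddp_grad(w, batch, world_size):
    """Data-parallel gradient: one equal shard per replica, then an average.

    The final average stands in for the all-reduce step that a real
    data-parallel framework would perform across processes.
    """
    assert len(batch) % world_size == 0, "shards must be equal-sized"
    shard = len(batch) // world_size
    grads = [
        grad_mse(w, batch[rank * shard:(rank + 1) * shard])
        for rank in range(world_size)
    ]
    return sum(grads) / world_size


def to_fp16(x):
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def scaled_fp16_grad(true_grad, loss_scale):
    """Store grad * loss_scale in half precision, then unscale in full precision."""
    return to_fp16(true_grad * loss_scale) / loss_scale


if __name__ == "__main__":
    # Toy data generated from y = 2 * x; the weight 0.5 is deliberately wrong.
    batch = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
    full = grad_mse(0.5, batch)
    parallel = ddp_grad(0.5, batch, world_size=2)
    print("full-batch gradient:", full)
    print("2-replica averaged gradient:", parallel)
    print("gradients match:", math.isclose(full, parallel))

    # A gradient too small for half precision, with and without loss scaling.
    tiny = 1e-8
    print("fp16 without scaling:", to_fp16(tiny))
    print("fp16 with loss scale 65536:", scaled_fp16_grad(tiny, 65536.0))
```

The equal-shard assertion matters: with unequal shards, a plain average of per-replica gradients would no longer equal the full-batch gradient. The loss scale of 65536 is an arbitrary illustrative choice, not a value taken from the paper.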