Automatic Mixed Precision and Distributed Data-Parallel Training for Satellite Image Classification using CNN

被引:0
|
作者
Nuwara, Yohanes [1 ]
Kitt, Wong W. [2 ]
Juwono, Filbert H. [3 ]
Ollivierre, Gregory [4 ]
机构
[1] Asia Pulp & Paper, Sinarmas Land Plaza MH Thamrin, Jakarta, Indonesia
[2] Curtin Univ Malaysia, CDT 250, Bellevue, WA 98009 USA
[3] Univ Southampton Malaysia, Iskandar Puteri 79100, Johor, Malaysia
[4] OmegaCrop, 71-75 Shelton St, London, England
关键词
Automatic Mixed Precision; Convolutional Neural Network; Distributed Data-Parallel; Graphics Processing Unit; Remote Sensing;
D O I
10.1117/12.2679828
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models for computer vision in remote sensing such as Convolutional Neural Network (CNN) has benefitted acceleration from the usage of multiple CPUs and GPUs. There are several ways to make the training stage more effective in terms of utilizing multiple cores at the same time by processing different image mini-batches with a duplicated model called Distributed Data Parallelization (DDP) and computing the parameters in a lower precision floating-point number called Automatic Mixed Precision (AMP). We would like to investigate the impact of DDP and AMP training modes on the overall utilization and memory consumption of CPU and GPU, as well as the accuracy of a CNN model. The study is performed on the EuroSAT dataset, a Sentinel-2-based benchmark satellite image dataset for image classification of land covers. We compare training using 1 CPU, using DDP, and using both DDP and AMP over 100 epochs using ResNet-18 architecture. The hardware that we used are Intel Xeon Silver 4116 CPU with 24 cores and an NVIDIA v100 GPU. We find that although parallelization of CPUs or DDP takes less time to train on the images, it can take 50 MB more memory than using only a single CPU. The combination of DDP and AMP can release memory up to 160 MB and reduce computation time by 20 seconds. The test accuracy is slightly higher for both DDP and DDP-AMP at 90.61% and 90.77% respectively than without DDP and AMP at 89.84%. Hence, training using Distributed Data Parallelization (DDP) and Automatic Mixed Precision (AMP) has more benefits in terms of lower GPU memory consumption, faster training execution time, faster convergence towards solutions, and finally, higher accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Augmentation of Small Training Data Using GANs for Enhancing the Performance of Image Classification
    Hung, Shih-Kai
    Gan, John Q.
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3350 - 3356
  • [42] Hyperspectral image classification with limited training data samples using feature subspaces
    Willis, C
    ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY X, 2004, 5425 : 170 - 181
  • [43] Multispectral Satellite Data Classification Using Rank Correlation Similarity Index and Automatic Labeling Technique
    Arindom Ain
    Minakshi Gogoi
    Dibyajyoti Chutia
    SN Computer Science, 5 (8)
  • [44] Plankton image classification using novel parallel-training learning vector quantization network
    Tang, XO
    Stewart, WK
    OCEANS '96 MTS/IEEE, CONFERENCE PROCEEDINGS, VOLS 1-3 / SUPPLEMENTARY PROCEEDINGS: COASTAL OCEAN - PROSPECTS FOR THE 21ST CENTURY, 1996, : 1227 - 1236
  • [45] Shoreline Data Extraction from QuickBird Satellite Image Using Semi-Automatic Technique
    Tarmizi, Nazirah Md.
    Samad, Abd Manan
    Yusop, Mohd Shukri Mohd
    2014 IEEE 10TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2014), 2014, : 157 - 162
  • [46] SAR Target Recognition Using Only Simulated Data for Training by Hierarchically Combining CNN and Image Similarity
    Zhang, Chen
    Wang, Yinghua
    Liu, Hongwei
    Sun, Yuanshuang
    Hu, Liping
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [47] Exercise classification using CNN with image frames produced from time-series motion data
    Itoh, Hajime
    Hanajima, Naohiko
    Muraoka, Yohei
    Ohata, Makoto
    Mizukami, Masato
    Fujihira, Yoshinori
    ICAROB 2017: PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2017, : P100 - P103
  • [48] Automated Battery Making Fault Classification Using Over-Sampled Image Data CNN Features
    Din, Nasir Ud
    Zhang, Li
    Yang, Yatao
    SENSORS, 2023, 23 (04)
  • [49] Automatic land cover classification of multi-resolution dualpol data using convolutional neural network (CNN)
    Memon, Nimrabanu
    Parikh, Hemani
    Patel, Samir B.
    Patel, Dhruvesh
    Patel, Vibha D.
    REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2021, 22
  • [50] A framework for automatic classification of mobile LiDAR data using multiple regions and 3D CNN architecture
    Kumar, Bhavesh
    Pandey, Gaurav
    Lohani, Bharat
    Misra, Subhas C.
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2020, 41 (14) : 5588 - 5608