Automatic Mixed Precision and Distributed Data-Parallel Training for Satellite Image Classification using CNN

被引:0
|
作者
Nuwara, Yohanes [1 ]
Kitt, Wong W. [2 ]
Juwono, Filbert H. [3 ]
Ollivierre, Gregory [4 ]
机构
[1] Asia Pulp & Paper, Sinarmas Land Plaza MH Thamrin, Jakarta, Indonesia
[2] Curtin Univ Malaysia, CDT 250, Bellevue, WA 98009 USA
[3] Univ Southampton Malaysia, Iskandar Puteri 79100, Johor, Malaysia
[4] OmegaCrop, 71-75 Shelton St, London, England
关键词
Automatic Mixed Precision; Convolutional Neural Network; Distributed Data-Parallel; Graphics Processing Unit; Remote Sensing;
D O I
10.1117/12.2679828
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning models for computer vision in remote sensing such as Convolutional Neural Network (CNN) has benefitted acceleration from the usage of multiple CPUs and GPUs. There are several ways to make the training stage more effective in terms of utilizing multiple cores at the same time by processing different image mini-batches with a duplicated model called Distributed Data Parallelization (DDP) and computing the parameters in a lower precision floating-point number called Automatic Mixed Precision (AMP). We would like to investigate the impact of DDP and AMP training modes on the overall utilization and memory consumption of CPU and GPU, as well as the accuracy of a CNN model. The study is performed on the EuroSAT dataset, a Sentinel-2-based benchmark satellite image dataset for image classification of land covers. We compare training using 1 CPU, using DDP, and using both DDP and AMP over 100 epochs using ResNet-18 architecture. The hardware that we used are Intel Xeon Silver 4116 CPU with 24 cores and an NVIDIA v100 GPU. We find that although parallelization of CPUs or DDP takes less time to train on the images, it can take 50 MB more memory than using only a single CPU. The combination of DDP and AMP can release memory up to 160 MB and reduce computation time by 20 seconds. The test accuracy is slightly higher for both DDP and DDP-AMP at 90.61% and 90.77% respectively than without DDP and AMP at 89.84%. Hence, training using Distributed Data Parallelization (DDP) and Automatic Mixed Precision (AMP) has more benefits in terms of lower GPU memory consumption, faster training execution time, faster convergence towards solutions, and finally, higher accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Collaborative Classification of Hyperspectral and LIDAR Data Using Unsupervised Image-to-Image CNN
    Zhang, Mengmeng
    Li, Wei
    Wei, Xueling
    Li, Xiang
    2018 10TH IAPR WORKSHOP ON PATTERN RECOGNITION IN REMOTE SENSING (PRRS), 2018,
  • [22] Feature Image-Based Automatic Modulation Classification Method Using CNN Algorithm
    Lee, Jung Ho
    Kim, Kwang-Yul
    Shin, Yoan
    2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 560 - 563
  • [23] An automatic cloud region classification of satellite image by using clustering in local area
    Ozawa, N
    Aoki, T
    Kato, N
    Nemoto, Y
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2003, 86 (06): : 33 - 43
  • [24] Parametric Classification using Fuzzy Approach for Handling the Problem of Mixed Pixels in Ground Truth Data for a Satellite Image
    Sherwani A.R.
    Ali Q.M.
    Annals of Data Science, 2023, 10 (06) : 1459 - 1472
  • [25] CNN HYPERSPECTRAL IMAGE CLASSIFICATION USING TRAINING SAMPLE AUGMENTATION WITH GENERATIVE ADVERSARIAL NETWORKS
    Neagoe, Victor-Emil
    Diaconescu, Paul
    2020 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2020, : 515 - 519
  • [26] High Performance Processing of Satellite Data Using Distributed and Parallel Computing Techniques
    Damahe, Lalit B.
    Bramhe, Sanket S.
    Fursule, Nilay C.
    Shirbhate, Ram D.
    Ajmire, Pournima S.
    Kumar, Girish
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 404 - 409
  • [27] AUTOMATIC FRAMEWORK FOR SEMI-SUPERVISED HYPERSPECTRAL IMAGE CLASSIFICATION USING SELF-TRAINING WITH DATA EDITING
    Wang, Junshu
    Jiang, Nan
    Zhang, Guoming
    Hu, Bin
    Li, Yang
    2015 7TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2015,
  • [28] Remote sensing image scene classification using CNN-MLP with data augmentation
    Shawky, Osama A.
    Hagag, Ahmed
    El-Dahshan, El-Sayed A.
    Ismail, Manal A.
    OPTIK, 2020, 221
  • [29] A new image classification method using CNN transfer learning and web data augmentation
    Han, Dongmei
    Liu, Qigang
    Fan, Weiguo
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 95 : 43 - 56
  • [30] Multiclass Classification for Bangla News Tags with Parallel CNN Using Word Level Data Augmentation
    Amin, Ruhul
    Sworna, Nabila Sabrin
    Hossain, Nahid
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 174 - 177