Analysis of non-linear activation functions for classification tasks using convolutional neural networks

被引:9
|
作者
Dureja A. [1 ]
Pahwa P. [2 ]
机构
[1] Computer Science & Engineering, USICT, GGSIPU, New Delhi
[2] Computer Science & Engineering, BPIT, Rohini, New Delhi
关键词
Activation function; CNN; Deep neural networks; Hidden layers; Machine learning; Non-linear problems;
D O I
10.2174/2213275911666181025143029
中图分类号
学科分类号
摘要
Background: In making the deep neural network, activation functions play an important role. But the choice of activation functions also affects the network in term of optimization and to retrieve the better results. Several activation functions have been introduced in machine learning for many practical applications. But which activation function should use at hidden layer of deep neural networks was not identified. Objective: The primary objective of this analysis was to describe which activation function must be used at hidden layers for deep neural networks to solve complex non-linear problems. Methods: The configuration for this comparative model was used by using the datasets of 2 classes (Cat/Dog). The number of Convolutional layer used in this network was 3 and the pooling layer was also introduced after each layer of CNN layer. The total of the dataset was divided into the two parts. The first 8000 images were mainly used for training the network and the next 2000 images were used for testing the network. Results: The experimental comparison was done by analyzing the network by taking different activation functions on each layer of CNN network. The validation error and accuracy on Cat/Dog dataset were analyzed using activation functions (ReLU, Tanh, Selu, PRelu, Elu) at number of hidden layers. Overall the Relu gave best performance with the validation loss at 25th Epoch 0.3912 and validation accuracy at 25th Epoch 0.8320. Conclusion: It is found that a CNN model with ReLU hidden layers (3 hidden layers here) gives best results and improve overall performance better in term of accuracy and speed. These advantages of ReLU in CNN at number of hidden layers are helpful to effectively and fast retrieval of images from the databases. © 2019 Bentham Science Publishers.
引用
收藏
页码:156 / 161
页数:5
相关论文
共 50 条
  • [31] Non-linear system modeling using LSTM neural networks
    Gonzalez, Jesus
    Yu, Wen
    IFAC PAPERSONLINE, 2018, 51 (13): : 485 - 489
  • [32] Classification of Non-functional Requirements Using Convolutional Neural Networks
    Garcia, S. E. Martinez
    Fernandez-y-Fernandez, C. Alberto
    Perez, E. G. Ramos
    PROGRAMMING AND COMPUTER SOFTWARE, 2023, 49 (08) : 705 - 711
  • [33] Classification of Non-functional Requirements Using Convolutional Neural Networks
    S. E. Martínez García
    C. Alberto Fernández-y-Fernández
    E. G. Ramos Pérez
    Programming and Computer Software, 2023, 49 : 705 - 711
  • [34] NLCMAP: A FRAMEWORK FOR THE EFFICIENT MAPPING OF NON-LINEAR CONVOLUTIONAL NEURAL NETWORKS ON FPGA ACCELERATORS
    Aiello, Giuseppe
    Bussolino, Beatrice
    Valpreda, Emanuele
    Roch, Massimo Ruo
    Masera, Guido
    Martina, Maurizio
    Marsi, Stefano
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 926 - 930
  • [35] Automatic classification of images with beach linear perspective using convolutional neural networks
    Santos-Romero, Martin
    Arellano-Verdejo, Javier
    Lazcano-Hernandez, Hugo E.
    Damian Reyes, Pedro
    2022 IEEE MEXICAN INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE (ENC), 2022,
  • [36] Alternating Transfer Functions to Prevent Overfitting in Non-Linear Regression with Neural Networks
    Seitz, Philipp
    Schmitt, Jan
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2023,
  • [37] Plant Classification using Convolutional Neural Networks
    Yalcin, Hulya
    Razavi, Salar
    2016 FIFTH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2016, : 233 - 237
  • [38] Sound Classification Using Convolutional Neural Networks
    Jaiswal, Kaustumbh
    Patel, Dhairya Kalpeshbhai
    2018 SEVENTH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING IN EMERGING MARKETS (CCEM), 2018, : 81 - 84
  • [39] Clothing Classification Using Convolutional Neural Networks
    Hodecker, Andrei
    Fernandes, Anita M. R.
    Steffens, Alisson
    Crocker, Paul
    Leithardt, Valderi R. Q.
    2020 15TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2020), 2020,
  • [40] Strabismus Classification using Convolutional Neural Networks
    Kim, Donghwan
    Joo, Jaehan
    Zhu, Guohua
    Seo, Jeongbin
    Ha, Jaeseung
    Kim, Suk Chan
    3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, : 216 - 218