GAAF: Searching Activation Functions for Binary Neural Networks Through Genetic Algorithm

Cited by: 2
Authors
Li, Yanfei [1]
Geng, Tong [2]
Stein, Samuel [2]
Li, Ang [2]
Yu, Huimin [1]
Affiliations
[1] Zhejiang Univ, Dept Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Pacific Northwest Natl Lab, Richland, WA 99354 USA
Source
TSINGHUA SCIENCE AND TECHNOLOGY | 2023 / Vol. 28 / Issue 01
Keywords
binary neural networks (BNNs); genetic algorithm; activation function;
DOI
10.26599/TST.2021.9010084
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Binary neural networks (BNNs) show promise in cost- and power-restricted domains such as edge devices and mobile systems. This is due to their significantly lower computation and storage demands, which come at the cost of degraded performance. To close the accuracy gap, in this paper we propose adding a complementary activation function (AF) ahead of the sign-based binarization and relying on a genetic algorithm (GA) to automatically search for the ideal AFs. These AFs can help extract extra information from the input data in the forward pass while allowing improved gradient approximation in the backward pass. Fifteen novel AFs are identified through our GA-based search, and most of them show improved performance (up to 2.54% on ImageNet) when tested on different datasets and network models. Interestingly, periodic functions, which rarely appear in human-designed AFs, are identified as a key component of most of the discovered AFs. Our method offers a novel approach for designing general and application-specific BNN architectures. GAAF will be released on GitHub.
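The idea in the abstract (an AF placed ahead of sign-based binarization, with a GA choosing the AF) can be illustrated with a toy sketch. This is not the authors' GAAF implementation: the primitive set, the genome encoding as a chain of unary functions, the fitness function, and all names (`PRIMITIVES`, `make_af`, `evolve`, and so on) are assumptions made for illustration, and only the forward pass is shown. The backward-pass gradient approximation mentioned in the abstract is not modelled here.

```python
import math
import random

# Hypothetical primitive set; the abstract does not specify the real search space.
PRIMITIVES = {
    "identity": lambda x: x,
    "sin": math.sin,
    "cos": math.cos,
    "tanh": math.tanh,
}


def sign(x):
    """Sign-based binarization to {-1, +1}; zero maps to +1."""
    return 1.0 if x >= 0 else -1.0


def make_af(genome):
    """Compose primitives left to right: ("sin", "tanh") gives tanh(sin(x))."""
    def af(x):
        for name in genome:
            x = PRIMITIVES[name](x)
        return x
    return af


def binarize_with_af(x, genome):
    """Apply the candidate AF first, then binarize with sign."""
    return sign(make_af(genome)(x))


def fitness(genome, samples):
    """Toy fitness: fraction of (x, target_bit) pairs the binarized AF matches."""
    return sum(binarize_with_af(x, genome) == t for x, t in samples) / len(samples)


def evolve(samples, pop_size=20, genome_len=2, generations=15, seed=0):
    """Minimal GA: truncation selection, one-point crossover, point mutation.

    Assumes genome_len >= 2 so that a crossover cut point exists.
    """
    rng = random.Random(seed)
    names = list(PRIMITIVES)
    pop = [tuple(rng.choice(names) for _ in range(genome_len)) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda g: fitness(g, samples), reverse=True)
        parents = pop[: pop_size // 2]  # the better half survives unchanged
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, genome_len)
            child = list(a[:cut] + b[cut:])  # one-point crossover
            if rng.random() < 0.3:  # point mutation with assumed rate 0.3
                child[rng.randrange(genome_len)] = rng.choice(names)
            children.append(tuple(child))
        pop = parents + children
    return max(pop, key=lambda g: fitness(g, samples))


# Toy task: the target bit is periodic in x, so plain sign(x) cannot match it.
samples = [(i * 0.25, sign(math.sin(i * 0.25))) for i in range(-24, 25)]
best = evolve(samples)
print(best, fitness(best, samples))
```

In this toy task the target bit follows the sign of a sine wave, so a genome containing a periodic primitive can fit it while the identity AF cannot. That loosely mirrors the abstract's observation about periodic functions, but it says nothing about the paper's actual results.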
Pages: 207-220
Page count: 14
Related papers
50 entries in total
  • [1] Parallel Implementation of Genetic Algorithm for Searching Optimal Parameters of Artificial Neural Networks
    Nugroho, Eko Dwi
    Wibowo, Moh Edi
    Pulungan, Reza
    2017 3RD INTERNATIONAL CONFERENCE ON SCIENCE AND TECHNOLOGY - COMPUTER (ICST), 2017, : 136 - 141
  • [2] Determination of Activation Functions in A Feedforward Neural Network by using Genetic Algorithm
    Ustun, Oguz
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2009, 15 (03): : 395 - 403
  • [3] Optimization of neural networks through grammatical evolution and a genetic algorithm
    Lima de Campos, Lidio Mauro
    Limao de Oliveira, Roberto Celio
    Roisenberg, Mauro
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 56 : 368 - 384
  • [4] Decode to channel binary block codes based on neural networks and genetic algorithm
    Zheng, YJ
    Tang, XY
    Tay, DBH
    APPLIED ARTIFICIAL INTELLIGENCE, 2001, 15 (02) : 141 - 159
  • [5] Wavelets as activation functions in Neural Networks
    Herrera, Oscar
    Priego, Belem
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4345 - 4355
  • [6] Genetic Algorithm-Neural Network (GANN): a study of neural network activation functions and depth of genetic algorithm search applied to feature selection
    Tong, Dong Ling
    Mintram, Robert
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2010, 1 (1-4) : 75 - 87
  • [7] Genetic Algorithm-Neural Network (GANN): a study of neural network activation functions and depth of genetic algorithm search applied to feature selection
    Dong Ling Tong
    Robert Mintram
    International Journal of Machine Learning and Cybernetics, 2010, 1 : 75 - 87
  • [8] Optimizing performance of feedforward and convolutional neural networks through dynamic activation functions
    Rane, Chinmay
    Tyagi, Kanishka
    Kline, Adrienne
    Chugh, Tushar
    Manry, Michael
    EVOLUTIONARY INTELLIGENCE, 2024, 17 (5-6) : 4083 - 4093
  • [9] Simple activation functions for neural and fuzzy neural networks
    Mendil, B
    Benmahammed, K
    ISCAS '99: PROCEEDINGS OF THE 1999 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 5: SYSTEMS, POWER ELECTRONICS, AND NEURAL NETWORKS, 1999, : 347 - 350
  • [10] Improving the Accuracy of Deep Neural Networks Through Developing New Activation Functions
    Mercioni, Marina Adriana
    Tat, Angel Marcel
    Holban, Stefan
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 385 - 391