Towards Efficient Verification of Quantized Neural Networks

Cited by: 0
Authors
Huang, Pei [1]
Wu, Haoze [1]
Yang, Yuting [2]
Daukantas, Ieva [3]
Wu, Min [1]
Zhang, Yedi [4]
Barrett, Clark [1]
Affiliations
[1] Stanford Univ, Stanford, CA USA
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[3] IT Univ Copenhagen, Copenhagen, Denmark
[4] Natl Univ Singapore, Singapore, Singapore
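Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract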
Quantization replaces floating-point arithmetic with integer arithmetic in deep neural network models, providing more efficient on-device inference with less power and memory. In this work, we propose a framework for formally verifying properties of quantized neural networks. Our baseline technique is based on integer linear programming, which guarantees both soundness and completeness. We then show how efficiency can be improved by utilizing gradient-based heuristic search methods and bound-propagation techniques. We evaluate our approach on perception networks quantized with PyTorch. Our results show that we can verify quantized networks with better scalability and efficiency than the previous state of the art.
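As a rough illustration of two ideas the abstract names (integer quantization and bound propagation), the following minimal Python sketch quantizes a real value with an affine scheme and propagates integer interval bounds through one linear layer. It is not the paper's framework: the function names `quantize` and `interval_linear`, the 8-bit range, and the example weights are all assumptions chosen for illustration.

```python
from typing import List, Tuple


def quantize(x: float, scale: float, zero_point: int,
             qmin: int = -128, qmax: int = 127) -> int:
    """Affine quantization (assumed scheme): round(x / scale) + zero_point,
    clamped to [qmin, qmax]. Python's round() rounds halves to even."""
    q = round(x / scale) + zero_point
    return max(qmin, min(qmax, q))


def interval_linear(lo: List[int], hi: List[int],
                    weights: List[List[int]],
                    bias: List[int]) -> Tuple[List[int], List[int]]:
    """Propagate elementwise integer bounds lo <= x <= hi through y = W x + b.

    For each weight, a non-negative entry takes the lower input bound for the
    output lower bound; a negative entry takes the upper input bound.
    """
    out_lo, out_hi = [], []
    for row, b in zip(weights, bias):
        low, high = b, b
        for w, x_lo, x_hi in zip(row, lo, hi):
            if w >= 0:
                low += w * x_lo
                high += w * x_hi
            else:
                low += w * x_hi
                high += w * x_lo
        out_lo.append(low)
        out_hi.append(high)
    return out_lo, out_hi


# Hypothetical integer weights and an input box, for illustration only.
W = [[1, -2], [3, 0]]
b = [1, -1]
bounds = interval_linear([0, -1], [2, 1], W, b)
```

Bounds computed this way are sound but may be loose; a complete method such as the integer linear programming baseline described in the abstract would still be needed to decide properties that the bounds alone cannot settle.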
Pages: 21152 - 21160
Number of pages: 9
Related papers
50 in total
  • [21] Quantized Deep Neural Networks for Energy Efficient Hardware-based Inference
    Ding, Ruizhou
    Liu, Zeye
    Blanton, R. D.
    Marculescu, Diana
    2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 1 - 8
  • [22] Towards an Efficient Facial Image Compression with Neural Networks
    Spatafora, Maria Ausilia Napoli
    Ortis, Alessandro
    Battiato, Sebastiano
    IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 512 - 523
  • [23] Towards Inductive and Efficient Explanations for Graph Neural Networks
    Luo, Dongsheng
    Zhao, Tianxiang
    Cheng, Wei
    Xu, Dongkuan
    Han, Feng
    Yu, Wenchao
    Liu, Xiao
    Chen, Haifeng
    Zhang, Xiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5245 - 5259
  • [24] Work-in-Progress: Towards Efficient Quantized Neural Network Inference on Mobile Devices
    Umuroglu, Yaman
    Jahre, Magnus
    2017 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2017,
  • [25] Towards Formal Verification of Neural Networks: A Temporal Logic Based Framework
    Wang, Xiaobing
    Yang, Kun
    Wang, Yanmei
    Zhao, Liang
    Shu, Xinfeng
    STRUCTURED OBJECT-ORIENTED FORMAL LANGUAGE AND METHOD (SOFL+MSVL 2019), 2020, 12028 : 73 - 87
  • [26] Towards Formal Verification of Neural Networks in Cyber-Physical Systems
    Rossi, Federico
    Bernardeschi, Cinzia
    Cococcioni, Marco
    Palmieri, Maurizio
    NASA FORMAL METHODS, NFM 2024, 2024, 14627 : 207 - 222
  • [27] Hiding Needles in a Haystack: Towards Constructing Neural Networks that Evade Verification
    Berta, Arpad
    Danner, Gabor
    Hegedus, Istvan
    Jelasity, Mark
    PROCEEDINGS OF THE 2022 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH-MMSEC 2022, 2022, : 51 - 62
  • [28] Towards Verification of Neural Networks for Small Unmanned Aircraft Collision Avoidance
    Irfan, Ahmed
    Julian, Kyle D.
    Wu, Haoze
    Barrett, Clark
    Kochenderfer, Mykel J.
    Meng, Baoluo
    Lopez, James
    2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,
  • [29] Zac: Towards Automatic Optimization and Deployment of Quantized Deep Neural Networks on Embedded Devices
    Xiao, Qingcheng
    Liang, Yun
    2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
  • [30] Quantized Neural Modeling: Hybrid Quantized Architecture in Elman Networks
    Li, Penghua
    Chai, Yi
    Xiong, Qingyu
    NEURAL PROCESSING LETTERS, 2013, 37 (02) : 163 - 187