Towards Efficient Verification of Quantized Neural Networks

Cited by: 0

Authors:

Huang, Pei ^{[1]}

Wu, Haoze ^{[1]}

Yang, Yuting ^{[2]}

Daukantas, Ieva ^{[3]}

Wu, Min ^{[1]}

Zhang, Yedi ^{[4]}

Barrett, Clark ^{[1]}

Affiliations:

[1] Stanford Univ, Stanford, CA USA

[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China

[3] IT Univ Copenhagen, Copenhagen, Denmark

[4] Natl Univ Singapore, Singapore, Singapore

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19 | 2024

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Quantization replaces floating-point arithmetic with integer arithmetic in deep neural network models, providing more efficient on-device inference with less power and memory. In this work, we propose a framework for formally verifying properties of quantized neural networks. Our baseline technique is based on integer linear programming, which guarantees both soundness and completeness. We then show how efficiency can be improved by utilizing gradient-based heuristic search methods and bound-propagation techniques. We evaluate our approach on perception networks quantized with PyTorch. Our results show that we can verify quantized networks with better scalability and efficiency than the previous state of the art.


Pages: 21152-21160

Page count: 9

50 entries in total

[21] Quantized Deep Neural Networks for Energy Efficient Hardware-based Inference
Ding, Ruizhou
Liu, Zeye
Blanton, R. D.
Marculescu, Diana
2018 23RD ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC), 2018, : 1 - 8
[22] Towards an Efficient Facial Image Compression with Neural Networks
Spatafora, Maria Ausilia Napoli
Ortis, Alessandro
Battiato, Sebastiano
IMAGE ANALYSIS AND PROCESSING, ICIAP 2022, PT I, 2022, 13231 : 512 - 523
[23] Towards Inductive and Efficient Explanations for Graph Neural Networks
Luo, Dongsheng
Zhao, Tianxiang
Cheng, Wei
Xu, Dongkuan
Han, Feng
Yu, Wenchao
Liu, Xiao
Chen, Haifeng
Zhang, Xiang
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) : 5245 - 5259
[24] Work-in-Progress: Towards Efficient Quantized Neural Network Inference on Mobile Devices
Umuroglu, Yaman
Jahre, Magnus
2017 INTERNATIONAL CONFERENCE ON COMPILERS, ARCHITECTURES AND SYNTHESIS FOR EMBEDDED SYSTEMS (CASES), 2017,
[25] Towards Formal Verification of Neural Networks: A Temporal Logic Based Framework
Wang, Xiaobing
Yang, Kun
Wang, Yanmei
Zhao, Liang
Shu, Xinfeng
STRUCTURED OBJECT-ORIENTED FORMAL LANGUAGE AND METHOD (SOFL+MSVL 2019), 2020, 12028 : 73 - 87
[26] Towards Formal Verification of Neural Networks in Cyber-Physical Systems
Rossi, Federico
Bernardeschi, Cinzia
Cococcioni, Marco
Palmieri, Maurizio
NASA FORMAL METHODS, NFM 2024, 2024, 14627 : 207 - 222
[27] Hiding Needles in a Haystack: Towards Constructing Neural Networks that Evade Verification
Berta, Arpad
Danner, Gabor
Hegedus, Istvan
Jelasity, Mark
PROCEEDINGS OF THE 2022 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH-MMSEC 2022, 2022, : 51 - 62
[28] Towards Verification of Neural Networks for Small Unmanned Aircraft Collision Avoidance
Irfan, Ahmed
Julian, Kyle D.
Wu, Haoze
Barrett, Clark
Kochenderfer, Mykel J.
Meng, Baoluo
Lopez, James
2020 AIAA/IEEE 39TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC) PROCEEDINGS, 2020,
[29] Zac: Towards Automatic Optimization and Deployment of Quantized Deep Neural Networks on Embedded Devices
Xiao, Qingcheng
Liang, Yun
2019 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN (ICCAD), 2019,
[30] Quantized Neural Modeling: Hybrid Quantized Architecture in Elman Networks
Li, Penghua
Chai, Yi
Xiong, Qingyu
NEURAL PROCESSING LETTERS, 2013, 37 (02) : 163 - 187
