共 50 条
- [41] Vector quantization of neural networks IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (06): : 1235 - 1245
- [43] FASTEN: Fast GPU-accelerated Segmented Matrix Multiplication for Heterogeneous Graph Neural Networks PROCEEDINGS OF THE 38TH ACM INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2024, 2024, : 511 - 524
- [44] Post-Training Quantization for Energy Efficient Realization of Deep Neural Networks 2022 21ST IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, ICMLA, 2022, : 1559 - 1566
- [45] Binarized Neural Networks for Resource-Efficient Hashing with Minimizing Quantization Loss PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 1032 - 1040
- [47] Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 2704 - 2713
- [50] From Sancus to Sancusq: staleness and quantization-aware full-graph decentralized training in graph neural networks VLDB JOURNAL, 2025, 34 (02):