共 50 条
- [21] GAZELLE: A Low Latency Framework for Secure Neural Network Inference PROCEEDINGS OF THE 27TH USENIX SECURITY SYMPOSIUM, 2018, : 1651 - 1668
- [22] Latency-Aware Inference on Convolutional Neural Network Over Homomorphic Encryption INFORMATION INTEGRATION AND WEB INTELLIGENCE, IIWAS 2022, 2022, 13635 : 324 - 337
- [24] Overflow Aware Quantization: Accelerating Neural Network Inference by Low-bit Multiply-Accumulate Operations PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 868 - 875
- [25] Pruning-Aware Merging for Efficient Multitask Inference KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 585 - 595
- [27] γ-Razor: Hardness-Aware Dataset Pruning for Efficient Neural Network Training IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024,
- [28] Quantization-Aware Neural Architecture Search with Hyperparameter Optimization for Industrial Predictive Maintenance Applications 2023 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION, DATE, 2023,
- [29] Pruning and Quantization Enhanced Densely Connected Neural Network for Efficient Acoustic Echo Cancellation MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2024, 2025, 2312 : 200 - 211
- [30] Quantization-Aware Interval Bound Propagation for Training Certifiably Robust Quantized Neural Networks THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 14964 - 14973