50 items in total
- [32] Jily: Cost-Aware AutoScaling of Heterogeneous GPU for DNN Inference in Public Cloud. 2019 IEEE 38TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2019
- [33] Sub-Word Parallel Precision-Scalable MAC Engines for Efficient Embedded DNN Inference. 2019 IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE CIRCUITS AND SYSTEMS (AICAS 2019), 2019 : 6 - 10
- [34] Efficient Single- and Multi-DNN Inference Using TensorRT Framework. SIXTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION, ICMV 2023, 2024, 13072
- [37] On Accelerating Multi-Layered Heterogeneous Network Embedding Learning. 2018 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2018
- [39] A DNN inference acceleration algorithm combining model partition and task allocation in heterogeneous edge computing system. Peer-to-Peer Networking and Applications, 2021, 14 : 4031 - 4045
- [40] Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU. 2021 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN (ICCAD), 2021